Submitted by OffswitchToggle t3_zd6r90 in deeplearning
I have a project where I need to detect the orientation of machined parts on an assembly line. I have a dataset of hundreds of thousands of correctly labeled images (i.e. "correct orientation", "rotated left", "rotated right", "upside down"). Each image is 1024x768.
I'm relatively new to DL but have done a lot of reading about CNNs. I've come across many articles that discuss the process of hyperparameter tuning. But, I haven't been able to find anything related to creating an initial CNN architecture based upon the type of problem you are trying to solve. I've seen setups as basic as "dogs vs cats" up through implementing VGG16 from scratch.
How do I choose an initial architecture that is appropriate for my problem? TIA
descript_account t1_iyzzou9 wrote
There's no such guide. Just trial and error friend.