Submitted by AKavun t3_105na47 in deeplearning
FastestLearner t1_j3c0yju wrote
You are not using non-linearity. Yours is just a linear model. Deep CNNs thrive on non-linearity. Try adding a ReLU layer after every MaxPool. Also, for better convergence, add BN layers after each Conv. Don’t use two Linear layers (mostly redundant). Use AvgPool instead of Flatten. Replace Softmax with LogSoftmax. Set Adam lr=1e-4, decay=1e-4.
PM me if you face any more issues.
Viewing a single comment thread. View all comments