The architecture probably isn't the problem. You only have 100 images; that's your problem.
If you can't get more labeled data, you should pretrain on unlabeled data that's as close as possible to your task - preferably other dental x-rays. Then you can finetune on your real dataset.
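Here's a rough sketch of what pretrain-then-finetune could look like in PyTorch. Lots of assumptions here: I'm using a denoising autoencoder as the pretraining objective (just one option, not the only one), I'm assuming your task is image classification, and the random tensors, the 64x64 size, `num_classes=2`, and all the function names are placeholders I made up for illustration. Swap in your real unlabeled x-rays and your 100 labeled images.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

def make_encoder() -> nn.Sequential:
    # Small conv encoder: 1x64x64 grayscale input -> 16x16x16 feature map.
    return nn.Sequential(
        nn.Conv2d(1, 8, kernel_size=3, stride=2, padding=1),
        nn.ReLU(),
        nn.Conv2d(8, 16, kernel_size=3, stride=2, padding=1),
        nn.ReLU(),
    )

def make_decoder() -> nn.Sequential:
    # Mirror of the encoder, used only during pretraining.
    return nn.Sequential(
        nn.ConvTranspose2d(16, 8, kernel_size=4, stride=2, padding=1),
        nn.ReLU(),
        nn.ConvTranspose2d(8, 1, kernel_size=4, stride=2, padding=1),
    )

def pretrain(encoder, unlabeled, epochs=2, lr=1e-3, batch_size=16):
    # Self-supervised step: reconstruct clean images from noisy copies.
    # No labels are needed here.
    decoder = make_decoder()
    params = list(encoder.parameters()) + list(decoder.parameters())
    opt = torch.optim.Adam(params, lr=lr)
    loss_fn = nn.MSELoss()
    for _ in range(epochs):
        for i in range(0, len(unlabeled), batch_size):
            clean = unlabeled[i:i + batch_size]
            noisy = clean + 0.1 * torch.randn_like(clean)
            loss = loss_fn(decoder(encoder(noisy)), clean)
            opt.zero_grad()
            loss.backward()
            opt.step()
    return encoder  # the decoder is thrown away

def finetune(encoder, images, labels, num_classes, epochs=2, lr=1e-4, batch_size=16):
    # Supervised step: reuse the pretrained encoder, add a fresh classifier head.
    model = nn.Sequential(
        encoder,
        nn.AdaptiveAvgPool2d(1),
        nn.Flatten(),
        nn.Linear(16, num_classes),
    )
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        for i in range(0, len(images), batch_size):
            logits = model(images[i:i + batch_size])
            loss = loss_fn(logits, labels[i:i + batch_size])
            opt.zero_grad()
            loss.backward()
            opt.step()
    return model

# Placeholder data (random tensors standing in for real x-rays).
unlabeled_images = torch.rand(64, 1, 64, 64)   # the unlabeled pretraining set
labeled_images = torch.rand(100, 1, 64, 64)    # the 100 labeled images
labels = torch.randint(0, 2, (100,))           # hypothetical binary labels

encoder = pretrain(make_encoder(), unlabeled_images)
model = finetune(encoder, labeled_images, labels, num_classes=2)
logits = model(labeled_images[:4])
```

In practice you'd want far more unlabeled images than labeled ones, a held-out validation split from your 100, and probably a lower learning rate for the encoder than for the new head so finetuning doesn't wipe out what pretraining learned.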