Submitted by somebodyenjoy t3_zc24rg in MachineLearning
I'm trying to figure out if the various versions of YOLO, such as YOLOv7 are better than the various versions of RCNN in terms of accuracy alone if speed is not much of an issue. Let's say I'm trying to detect various objects on a 2D floor plan, and I only care about accuracy.
How would a classifier that would go square by square to find the objects perform? This may not be as efficient as the standard object detection models, but would it be more accurate if I am willing to throw as much compute power as it wants for this brute force approach?
SeucheAchat9115 t1_iyusz2k wrote
I guess on Coco the best accuracy is given by transformer networks like Swin, but I would assume your dataset is not as big as coco, therefore transformers might not generalize well.