Submitted by PegasusInvasion t3_yz7dig in MachineLearning
IntelligenXia t1_iwyyhyn wrote
Detectron2 ( Facebook ) or for ease of use - Yolo v5 onwards
eg : For Yolo v5 based on how complex you want your inference to run, change the yaml file .
Here coco.yaml has around 80 objects
The model on Imagenet has around 1000 classes . That should be more than enough for anything you want to do !
Here are the 80 classes from coco.yaml file
0: person 1: bicycle 2: car 3: motorcycle 4: airplane 5: bus 6: train 7: truck 8: boat 9: traffic light 10: fire hydrant 11: stop sign 12: parking meter 13: bench 14: bird 15: cat 16: dog 17: horse 18: sheep 19: cow 20: elephant 21: bear 22: zebra 23: giraffe 24: backpack 25: umbrella 26: handbag 27: tie 28: suitcase 29: frisbee 30: skis 31: snowboard 32: sports ball 33: kite 34: baseball bat 35: baseball glove 36: skateboard 37: surfboard 38: tennis racket 39: bottle 40: wine glass 41: cup 42: fork 43: knife 44: spoon 45: bowl 46: banana 47: apple 48: sandwich 49: orange 50: broccoli 51: carrot 52: hot dog 53: pizza 54: donut 55: cake 56: chair 57: couch 58: potted plant 59: bed 60: dining table 61: toilet 62: tv 63: laptop 64: mouse 65: remote 66: keyboard 67: cell phone 68: microwave 69: oven 70: toaster 71: sink 72: refrigerator 73: book 74: clock 75: vase 76: scissors 77: teddy bear 78: hair drier 79: toothbrush
​
A slight change in the inference code is to needed to write into a csv file , in your case
Photo 1 | Car |
Photo 2 | Car | Person
...
etc
Viewing a single comment thread. View all comments