__lawless t1_j76vq7h wrote
Just finished reading. Although imho it's not a very fair comparison with GPT, it's still super impressive.
jaqws t1_j76wfb1 wrote
Why do you say it isn't a fair comparison?
__lawless t1_j76xpgk wrote
Just 2 points: a) They fine-tuned this model to death, whereas GPT-3.5 had only a handful of examples to fine-tune on. b) This is a multimodal model that consumes the image directly, whereas GPT can only consume text, so they fed it a caption of the image.
jaqws t1_j76zkhu wrote
Ah, yeah I would agree that's not a fair comparison. Thanks for sharing.
kermunnist t1_j9kpp3s wrote
I wonder how Flamingo would compare.