Viewing a single comment thread. View all comments

rshah4 t1_jbtsl7o wrote

Also, not sure about a recent comparison, but Nils Reimers also tried to empirically analyze OpenAI's embeddings here: https://twitter.com/Nils_Reimers/status/1487014195568775173

He found across 14 datasets that the OpenAI 175B model is actually worse than a tiny MiniLM 22M parameter model that can run in your browser.

8

Non-jabroni_redditor t1_jbu2shx wrote

That’s to be expected, no? No model is going to be perfect regardless of how it performs on a set (of datasets) as a whole

1

JClub t1_jbwu3lx wrote

more than that, GPT is unidirectional, which is really not great a sentence embedder

1