rshah4 t1_jbtsl7o wrote on March 11, 2023 at 5:34 PM

Also, not sure about a recent comparison, but Nils Reimers also tried to empirically analyze OpenAI's embeddings here: https://twitter.com/Nils_Reimers/status/1487014195568775173

He found across 14 datasets that the OpenAI 175B model is actually worse than a tiny MiniLM 22M parameter model that can run in your browser.

Non-jabroni_redditor t1_jbu2shx wrote on March 11, 2023 at 6:45 PM

That’s to be expected, no? No model is going to be perfect regardless of how it performs on a set (of datasets) as a whole

more than that, GPT is unidirectional, which is really not great a sentence embedder