dancingnightly t1_je0o082 wrote on March 28, 2023 at 3:51 PM

Reply to comment by antonivs in [D] FOMO on the rapid pace of LLMs by 00001746

The benefit of finetuning or training your own text model (e.g. in the olden days on BERT), now through the OpenAI API vs the benefit of just using contextual semantic search is reducing day-by-day... especially with the extended context window of GPT-4.

If you want something in house, finetuning GPT-J or so could be the way to go, but it's definitely not the career direction I'd take.

antonivs t1_je1d8o0 wrote on March 28, 2023 at 6:30 PM

The training corpus size here is in the multi-TB range, so probably isn't going to work with the OpenAI API currently, from what I understand.