Submitted by Vegetable-Skill-9700 t3_121a8p4 in MachineLearning
Yardanico t1_jdls342 wrote
Reply to comment by wojtek15 in [D] Do we really need 100B+ parameters in a large language model? by Vegetable-Skill-9700
Yeah, I think there's a lot of overhyping going around "running ChatGPT-grade language models on consumer hardware". They can "follow" instructions the same way as ChatGPT, but obviously those models know far, far less than the ClosedAI models do, and of course they'll hallucinate much more.
Although it's not an entirely bad thing: at least the community will innovate more, so we might get something interesting in the future from this "push" :)
fiftyfourseventeen t1_jdnhbn0 wrote
OpenAI is also doing a lot of tricks behind the scenes, so it's not really fair to just type the same thing into both models and compare the outputs, because they are getting nowhere near the same prompt. LLaMA is promising, but it just needs to be properly instruction tuned.
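As a rough illustration of the "nowhere near the same prompt" point (the role markers and system text below are hypothetical stand-ins, not OpenAI's actual internals), the same user text reaches each model wrapped very differently:

```python
# Illustrative sketch: what the user types vs. what each model actually sees.

user_text = "Summarize the plot of Hamlet in two sentences."

# A raw base model like LLaMA receives just the text itself.
llama_prompt = user_text

# A ChatGPT-style deployment effectively receives a hidden system prompt
# plus chat-formatted turns. These role tokens are made up for illustration.
chatgpt_prompt = (
    "<|system|>You are a helpful assistant. Answer accurately "
    "and concisely.<|end|>\n"
    f"<|user|>{user_text}<|end|>\n"
    "<|assistant|>"
)

print(llama_prompt)
print(chatgpt_prompt)
```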
devl82 t1_jdmf6b3 wrote
No, there is no overhype; you just don't understand what Alpaca is trying to do, and I'm sure others will reply similarly.
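For context, Alpaca instruction-tunes LLaMA on instruction/response pairs wrapped in a fixed template. A minimal sketch, using the no-input prompt template published in the Stanford Alpaca repo:

```python
# The fixed prompt template Alpaca trains on (no-input variant, from the
# Stanford Alpaca repo). At inference time the same wrapper is applied,
# which is what makes the fine-tuned model follow instructions.

ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Response:\n"
)

# Hypothetical example instruction, just to show the formatting.
prompt = ALPACA_TEMPLATE.format(
    instruction="Explain why instruction tuning changes model behavior."
)
print(prompt)
```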