Necessary_Ad_9800 t1_jcjge23 wrote on March 17, 2023 at 7:42 AM

Reply to comment by blueSGL in [R] Stanford-Alpaca 7B model (an instruction tuned version of LLaMA) performs as well as text-davinci-003 by dojoteef

Everyone with their own private oracle in their hands. Pretty cool tbh

blueSGL t1_jcjgsl1 wrote on March 17, 2023 at 7:48 AM

Exactly.

I'm just eager to see what fine tunes are going to be made on LLaMA now, and how model merging effects them. The combination of those two techniques has lead to some crazy advancements in the Stable Diffusion world. No idea if merging will work with LLMs as it does for diffusion models. (has anyone even tried yet?)

Necessary_Ad_9800 t1_jcjj8b6 wrote on March 17, 2023 at 8:23 AM

Interesting. However I find some merges in SD to be terrible. But I have no doubt the open source community will make something amazing