generatorman_ai t1_jc5vsbw wrote
Reply to comment by extopico in [R] Stanford-Alpaca 7B model (an instruction tuned version of LLaMA) performs as well as text-davinci-003 by dojoteef
T5 sits below the scale threshold at which zero-shot abilities emerge — a phase transition that GPT-3 175B crossed (and presumably LLaMA 7B has as well). Modern models with instruction tuning and human-feedback (RLHF) finetuning will not need further task-specific finetuning for most purposes.
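To illustrate that zero-shot usage, here is a minimal sketch using the Hugging Face transformers pipeline with the standard Alpaca prompt template; the checkpoint name is a placeholder, not something from the original comment — substitute any instruction-tuned model you have access to.

```python
# Minimal sketch: zero-shot instruction prompting, no task-specific finetuning.
# The model name is a placeholder (assumption), not a specific released checkpoint.
from transformers import pipeline

generator = pipeline("text-generation", model="your-org/alpaca-7b")  # placeholder

# Standard Alpaca-style instruction prompt template.
prompt = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\nSummarize the plot of Hamlet in one sentence.\n\n"
    "### Response:\n"
)

# The instruction-tuned model is queried directly, relying on its
# zero-shot instruction-following ability rather than extra finetuning.
output = generator(prompt, max_new_tokens=100, do_sample=False)
print(output[0]["generated_text"])
```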