Submitted by President_Xi_ t3_113tuwb in MachineLearning
Is there a blog post or a paper comparing open source / open weights models? I know flant t5 is really good at instruction following, but I am specifically refering to performance after finetuning. Preferably it compares models from somewhere around 1b to 11b parameters.
borisfin t1_j8upatv wrote
There is some interesting comparisons found in the flan t5 paper. Checkout the paper "Scaling Instruction-Finetuned Language Models". Hope this helps.