[D] Can someone explain the discrepancy between the findings of LLaMA and Chinchilla? Submitted by __Maximum__ t3_11l3as6 on March 7, 2023 at 4:06 PM in MachineLearning 18 comments 11
cztomsik t1_jbgdoar wrote on March 8, 2023 at 9:17 PM Reply to comment by currentscurrents in [D] Can someone explain the discrepancy between the findings of LLaMA and Chinchilla? by __Maximum__ but this is likely going to take forever because of LR decay, right? Permalink Parent 1
Viewing a single comment thread. View all comments