soggy_mattress t1_jdl4zkg wrote on March 25, 2023 at 4:46 AM Reply to [D] Do we really need 100B+ parameters in a large language model? by Vegetable-Skill-9700 I think of the 100b parameter models as analogous to the first room-sized computers that were built in the 70s. Seems the pattern is to first prove a concept, no matter how inefficiently, and then optimize it as much as possible. Permalink 188
soggy_mattress t1_jdl4zkg wrote
Reply to [D] Do we really need 100B+ parameters in a large language model? by Vegetable-Skill-9700
I think of the 100b parameter models as analogous to the first room-sized computers that were built in the 70s. Seems the pattern is to first prove a concept, no matter how inefficiently, and then optimize it as much as possible.