Submitted by Vegetable-Skill-9700 t3_121a8p4 in MachineLearning
CacheMeUp t1_jdxvq8t wrote
Reply to comment by hadaev in [D] Do we really need 100B+ parameters in a large language model? by Vegetable-Skill-9700
Perhaps the challenge is not the size of the internet (it's indeed big and easy to generate new content), but rather the uniqueness and novelty of the information. Anecdotally, looking at the first page of Google results often shows various low-informativeness webpages, where only a few sentences provide information and the rest is boilerplate, disclaimers, generic advice or plain spam.
Viewing a single comment thread. View all comments