fourcornerclub
fourcornerclub OP t1_iylpygx wrote
Reply to comment by no_witty_username in [Discussion] - "data sourcing will be more important than model building in the era of foundational model fine-tuning" by fourcornerclub
u/no_witty_username and yet the standard in data sourcing still seems to be "let me see what's open source, and what I can scrape from the internet, and then I'll tune the model from there". Makes no sense to me!
fourcornerclub t1_it76lgq wrote
Reply to comment by seiqooq in [D] Is it worth paying a data sourcing company to crowdsource a bespoke dataset? by quantifiedvagabond
Interesting - what was your experience like here? And what did you use? Thanks!
fourcornerclub OP t1_iylq3v5 wrote
Reply to comment by alex_lite_21 in [Discussion] - "data sourcing will be more important than model building in the era of foundational model fine-tuning" by fourcornerclub
u/alex_lite_21 yeah, the status quo of data augmentation seems to me to read like "oh, I scraped together quite a shit training set here, maybe I'll play with it to make it less shit", rather than "how do I robustly collect a highly suitable dataset from the outset and then iterate from there?"