Submitted by Tlaloc-Es t3_10z1jxz in MachineLearning
goj-145 t1_j83801h wrote
Reply to comment by 2blazen in [D] Is it legal to use images or videos with copyright to train a model? by Tlaloc-Es
It would have been MUCH harder to prove if they spent a day preprocessing the images first!
currentscurrents t1_j85rpol wrote
They use the open LAION 50B dataset, everybody knows what's in there.
Still, some preprocessing and deduplication would have been a good idea just for output quality.
Viewing a single comment thread. View all comments