Submitted by von-hust t3_11jyrfj in MachineLearning
clueless1245 t1_jb5khy8 wrote
Reply to comment by [deleted] in [R] We found nearly half a billion duplicated images on LAION-2B-en. by von-hust
You want this done in a controlled, methodical and documented manner, not earlier research which showed SD 1.5 to verbatim copy every line and minute contour of wood grain in a specific copyrighted "wooden table" background, found after training to be repeated tens of thousands of times in the input dataset (due to websites selling phone cases photoshopping phones onto it).
Viewing a single comment thread. View all comments