Submitted by mvujas t3_zo5imc in MachineLearning
jasondads1 t1_j0kzfdk wrote
The made the same approach with dalle2 before charging for it.
mvujas OP t1_j0l0l1n wrote
Does dalle2 use human feedback in any form other than labeling false positives? I haven't played much with dalle2 to be honest, but I can definitely see how they could have been collecting data for a future iteration of the model that may use reinforcement learning in some form.
rikliem t1_j0l8wa6 wrote
When generating an image. The one you download they take it as positive feedback . My theory is that if you repeat a prompt twice or more they probably can label it as bad result. They could also use the enlarging of pictures after they are generated as additional feedback
Viewing a single comment thread. View all comments