Submitted by mvujas t3_zo5imc in MachineLearning
mvujas OP t1_j0l0l1n wrote
Reply to comment by jasondads1 in [D] ChatGPT, crowdsourcing and similar examples by mvujas
Does DALL-E 2 use human feedback in any form other than labeling false positives? I haven't played with DALL-E 2 much, to be honest, but I can definitely see how they could have been collecting data for a future iteration of the model that may use reinforcement learning in some form.
rikliem t1_j0l8wa6 wrote
When you generate images, they take the one you download as positive feedback. My theory is that if you repeat a prompt two or more times, they can probably label the result as bad. They could also use the enlarging of pictures after they are generated as additional feedback.