mvujas OP t1_j0l0l1n wrote on December 17, 2022 at 1:30 PM

Reply to comment by jasondads1 in [D] ChatGPT, crowdsourcing and similar examples by mvujas

Does dalle2 use human feedback in any form other than labeling false positives? I haven't played much with dalle2 to be honest, but I can definitely see how they could have been collecting data for a future iteration of the model that may use reinforcement learning in some form.

rikliem t1_j0l8wa6 wrote on December 17, 2022 at 2:46 PM

When generating an image. The one you download they take it as positive feedback . My theory is that if you repeat a prompt twice or more they probably can label it as bad result. They could also use the enlarging of pictures after they are generated as additional feedback