jasondads1 t1_j0kzfdk wrote on December 17, 2022 at 1:19 PM

The made the same approach with dalle2 before charging for it.

mvujas OP t1_j0l0l1n wrote on December 17, 2022 at 1:30 PM

Does dalle2 use human feedback in any form other than labeling false positives? I haven't played much with dalle2 to be honest, but I can definitely see how they could have been collecting data for a future iteration of the model that may use reinforcement learning in some form.

rikliem t1_j0l8wa6 wrote on December 17, 2022 at 2:46 PM

When generating an image. The one you download they take it as positive feedback . My theory is that if you repeat a prompt twice or more they probably can label it as bad result. They could also use the enlarging of pictures after they are generated as additional feedback