bluehands t1_j2cyz5r wrote on December 31, 2022 at 9:07 AM

Reply to comment by designer1one in [R] 2022 Top Papers in AI — A Year of Generative Models by designer1one

Your list of "text-to-X" highlights for me the need for "X-to-text". Captioning is nice, but are names attached? Is meaning extracted? (It may be that I am just not aware of the state of the art.)

currentscurrents t1_j2czmdk wrote on December 31, 2022 at 9:16 AM

Basically anything you can generate, you can also classify. Most of the image generators use CLIP for guidance, so if they can generate a sad face (and they can), CLIP can tell you whether or not a face is sad.
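
A rough sketch of the scoring step behind that kind of zero-shot classification, assuming you already have an image embedding and one text embedding per candidate label from a CLIP-style model. The vectors here are made-up toy values, not real CLIP outputs; `zero_shot_classify` and its `scale` parameter are hypothetical names, and the value 100 is just an assumed temperature.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def zero_shot_classify(image_emb, label_embs, scale=100.0):
    """Softmax over scaled cosine similarities between one image
    embedding and each candidate label embedding.

    `scale` is an assumed temperature, not a value taken from any
    particular model.
    """
    labels = list(label_embs)
    logits = [scale * cosine(image_emb, label_embs[label]) for label in labels]
    # Subtract the max logit before exponentiating for numerical stability.
    top = max(logits)
    exps = [math.exp(z - top) for z in logits]
    total = sum(exps)
    return {label: e / total for label, e in zip(labels, exps)}


# Toy embeddings (illustrative only). In practice these would come from
# the image encoder and text encoder of a CLIP-style model.
toy_image = [0.9, 0.1, 0.2]
toy_labels = {
    "a photo of a sad face": [0.8, 0.2, 0.1],
    "a photo of a happy face": [0.1, 0.9, 0.3],
}

probs = zero_shot_classify(toy_image, toy_labels)
best = max(probs, key=probs.get)
print(best, probs[best])
```

The labels are just text prompts, so you can swap in whatever categories you care about without retraining anything; the names-and-meaning question then comes down to how good the embeddings are.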