Viewing a single comment thread. View all comments

-zharai t1_irzsfgh wrote

I don't think it's possible to have a high degree of certainty, at least with diffusion models. There is too much information lost, and too much noise injected.

E.g. how can you know, with any decent certainty, which features of the image were described in the prompt? And further, if you manage to know which information in the image is specified, and which is improvised by the model, there are so many ways to describe the same information.

1