RaccoonProcedureCall t1_j9o1gl1 wrote
Reply to comment by adt in Question for any AI enthusiasts about an obvious (?) solution to a difficult LLM problem in society by LettucePrime
Forgive me for not reading the entire post you linked, but is the plan that this watermarking would not be detectable by the general public out of concerns for “privacy”? Also, has this been implemented with ChatGPT (or do we know)?
Also, it surprises me that someone from OpenAI would acknowledge the shortcomings of their current measures for identifying AI-generated content.
wbsgrepit t1_j9qyscw wrote
The problem is that these types of watermarks, where the model layers are tweaked with a key to bend the output, are easily obliterated by double dipping: use ChatGPT to generate, then another paraphrasing LLM to rewrite. Text canaries are brittle af.
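To make the brittleness concrete, here is a toy sketch in Python. It is not OpenAI's scheme or the one from the linked post: it assumes a simplified keyed "green list" watermark that biases token choice at sampling time rather than tweaking model layers, and the key, vocabulary, and function names (`is_green`, `generate_watermarked`, `paraphrase`) are all made up for illustration. The paraphraser is stood in for by random word replacement.

```python
import hashlib
import hmac
import random

KEY = b"demo-key"  # hypothetical secret key, for illustration only


def is_green(prev: str, tok: str, key: bytes = KEY) -> bool:
    """Keyed pseudo-random coin flip: is `tok` on the green list after `prev`?"""
    digest = hmac.new(key, f"{prev}\x00{tok}".encode(), hashlib.sha256).digest()
    return digest[0] % 2 == 0


def green_fraction(tokens, key: bytes = KEY) -> float:
    """Detector: share of adjacent token pairs that land on the green list."""
    pairs = list(zip(tokens, tokens[1:]))
    if not pairs:
        return 0.0
    return sum(is_green(p, t, key) for p, t in pairs) / len(pairs)


def generate_watermarked(vocab, length, rng, key: bytes = KEY):
    """Toy 'model': draw 8 random candidates per step and prefer a green one."""
    tokens = [rng.choice(vocab)]
    while len(tokens) < length:
        candidates = rng.sample(vocab, k=8)
        green = [c for c in candidates if is_green(tokens[-1], c, key)]
        tokens.append(green[0] if green else candidates[0])
    return tokens


def paraphrase(tokens, vocab, rng, rate=0.5):
    """Stand-in for a paraphrasing LLM: replace each token with probability `rate`."""
    return [rng.choice(vocab) if rng.random() < rate else t for t in tokens]


rng = random.Random(0)
vocab = [f"w{i}" for i in range(200)]  # made-up vocabulary

marked = generate_watermarked(vocab, 200, rng)
rewritten = paraphrase(marked, vocab, rng, rate=0.5)

print("watermarked green fraction:", green_fraction(marked))
print("after rewrite green fraction:", green_fraction(rewritten))
```

Unwatermarked text should score around 0.5 with this detector, watermarked text close to 1.0, and the rewritten text drifts back toward 0.5 because every replaced token breaks the keyed pairs on both sides of it. A real paraphraser changes wording less randomly than this, so treat the numbers as illustrative only.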
RaccoonProcedureCall t1_j9rcsir wrote
Yeah, and I believe the author of that blog post acknowledges as much. I suppose being able to detect some text is better than being able to detect no text. Maybe that’s why watermarking is being pursued, but I can hardly speak for that author or for OpenAI.