wbsgrepit t1_j9qyscw wrote on February 23, 2023 at 10:53 PM

Reply to comment by RaccoonProcedureCall in Question for any AI enthusiasts about an obvious (?) solution to a difficult LLM problem in society by LettucePrime

The problem is these types of watermarks where the model layers are tweaked with a key to bend the output are easily obliterated by double dipping -- Chatgpt to generate then another paraphrase llm to rewrite. text canaries are brittle af.

RaccoonProcedureCall t1_j9rcsir wrote on February 24, 2023 at 12:31 AM

Yeah, and I believe the author of that blog post acknowledges as much. I suppose being able to detect some text is better than being able to detect no text. Maybe that’s why watermarking is being pursued, but I can hardly speak for that author or for OpenAI.