wbsgrepit t1_j9qyscw wrote
Reply to comment by RaccoonProcedureCall in Question for any AI enthusiasts about an obvious (?) solution to a difficult LLM problem in society by LettucePrime
The problem is these types of watermarks where the model layers are tweaked with a key to bend the output are easily obliterated by double dipping -- Chatgpt to generate then another paraphrase llm to rewrite. text canaries are brittle af.
RaccoonProcedureCall t1_j9rcsir wrote
Yeah, and I believe the author of that blog post acknowledges as much. I suppose being able to detect some text is better than being able to detect no text. Maybe that’s why watermarking is being pursued, but I can hardly speak for that author or for OpenAI.
Viewing a single comment thread. View all comments