大模型水印系列文章汇总

写在前面

大家好,我是刘聪NLP。

今天早上刷到一篇大模型水印相关论文《Three Bricks to Consolidate Watermarks for Large Language Models》,发给群友们。picture.image

结果群友们竟然找到了大模型水印系列文章汇总的Github,特此分享给大家。


          
https://github.com/hzy312/Awesome-LLM-Watermark  

      

大模型水印系列论文

  • Advancing Beyond Identification: Multi-bit Watermark for Language Models

          
https://arxiv.org/abs/2308.00221  

      
  • Three Bricks to Consolidate Watermarks for Large Language Models

          
https://arxiv.org/abs/2308.00113  

      
  • Towards Codable Text Watermarking for Large Language Models

          
https://arxiv.org/abs/2307.15992  

      
  • A Private Watermark for Large Language Models

          
https://arxiv.org/abs/2307.16230  

      
  • Robust Distortion-free Watermarks for Language Models

          
https://arxiv.org/abs/2307.15593  

      
  • Watermarking Conditional Text Generation for AI Detection: Unveiling Challenges and a Semantic-Aware Watermark Remedy

          
https://arxiv.org/abs/2307.13808  

      
  • Provable Robust Watermarking for AI-Generated Text

          
https://arxiv.org/abs/2306.17439  

      
  • On the Reliability of Watermarks for Large Language Models

          
https://arxiv.org/abs/2306.04634  

      
  • Undetectable Watermarks for Language Models

          
https://arxiv.org/abs/2306.09194  

      
  • GPTs Don’t Keep Secrets: Searching for Backdoor Watermark Triggers in Autoregressive Language Models

          
https://aclanthology.org/2023.trustnlp-1.21/  

      
  • Watermarking Text Data on Large Language Models for Dataset Copyright Protection

          
https://arxiv.org/abs/2305.13257  

      
  • Baselines for Identifying Watermarked Large Language Models

          
https://arxiv.org/abs/2305.18456  

      
  • Who Wrote this Code? Watermarking for Code Generation

          
https://arxiv.org/abs/2305.15060  

      
  • Evading Watermark based Detection of AI-Generated Content

          
https://arxiv.org/abs/2305.03807  

      
  • Robust Multi-bit Natural Language Watermarking through Invariant Features

          
https://arxiv.org/abs/2305.01904  

      
  • Are You Copying My Model? Protecting the Copyright of Large Language Models for EaaS via Backdoor Watermark

          
https://arxiv.org/abs/2305.10036  

      
  • Watermarking Text Generated by Black-Box Language Models

          
https://arxiv.org/abs/2305.08883  

      
  • Protecting Language Generation Models via Invisible Watermarking

          
https://arxiv.org/abs/2302.03162  

      
  • A Watermark for Large Language Models

          
https://arxiv.org/abs/2301.10226  

      
  • Distillation-Resistant Watermarking for Model Protection in NLP

          
https://arxiv.org/abs/2210.03312  

      

总结

大模型水印技术感觉是可信AI的关键,后面估计会有一大批文章。

请多多关注知乎「刘聪NLP」,有问题的朋友也欢迎加我微信「logCong」私聊,交个朋友吧,一起学习,一起进步。我们的口号是“生命不止,学习不停”。PS:交流2群已经成立,欢迎加入。

往期推荐:

0
0
0
0
评论
未登录
暂无评论