five

Towards Better Statistical Understanding of Watermarking LLMs

收藏
Figshare2026-01-30 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Towards_Better_Statistical_Understanding_of_Watermarking_LLMs/31211964
下载链接
链接失效反馈
官方服务:
资源简介:
In this paper, we study the problem of watermarking large language models (LLMs). We consider the trade-off between model distortion and detection ability and formulate it as a constrained optimization problem based on the red-green list watermarking algorithm. We show that the optimal solution to the optimization problem enjoys a nice analytical property which provides a better understanding and inspires the algorithm design for the watermarking process. We develop an online dual gradient ascent watermarking algorithm in light of this optimization formulation and prove its asymptotic Pareto optimality between model distortion and detection ability. Such a result guarantees an averaged increased green list probability and henceforth detection ability explicitly (in contrast to previous results). Moreover, we provide a systematic discussion on the choice of the model distortion metrics for the watermarking problem. We justify our choice of KL divergence and present issues with the existing criteria of “distortion-free” and perplexity. Finally, we empirically evaluate our algorithms on extensive datasets against benchmark algorithms.
创建时间:
2026-01-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作