PulseAugur
实时 23:10:51

新的LLM水印技术旨在保护知识产权和追踪使用情况

研究人员开发了新的大型语言模型(LLM)水印方法,以保护知识产权和追踪使用情况。ArcMark是一种新技术,可以在不改变LLM输出分布或困惑度的情况下,将多个字节的信息嵌入文本中。另一种方法SAFESEAL使用密钥条件采样来保持语义保真度并检测所有权,即使在对抗性攻击下也是如此。TextSeal是第三种方法,提供本地化检测,并且可以通过模型蒸馏转移其水印信号,使其能够有效防止未经授权的使用和复制。 AI

影响 这些水印技术的进步可以更好地追踪LLM生成的内容,并防止未经授权的使用和蒸馏。

排序理由 该集群包含多篇详细介绍LLM水印技术新研究的学术论文。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 4 个来源。 我们如何撰写摘要 →

报道来源 [4]

  1. arXiv cs.AI TIER_1 English(EN) · Atefeh Gilani, Sajani Vithana, Carol Xuan Long, Oliver Kosut, Lalitha Sankar, Flavio P. Calmon ·

    ArcMark: Distortion-Free Multi-Byte LLM Watermark via Optimal Transport

    arXiv:2602.07235v2 Announce Type: replace-cross Abstract: Watermarking is an important tool for promoting the responsible use of large language models (LLMs). Existing watermarks insert a signal into generated tokens that either flags LLM-generated text (zero-bit watermarking) or…

  2. arXiv cs.CL TIER_1 English(EN) · Kieu Dang, Phung Lai, NhatHai Phan, Yelong Shen, Ruoming Jin ·

    Robust LLM Watermarking with Minimal Semantic Distortion for IP Protection

    arXiv:2605.23175v1 Announce Type: cross Abstract: Proprietary large language models (LLMs) face risks of intellectual property (IP) violation, as adversaries can replicate an LLM by collecting input-output pairs to train a surrogate model, causing financial setbacks. Watermarks o…

  3. arXiv cs.CL TIER_1 English(EN) · Tom Sander, Hongyan Chang, Tom\'a\v{s} Sou\v{c}ek, Tuan Tran, Valeriu Lacatusu, Sylvestre-Alvise Rebuffi, Alexandre Mourachko, Surya Parimi, Christophe Ropers, Rashel Moritz, Vanessa Stark, Hady Elsahar, Pierre Fernandez ·

    TextSeal: A Localized LLM Watermark for Provenance & Distillation Protection

    arXiv:2605.12456v2 Announce Type: replace-cross Abstract: We introduce TextSeal, a state-of-the-art watermark for large language models. Building on Gumbel-max sampling, TextSeal introduces dual-key generation to restore output diversity, along with entropy-weighted scoring and m…

  4. arXiv cs.CL TIER_1 English(EN) · Ruoming Jin ·

    Robust LLM Watermarking with Minimal Semantic Distortion for IP Protection

    Proprietary large language models (LLMs) face risks of intellectual property (IP) violation, as adversaries can replicate an LLM by collecting input-output pairs to train a surrogate model, causing financial setbacks. Watermarks offer a promising defense to verify ownership, but …