Researchers have developed a new method called SSG (Sort-then-Split by Groups) to improve the effectiveness of watermarking for large language models (LLMs). Existing watermarking techniques, like the KGW scheme, struggle with low-entropy content such as code and mathematical reasoning. SSG addresses this by partitioning the vocabulary into logit-balanced subsets, which enhances watermark detectability. Experiments show SSG's effectiveness on code generation and mathematical reasoning tasks. AI
Summary written by gemini-2.5-flash-lite from 2 sources. How we write summaries →
IMPACT Improves LLM content traceability, particularly for code and mathematical outputs.
RANK_REASON Academic paper introducing a new method for LLM watermarking.