Brief

last 24h

[3/3] 222 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · arXiv cs.AI English(EN) · 4d

Cooperative Memory Paging with Keyword Bookmarks for Long-Horizon LLM Conversations

A new research paper introduces cooperative memory paging, a technique designed to help Large Language Models (LLMs) manage conversations that exceed their context window. This method replaces evicted conversation segments with concise keyword bookmarks, allowing the LLM to retrieve full content using a recall tool when necessary. Experiments on the LoCoMo benchmark demonstrated that cooperative paging outperformed other methods in answer quality across multiple LLMs, though the effectiveness was significantly impacted by the distinctiveness of the generated bookmarks. AI

IMPACT Improves LLM ability to recall information from extended conversations, potentially enhancing user experience and task completion.
TOOL · arXiv cs.AI English(EN) · 4d

Depth Registers Unlock W4A4 on SwiGLU: A Reader/Generator Decomposition

A research paper titled "Depth Registers Unlock W4A4 on SwiGLU: A Reader/Generator Decomposition" was withdrawn from arXiv. The paper explored post-training W4A4 quantization on a 300M-parameter language model, aiming to reduce perplexity errors. It introduced a method called Depth Registers with a hinge loss, which significantly improved quantization results but still left a small gap compared to FP16. AI
- arXiv
- Ziyang Liu
TOOL · arXiv cs.AI English(EN) · 4d

Committed SAE-Feature Traces for Audited-Session Substitution Detection in Hosted LLMs

A new research paper proposes a commit-open protocol to detect when hosted large language model providers substitute cheaper models for advertised ones. The protocol uses Merkle trees to commit to sparse autoencoder (SAE) feature traces of model outputs, allowing verifiers to detect such substitutions. Experiments on Qwen3-1.7B, Gemma-2-2B, and a scaled-up Gemma-2-9B demonstrated the protocol's effectiveness in rejecting various substitution attacks, outperforming existing methods like SVIP. AI

IMPACT This protocol could enhance trust in hosted LLM services by providing a verifiable mechanism against deceptive model substitutions.
- Gemma-2-2B
- Gemma-2-9B
- SVIP
- Qwen3-1.7B
- Ziyang Liu

Brief

Cooperative Memory Paging with Keyword Bookmarks for Long-Horizon LLM Conversations

Depth Registers Unlock W4A4 on SwiGLU: A Reader/Generator Decomposition

Committed SAE-Feature Traces for Audited-Session Substitution Detection in Hosted LLMs