PulseAugur
EN
LIVE 23:35:26

Research paper on AI model quantization withdrawn from arXiv

A research paper titled "Depth Registers Unlock W4A4 on SwiGLU: A Reader/Generator Decomposition" was withdrawn from arXiv. The paper explored post-training W4A4 quantization on a 300M-parameter language model, aiming to reduce perplexity errors. It introduced a method called Depth Registers with a hinge loss, which significantly improved quantization results but still left a small gap compared to FP16. AI

RANK_REASON The cluster contains a withdrawn academic paper. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv cs.AI TIER_1 English(EN) · Ziyang Liu ·

    Depth Registers Unlock W4A4 on SwiGLU: A Reader/Generator Decomposition

    arXiv:2604.18128v2 Announce Type: replace-cross Abstract: We study post-training W4A4 quantization in a controlled 300M-parameter SwiGLU decoder-only language model trained on 5B tokens of FineWeb-Edu, and ask which input-activation sites dominate the error. Naive round-to-neares…