Research paper on AI model quantization withdrawn from arXiv

By PulseAugur Editorial · [1 sources] · 2026-05-26 04:00

A research paper titled "Depth Registers Unlock W4A4 on SwiGLU: A Reader/Generator Decomposition" was withdrawn from arXiv. The paper explored post-training W4A4 quantization on a 300M-parameter language model, aiming to reduce perplexity errors. It introduced a method called Depth Registers with a hinge loss, which significantly improved quantization results but still left a small gap compared to FP16. AI

RANK_REASON The cluster contains a withdrawn academic paper. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

paper
other

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Research paper on AI model quantization withdrawn from arXiv

COVERAGE [1]

arXiv cs.AI TIER_1 English(EN) · Ziyang Liu · 2026-05-26 04:00

Depth Registers Unlock W4A4 on SwiGLU: A Reader/Generator Decomposition

arXiv:2604.18128v2 Announce Type: replace-cross Abstract: We study post-training W4A4 quantization in a controlled 300M-parameter SwiGLU decoder-only language model trained on 5B tokens of FineWeb-Edu, and ask which input-activation sites dominate the error. Naive round-to-neares…

COVERAGE [1]

Depth Registers Unlock W4A4 on SwiGLU: A Reader/Generator Decomposition

RELATED ENTITIES

RELATED TOPICS