New MX-SAFE format slashes AI energy use with adaptive quantization

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-26 04:00

Researchers have introduced MX-SAFE, a novel dynamic quantization format designed to reduce computational costs in deep learning. This format enhances the existing microscaling (MX) standard by adaptively allocating bits for exponents and mantissas, supporting both training and inference with improved accuracy. The proposed MX-SAFE format demonstrated an average accuracy improvement of up to 3.55% over existing MXFP formats and achieved comparable accuracy to BF16 baselines while consuming 24.9% less energy in a dedicated accelerator. AI

影响 This new quantization format could significantly reduce the energy consumption and computational cost of training and running AI models.

排序理由 The cluster contains an academic paper detailing a new technical format for AI hardware efficiency. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Dahoon Park, Jahyun Koo, Sangwoo Hwang, Jaeha Kung · 2026-05-26 04:00

MX-SAFE: Versatile Inference- and Training-Proof Microscaling Format with On-the-Fly Exponent and Mantissa Bit Allocation

arXiv:2605.24391v1 Announce Type: cross Abstract: As the demand for deep learning grows, cost reduction through quantization has become essential for both training and inference. In 2022, the Open Compute Project (OCP) consortium standardized narrow precision formats for deep lea…

报道来源 [1]

MX-SAFE: Versatile Inference- and Training-Proof Microscaling Format with On-the-Fly Exponent and Mantissa Bit Allocation

相关实体

相关话题