PulseAugur
EN
LIVE 21:25:53

New MX-SAFE format slashes AI energy use with adaptive quantization

Researchers have introduced MX-SAFE, a novel dynamic quantization format designed to reduce computational costs in deep learning. This format enhances the existing microscaling (MX) standard by adaptively allocating bits for exponents and mantissas, supporting both training and inference with improved accuracy. The proposed MX-SAFE format demonstrated an average accuracy improvement of up to 3.55% over existing MXFP formats and achieved comparable accuracy to BF16 baselines while consuming 24.9% less energy in a dedicated accelerator. AI

IMPACT This new quantization format could significantly reduce the energy consumption and computational cost of training and running AI models.

RANK_REASON The cluster contains an academic paper detailing a new technical format for AI hardware efficiency. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv cs.AI TIER_1 English(EN) · Dahoon Park, Jahyun Koo, Sangwoo Hwang, Jaeha Kung ·

    MX-SAFE: Versatile Inference- and Training-Proof Microscaling Format with On-the-Fly Exponent and Mantissa Bit Allocation

    arXiv:2605.24391v1 Announce Type: cross Abstract: As the demand for deep learning grows, cost reduction through quantization has become essential for both training and inference. In 2022, the Open Compute Project (OCP) consortium standardized narrow precision formats for deep lea…