SplitQ framework enhances low-bit quantization for vision-language models

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-19 14:49

Researchers have developed SplitQ, a new post-training quantization framework designed to improve the efficiency of large vision-language models (VLMs) on devices with limited resources. SplitQ addresses the accuracy degradation often seen in low-bit quantization by introducing a Modality-specific Outlier Channel Decoupling module to isolate modality-specific outliers and an Adaptive Cross-Modal Calibration module to correct remaining discrepancies. Experiments show SplitQ significantly outperforms existing methods across various quantization settings and datasets, preserving high performance even under challenging conditions. AI

影响 Enables more efficient deployment of advanced vision-language models on resource-constrained devices.

排序理由 The cluster contains a new academic paper detailing a novel technical approach for optimizing AI models. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Guolei Sun · 2026-05-19 14:49

Breaking Modality Heterogeneity in Low-Bit Quantization for Large Vision-Language Models

Low-bit post-training quantization (PTQ) is a pivotal technique for deploying Vision-Language Models (VLMs) on resource-constrained devices. However, existing PTQ methods often degrade VLMs' accuracy due to the heterogeneous activation distributions of text and vision modalities …

报道来源 [1]

Breaking Modality Heterogeneity in Low-Bit Quantization for Large Vision-Language Models

相关实体

相关话题