Researchers have developed SplitQ, a new post-training quantization framework designed to improve the efficiency of large vision-language models (VLMs) on devices with limited resources. SplitQ addresses the accuracy degradation often seen in low-bit quantization by introducing a Modality-specific Outlier Channel Decoupling module to isolate modality-specific outliers and an Adaptive Cross-Modal Calibration module to correct remaining discrepancies. Experiments show SplitQ significantly outperforms existing methods across various quantization settings and datasets, preserving high performance even under challenging conditions. AI
影响 Enables more efficient deployment of advanced vision-language models on resource-constrained devices.
排序理由 The cluster contains a new academic paper detailing a novel technical approach for optimizing AI models. [lever_c_demoted from research: ic=1 ai=1.0]
- Adaptive Cross-Modal Calibration
- Large Vision-Language Models
- Modality-specific Outlier Channel Decoupling
- SplitQ
AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →