PulseAugur
实时 14:30:49
English(EN) Channel-wise Vector Quantization

新研究推进AI模型的向量量化技术

几篇最新的研究论文探讨了AI模型向量量化技术的进展。ArcVQ-VAE引入了球形角度裕度先验,以提高图像建模中的潜在表示多样性和码本利用率。高斯VAE被用于一种无需训练的方法(Gaussian Quant)中,将其转换为VQ-VAE,性能优于现有方法。DiVeQ提供了一种使用重参数化技巧进行向量量化端到端训练的可微分方法,提高了压缩和生成任务的性能。MGVQ通过集成多维敏感度感知和梯度-Hessian融合来实现超低比特量化,专注于压缩视觉-语言模型。最后,通道式向量量化(CVQ)提出了一种新颖的图像标记范式,对特征图的每个通道进行量化,从而提高了重建和文本到图像生成的效果。另一篇论文详细介绍了使用pgvector驱动的向量搜索系统的编码指南,展示了其在各种AI应用中的实用性。 AI

影响 向量量化的这些进展可能带来更高效的AI模型,特别是在资源受限设备上的部署以及加速LLM解码方面。

排序理由 多篇arXiv论文详细介绍了AI模型向量量化技术的新研究。

在 Hugging Face Daily Papers 阅读 →

AI 生成摘要 · Google Gemini · 来自 9 个来源。 我们如何撰写摘要 →

新研究推进AI模型的向量量化技术

报道来源 [9]

  1. arXiv cs.AI TIER_1 English(EN) · Jaeyung Kim, YoungJoon Yoo ·

    ArcVQ-VAE:具有弧余弦加性裕度的球形向量量化框架

    arXiv:2605.13517v2 Announce Type: replace-cross Abstract: Vector Quantized Variational Autoencoder (VQ-VAE) has become a fundamental framework for learning discrete representations in image modeling. However, VQ-VAE models must tokenize entire images using a finite set of codeboo…

  2. arXiv cs.LG TIER_1 English(EN) · Tongda Xu, Wendi Zheng, Jiajun He, Jose Miguel Hernandez-Lobato, Yan Wang, Ya-Qin Zhang, Jie Tang ·

    通过高斯VAE进行无训练向量量化

    arXiv:2512.06609v3 Announce Type: replace Abstract: Vector-quantized variational autoencoders (VQ-VAEs) are discrete autoencoders that compress images into discrete tokens. However, they are difficult to train due to discretization. In this paper, we propose a simple yet effectiv…

  3. arXiv cs.LG TIER_1 English(EN) · Mohammad Hassan Vali, Tom B\"ackstr\"om, Arno Solin ·

    DiVeQ:使用重参数化技巧的可微分向量量化

    arXiv:2509.26469v4 Announce Type: replace Abstract: Vector quantization is common in deep models, yet its hard assignments block gradients and hinder end-to-end training. We propose DiVeQ, which treats quantization as adding an error vector that mimics the quantization distortion…

  4. arXiv cs.LG TIER_1 English(EN) · Zhong Wang, Zukang Xu, Xing Hu, Dawei Yang ·

    MGVQ:融合多维度敏感感知与梯度-Hessian 融合用于向量量化

    arXiv:2605.24019v1 Announce Type: cross Abstract: Vision-Language Models (VLMs) achieve outstanding performance, yet their huge model size severely hinders deployment on edge devices with limited resources. As an efficient model compression technique, vector quantization (VQ) exc…

  5. arXiv cs.AI TIER_1 English(EN) · Wei Song, Tianhang Wang, Yitong Chen, Tong Zhang, Zuxuan Wu, Ming Li, Jiaqi Wang, Kaicheng Yu ·

    通道式向量量化

    arXiv:2605.26089v1 Announce Type: cross Abstract: We present Channel-wise Vector Quantization (CVQ), a novel image tokenization paradigm that replaces patch-wise tokens with channel-wise tokens. Unlike conventional vector quantization, which assigns a discrete token to each patch…

  6. arXiv cs.LG TIER_1 English(EN) · Bowen Duan, Cong Guo, Chiyue Wei, Haoxuan Shan, Yuzhe Fu, Xinhua Chen, Yifan Xu, Ziyue Zhang, Changchun Zhou, Hai Li, Yiran Chen ·

    EVA:通过高效向量量化架构加速LLM解码

    arXiv:2605.24144v1 Announce Type: cross Abstract: Large Language Models (LLMs) have achieved impressive performance across diverse domains but remain inefficient during the autoregressive decoding phase. Unlike the prefill stage, which employs compute-bound GEMM operations, decod…

  7. arXiv cs.AI TIER_1 English(EN) · Kaicheng Yu ·

    通道式向量量化

    We present Channel-wise Vector Quantization (CVQ), a novel image tokenization paradigm that replaces patch-wise tokens with channel-wise tokens. Unlike conventional vector quantization, which assigns a discrete token to each patch feature vector, CVQ quantizes each channel of the…

  8. Hugging Face Daily Papers TIER_1 English(EN) ·

    通道式向量量化

    Channel-wise Vector Quantization replaces patch-wise tokens with channel-wise tokens in image tokenization, enabling a next-channel prediction framework that generates images by sequentially refining visual details.

  9. MarkTechPost TIER_1 English(EN) · Sana Hassan ·

    实现 pgvector 驱动的语义化、混合、稀疏和量化向量搜索系统的编码指南

    <p>In this tutorial, we build a complete pgvector playground inside Google Colab and explore how PostgreSQL can work as a powerful vector database for modern AI applications. We start by installing PostgreSQL, compiling the pgvector extension, connecting through Psycopg, and regi…