PulseAugur
实时 23:33:32

ViM-Q enables efficient Vision Mamba model inference on FPGAs

Researchers have developed ViM-Q, a novel algorithm-hardware co-design specifically for accelerating Vision Mamba (ViM) model inference on FPGAs. This approach tackles challenges in quantizing dynamic activation outliers and adapting SSM computation for FPGA architectures. ViM-Q integrates a custom 4-bit weight quantization with a hardware accelerator featuring a linear engine and a pipelined SSM engine, enabling runtime configuration for diverse ViM models. Tests on an AMD ZCU102 FPGA demonstrated significant speedup and energy efficiency gains compared to a GPU baseline for low-batch inference. AI

影响 Enables efficient deployment of Vision Mamba models on resource-constrained edge devices.

排序理由 Academic paper detailing a new algorithm-hardware co-design for model inference. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →

ViM-Q enables efficient Vision Mamba model inference on FPGAs

报道来源 [1]

  1. arXiv cs.CV TIER_1 English(EN) · Shengzhe Lyu, Yuhan She, Patrick S. Y. Hung, Ray C. C. Cheung, Weitao Xu ·

    ViM-Q: Scalable Algorithm-Hardware Co-Design for Vision Mamba Model Inference on FPGA

    arXiv:2605.01935v1 Announce Type: cross Abstract: Vision Mamba (ViM) models offer a compelling efficiency advantage over Transformers by leveraging the linear complexity of State Space Models (SSMs), yet efficiently deploying them on FPGAs remains challenging. Linear layers strug…