PulseAugur
实时 20:26:38

Fully Ternary Vision Transformer Achieves High Compression for Microcontrollers

Researchers have developed FTerViT, a fully ternary Vision Transformer that compresses all weight matrices and normalization parameters. This approach significantly reduces the model's memory footprint, making it more feasible for deployment on resource-constrained devices like microcontrollers. FTerViT achieves competitive accuracy on ImageNet while offering substantial compression compared to standard floating-point models. AI

影响 Enables more efficient deployment of advanced vision models on low-power edge devices.

排序理由 The cluster contains an academic paper detailing a new model architecture and its performance on benchmarks. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →

报道来源 [1]

  1. arXiv cs.CV TIER_1 · Nadim Maamari ·

    FTerViT: Fully Ternary Vision Transformer

    Ternary Vision Transformers offer substantial model compression, however state-of-the-art methods only ternarize the encoder layers, leaving patch embeddings, LayerNorm parameters, and classifier heads in full precision. In compact models targeting resource-constrained processors…