PulseAugur
EN
LIVE 19:18:01
ENTITY quantization

quantization

PulseAugur coverage of quantization — every cluster mentioning quantization across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
9
9 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
6
6 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D

7 day(s) with sentiment data

RECENT · PAGE 1/1 · 9 TOTAL
  1. RESEARCH · CL_109544 ·

    Quantization of LLMs inflates reasoning token usage, researchers find

    A new research paper highlights that while quantization techniques like INT4 and INT3 are effective at reducing the inference costs of large language models, they can unexpectedly inflate reasoning token usage. This phe…

  2. TOOL · CL_106885 ·

    Guide Explains Fine-Tuning, LoRA, and Quantization for LLMs

    This article provides a practical guide to fine-tuning large language models, focusing on techniques like LoRA (Low-Rank Adaptation) and quantization. It explains how these methods can be used to adapt pre-trained model…

  3. RESEARCH · CL_106765 ·

    Federated learning research tackles quantization, fairness, and noise · 4 sources tracked

    This cluster of research papers explores advancements in federated learning (FL), a method for distributed intelligence that preserves data privacy. One paper offers a comprehensive review of quantization techniques to …

  4. COMMENTARY · CL_102107 ·

    Quantization: Key Technique for Efficient LLM Deployment

    Quantization is a vital technique for deploying large language models (LLMs) efficiently by converting their weights and activations from floating-point to lower-precision integer formats. This process reduces memory fo…

  5. TOOL · CL_87068 ·

    Local LLM Hardware Guide: VRAM, Quantization, and Performance

    Running large language models (LLMs) locally, particularly those with 70 billion parameters, presents significant hardware challenges, primarily concerning VRAM capacity. While marketing often suggests minimal requireme…

  6. RESEARCH · CL_84334 ·

    Quantization Limits Dense Retrieval Dimension, Study Finds

    A new theoretical study published on arXiv explores the limitations imposed by quantization on dense top-k retrieval systems. The research demonstrates that achieving perfect retrieval with B bits per coordinate require…

  7. TOOL · CL_67998 ·

    LLM quantization benchmarks may miss critical tool-call failures

    A Reddit discussion on the r/LocalLLaMA subreddit questions the common practice of benchmarking quantized large language models (LLMs) solely on perplexity and prose quality. The user suggests that these metrics may not…

  8. RESEARCH · CL_11844 ·

    Survey maps dynamic neural networks for computer vision and sensor fusion

    This survey paper provides a comprehensive overview of Dynamic Neural Networks (DNNs), focusing on their application in computer vision and multi-modal sensor fusion. It addresses the challenge of deploying large models…

  9. RESEARCH · CL_103038 ·

    New research explores multilingual LLM scaling, knowledge integration, and specialized evaluation

    Researchers are developing new methods and benchmarks to improve the capabilities and evaluation of large language models (LLMs). Google DeepMind has introduced ATLAS, a framework for optimizing multilingual LLM trainin…