ENTITY quantization

quantization

PulseAugur coverage of quantization — every cluster mentioning quantization across labs, papers, and developer communities, ranked by signal.

Total · 30d

9

9 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

6

6 over 90d

TIER MIX · 90D

research 4
tool 4
commentary 1

TOPICS

SENTIMENT · 30D

7 day(s) with sentiment data

RECENT · PAGE 1/1 · 9 TOTAL

RESEARCH · CL_109544 · Jun 24 · 07:54

Quantization of LLMs inflates reasoning token usage, researchers find

A new research paper highlights that while quantization techniques like INT4 and INT3 are effective at reducing the inference costs of large language models, they can unexpectedly inflate reasoning token usage. This phe…
TOOL · CL_106885 · Jun 23 · 11:52

Guide Explains Fine-Tuning, LoRA, and Quantization for LLMs

This article provides a practical guide to fine-tuning large language models, focusing on techniques like LoRA (Low-Rank Adaptation) and quantization. It explains how these methods can be used to adapt pre-trained model…
RESEARCH · CL_106765 · Jun 21 · 12:18

Federated learning research tackles quantization, fairness, and noise · 4 sources tracked

This cluster of research papers explores advancements in federated learning (FL), a method for distributed intelligence that preserves data privacy. One paper offers a comprehensive review of quantization techniques to …
COMMENTARY · CL_102107 · Jun 20 · 23:10

Quantization: Key Technique for Efficient LLM Deployment

Quantization is a vital technique for deploying large language models (LLMs) efficiently by converting their weights and activations from floating-point to lower-precision integer formats. This process reduces memory fo…
TOOL · CL_87068 · Jun 12 · 06:22

Local LLM Hardware Guide: VRAM, Quantization, and Performance

Running large language models (LLMs) locally, particularly those with 70 billion parameters, presents significant hardware challenges, primarily concerning VRAM capacity. While marketing often suggests minimal requireme…
RESEARCH · CL_84334 · Jun 10 · 08:11

Quantization Limits Dense Retrieval Dimension, Study Finds

A new theoretical study published on arXiv explores the limitations imposed by quantization on dense top-k retrieval systems. The research demonstrates that achieving perfect retrieval with B bits per coordinate require…
TOOL · CL_67998 · Jun 3 · 01:52

LLM quantization benchmarks may miss critical tool-call failures

A Reddit discussion on the r/LocalLLaMA subreddit questions the common practice of benchmarking quantized large language models (LLMs) solely on perplexity and prose quality. The user suggests that these metrics may not…
RESEARCH · CL_11844 · May 1 · 04:00

Survey maps dynamic neural networks for computer vision and sensor fusion

This survey paper provides a comprehensive overview of Dynamic Neural Networks (DNNs), focusing on their application in computer vision and multi-modal sensor fusion. It addresses the challenge of deploying large models…
RESEARCH · CL_103038 · Jan 27 · 18:58

New research explores multilingual LLM scaling, knowledge integration, and specialized evaluation

Researchers are developing new methods and benchmarks to improve the capabilities and evaluation of large language models (LLMs). Google DeepMind has introduced ATLAS, a framework for optimizing multilingual LLM trainin…