ENTITY
optimum
PulseAugur coverage of optimum: every cluster mentioning optimum across labs, papers, and developer communities, ranked by signal.
Total · 30d: 2 (2 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 2 (2 over 90d)
TIER MIX · 90D
RECENT · 2 TOTAL
-
Hugging Face introduces advanced quantization techniques for efficient LLMs
Researchers are developing advanced quantization techniques to make large language models (LLMs) more efficient. New methods like AutoRound, LATMiX, and GSQ aim to reduce model size and computational requirements, enabl…
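The core idea behind the quantization work summarized above can be sketched in a few lines. This is a generic int8 absmax illustration, not the AutoRound, LATMiX, or GSQ method itself; the function names and the per-tensor scaling choice are assumptions for the example.

```python
import numpy as np

# Illustrative sketch of weight quantization (not a specific method from the
# article): map float32 weights to int8 with one per-tensor scale, cutting
# storage 4x at the cost of bounded rounding error.
def quantize_int8(w: np.ndarray):
    scale = np.abs(w).max() / 127.0                     # absmax scaling
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    return q.astype(np.float32) * scale

w = np.random.randn(256, 256).astype(np.float32)
q, s = quantize_int8(w)
w_hat = dequantize(q, s)
print(q.nbytes / w.nbytes)  # 0.25: int8 takes a quarter of the float32 bytes
```

Rounding error per weight stays within about half the scale, which is why absmax quantization degrades accuracy gracefully on well-behaved weight distributions.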
-
Optimizing Transformer Inference: Techniques for Faster, Cheaper Large Models
Large transformer models present significant inference challenges: a substantial memory footprint, and attention computation whose cost scales quadratically with input length. Researchers and practitioners are exploring…
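The quadratic scaling mentioned in the summary is easy to make concrete with back-of-the-envelope arithmetic. The formula below is an assumed simplification (attention score matrix only, fp16, ignoring activations and KV cache), not a figure from the article.

```python
# Rough illustration of quadratic attention cost: the score matrix alone
# holds seq_len^2 entries per head, so doubling the context quadruples it.
def attn_score_bytes(seq_len: int, n_heads: int, bytes_per_el: int = 2) -> int:
    # bytes_per_el=2 assumes fp16 scores materialized in full (no fused kernel)
    return seq_len * seq_len * n_heads * bytes_per_el

print(attn_score_bytes(2048, 32) / 2**20)                        # 256.0 MiB
print(attn_score_bytes(4096, 32) / attn_score_bytes(2048, 32))   # 4.0
```

This is why techniques like FlashAttention-style fused kernels (which avoid materializing the full score matrix) and KV-cache quantization are common levers for cheaper long-context inference.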