ENTITY
7b
7b
PulseAugur coverage of 7b — every cluster mentioning 7b across labs, papers, and developer communities, ranked by signal.
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
1
1 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D
2 day(s) with sentiment data
RECENT · PAGE 1/1 · 2 TOTAL
-
LLM inference speed bottlenecked by GPU memory bandwidth, not compute
This article explains that the primary bottleneck for LLM inference in production is often the model's raw speed on the GPU, rather than serving logic or network overhead. It details how LLM inference, particularly duri…
-
Tencent releases Hy-MT2 translation model for local deployment
Tencent has released Hy-MT2, a new version of its translation model, in both 1.8B and 7B parameter sizes. The open-source model is designed for local deployment, with tests exploring the impact of cache quantization. Th…