PulseAugur

Qwen3-35B-A3B

PulseAugur coverage of Qwen3-35B-A3B — every cluster mentioning Qwen3-35B-A3B across labs, papers, and developer communities, ranked by signal.

Total · 30d: 1 (1 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 0 (0 over 90d)
TIMELINE
  1. 2026-05-18 research_milestone User achieves 267 tokens/sec inference speed for Qwen3-35B-A3B using llama.cpp MTP on an RTX 5090.
SENTIMENT · 30D

1 day with sentiment data

RECENT · PAGE 1/1 · 1 TOTAL
  1. TOOL · CL_36999

    Qwen3-35B MoE model hits 267 tok/s on RTX 5090 with llama.cpp

    A user achieved 267 tokens per second for local LLM inference using Qwen3-35B-A3B with llama.cpp's Multi-Token Prediction (MTP) on an RTX 5090. This setup, running on electricity only, significantly outperformed cloud-b…