ENTITY Qwen 3.6:35B

Qwen 3.6:35B

PulseAugur coverage of Qwen 3.6:35B — every cluster mentioning Qwen 3.6:35B across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

10 over 90d

Releases · 30d

0 over 90d

Papers · 30d

1 over 90d

TIER MIX · 90D

research 2
tool 6
commentary 2

TOPICS

product 8
model release 7
infra 3
paper 1
other 1

TIMELINE

2026-05-24 product_launch Release of the uncensored Genesis APEX MTP version of the Qwen 3.6-35B model. source

SENTIMENT · 30D

3 day(s) with sentiment data

RECENT · PAGE 1/1 · 10 TOTAL

COMMENTARY · CL_97442 · Jun 17 · 19:55

LLM community calls for urgent release of 80-160B parameter models

Users on the r/LocalLLaMA subreddit are expressing a strong need for new large language models (LLMs) in the 80-160 billion parameter range. Current models are either too small for users with high-capacity but slower un…
RESEARCH · CL_100066 · Jun 17 · 00:00

AI system ACIE achieves 96.5% accuracy in clinical data extraction

A new agentic retrieval-augmented generation (RAG) system called ACIE has been developed and deployed at University Medicine Essen for clinical information extraction. This system addresses limitations in standard RAG b…
TOOL · CL_72216 · Jun 5 · 03:29

Qwen 3.6 35B model runs on consumer hardware with 32k context

A user on Reddit shared their experience running the Qwen 3.6 35B model on a consumer-grade setup, including an RTX 3080 GPU and 32GB of RAM. They achieved a throughput of 26 tokens/second for generation and 1400 tokens…
RESEARCH · CL_81284 · Jun 5 · 02:38

Cohere releases North-Mini-Code-1.0 coding model

Cohere has released North-Mini-Code-1.0, a 30 billion parameter coding model. While its general artificial analysis score is lower than some competitors, it performs competitively in coding benchmarks. The model is avai…
COMMENTARY · CL_71784 · Jun 4 · 19:57

Qwen 3.6 35B model excels with KV cache in agentic tasks

A user on r/LocalLLaMA found that the Qwen 3.6 35B model significantly outperforms the 27B version, particularly in agentic tasks, when using KV cache. This user initially favored the 27B model for its perceived intelli…
TOOL · CL_57479 · May 28 · 15:43

DDR5 Bandwidth Bottlenecks Dual-LLM Inference on AMD APUs

A developer's experiment revealed that the DDR5 bandwidth on AMD APUs significantly limits the performance of running multiple large language models simultaneously. Despite a 35-billion-parameter model like Qwen 3.6:35B…
TOOL · CL_48203 · May 24 · 06:08

Qwen 3.6-35B model released with uncensored Genesis APEX MTP version

A new, uncensored version of the Qwen 3.6-35B model, named Genesis APEX MTP, has been released. This model boasts impressive performance, handling up to 200k context without glitches and successfully managing complex, i…
TOOL · CL_40625 · May 20 · 11:53

LM Studio adds MTP Speculative Decoding for faster local LLM inference

LM Studio has updated to version 0.4.14 Build 2 (Beta), integrating MTP Speculative Decoding to accelerate local large language model inference. This feature allows for faster text generation by predicting multiple toke…
TOOL · CL_34491 · May 16 · 12:41

Qwen 3.6 27B model shows strong local coding ability

The Qwen 3.6 27B model has demonstrated impressive coding capabilities, marking it as the first local model under 100 billion parameters to perform well on Codex tasks with minimal prompting. While the Qwen 3.6 35B vari…
TOOL · CL_24527 · May 9 · 21:33

Local LLMs get speed boost with BeeLlama.cpp, Qwen 3.6, and iOS app

New developments in local LLM inference include BeeLlama.cpp, a fork of llama.cpp that significantly boosts performance and adds multimodal capabilities using techniques like DFlash and TurboQuant. Separately, the Qwen …

LLM community calls for urgent release of 80-160B parameter models

AI system ACIE achieves 96.5% accuracy in clinical data extraction

Qwen 3.6 35B model runs on consumer hardware with 32k context

Cohere releases North-Mini-Code-1.0 coding model

Qwen 3.6 35B model excels with KV cache in agentic tasks

DDR5 Bandwidth Bottlenecks Dual-LLM Inference on AMD APUs

Qwen 3.6-35B model released with uncensored Genesis APEX MTP version

LM Studio adds MTP Speculative Decoding for faster local LLM inference

Qwen 3.6 27B model shows strong local coding ability

Local LLMs get speed boost with BeeLlama.cpp, Qwen 3.6, and iOS app