Qwen 3.6:35B
PulseAugur coverage of Qwen 3.6:35B — every cluster mentioning Qwen 3.6:35B across labs, papers, and developer communities, ranked by signal.
- 2026-05-24 product_launch Release of the uncensored Genesis APEX MTP version of the Qwen 3.6-35B model. source
3 day(s) with sentiment data
-
LLM community calls for urgent release of 80-160B parameter models
Users on the r/LocalLLaMA subreddit are expressing a strong need for new large language models (LLMs) in the 80-160 billion parameter range. Current models are either too small for users with high-capacity but slower un…
-
AI system ACIE achieves 96.5% accuracy in clinical data extraction
A new agentic retrieval-augmented generation (RAG) system called ACIE has been developed and deployed at University Medicine Essen for clinical information extraction. This system addresses limitations in standard RAG b…
-
Qwen 3.6 35B model runs on consumer hardware with 32k context
A user on Reddit shared their experience running the Qwen 3.6 35B model on a consumer-grade setup, including an RTX 3080 GPU and 32GB of RAM. They achieved a throughput of 26 tokens/second for generation and 1400 tokens…
-
Cohere releases North-Mini-Code-1.0 coding model
Cohere has released North-Mini-Code-1.0, a 30 billion parameter coding model. While its general artificial analysis score is lower than some competitors, it performs competitively in coding benchmarks. The model is avai…
-
Qwen 3.6 35B model excels with KV cache in agentic tasks
A user on r/LocalLLaMA found that the Qwen 3.6 35B model significantly outperforms the 27B version, particularly in agentic tasks, when using KV cache. This user initially favored the 27B model for its perceived intelli…
-
DDR5 Bandwidth Bottlenecks Dual-LLM Inference on AMD APUs
A developer's experiment revealed that the DDR5 bandwidth on AMD APUs significantly limits the performance of running multiple large language models simultaneously. Despite a 35-billion-parameter model like Qwen 3.6:35B…
-
Qwen 3.6-35B model released with uncensored Genesis APEX MTP version
A new, uncensored version of the Qwen 3.6-35B model, named Genesis APEX MTP, has been released. This model boasts impressive performance, handling up to 200k context without glitches and successfully managing complex, i…
-
LM Studio adds MTP Speculative Decoding for faster local LLM inference
LM Studio has updated to version 0.4.14 Build 2 (Beta), integrating MTP Speculative Decoding to accelerate local large language model inference. This feature allows for faster text generation by predicting multiple toke…
-
Qwen 3.6 27B model shows strong local coding ability
The Qwen 3.6 27B model has demonstrated impressive coding capabilities, marking it as the first local model under 100 billion parameters to perform well on Codex tasks with minimal prompting. While the Qwen 3.6 35B vari…
-
Local LLMs get speed boost with BeeLlama.cpp, Qwen 3.6, and iOS app
New developments in local LLM inference include BeeLlama.cpp, a fork of llama.cpp that significantly boosts performance and adds multimodal capabilities using techniques like DFlash and TurboQuant. Separately, the Qwen …