Qwen 3.5 35B model runs at 10.33 t/s on $300 laptop

By PulseAugur Editorial · [1 sources] · 2026-05-27 19:26

A user on Reddit's r/LocalLLaMA subreddit has detailed their experience running the Qwen 3.5 35B model on a budget laptop. They achieved an inference speed of 10.33 tokens per second on a $300 Lenovo Ideapad Slim 3i with 40GB of RAM. The setup involved specific optimizations and the use of the ik_llama.cpp inference backend. AI

IMPACT Demonstrates that powerful LLMs can be run on low-cost hardware, potentially increasing accessibility for AI enthusiasts.

RANK_REASON User-generated post detailing the performance of a specific model on consumer hardware.

Read on r/LocalLLaMA →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Qwen 3.5 35B model runs at 10.33 t/s on $300 laptop

COVERAGE [1]

r/LocalLLaMA TIER_1 English(EN) · /u/OcelotOk8071 · 2026-05-27 19:26

Inferencing at 10.33 t/s on Qwen 3.5 35B on a $300 laptop

<table> <tr><td> <a href="https://www.reddit.com/r/LocalLLaMA/comments/1tpfw50/inferencing_at_1033_ts_on_qwen_35_35b_on_a_300/"> <img alt="Inferencing at 10.33 t/s on Qwen 3.5 35B on a $300 laptop" src="https://preview.redd.it/u8062juegq3h1.png?width=140&height=75&auto=we…

COVERAGE [1]

Inferencing at 10.33 t/s on Qwen 3.5 35B on a $300 laptop

RELATED ENTITIES

RELATED TOPICS