PulseAugur

Qwen 3.5 35B model runs at 10.33 t/s on $300 laptop

A user on r/LocalLLaMA reports running the Qwen 3.5 35B model on a budget laptop: a $300 Lenovo Ideapad Slim 3i with 40GB of RAM, reaching an inference speed of 10.33 tokens per second. The setup relied on specific optimizations and the ik_llama.cpp inference backend.
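
To put the reported throughput in perspective, the short Python sketch below converts tokens per second into wall-clock generation time. Only the 10.33 t/s figure comes from the post; the function name and the 500-token response length are illustrative assumptions, and prompt processing time is ignored.

```python
def generation_seconds(num_tokens: int, tokens_per_second: float) -> float:
    """Return the wall-clock seconds needed to generate num_tokens."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


REPORTED_TPS = 10.33   # throughput reported in the Reddit post
RESPONSE_TOKENS = 500  # hypothetical response length, chosen for illustration

seconds = generation_seconds(RESPONSE_TOKENS, REPORTED_TPS)
print(f"{RESPONSE_TOKENS} tokens at {REPORTED_TPS} t/s takes about {seconds:.1f} s")
```

Under these assumptions, a 500-token reply would take roughly 48 seconds to generate.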

IMPACT Demonstrates that a 35B-class LLM can run on low-cost hardware, potentially making local inference more accessible to AI enthusiasts.

RANK_REASON User-generated post detailing the performance of a specific model on consumer hardware.


AI-generated summary · Google Gemini · from 1 source.


COVERAGE [1]

  1. r/LocalLLaMA TIER_1 English(EN) · /u/OcelotOk8071

    Inferencing at 10.33 t/s on Qwen 3.5 35B on a $300 laptop

    https://www.reddit.com/r/LocalLLaMA/comments/1tpfw50/inferencing_at_1033_ts_on_qwen_35_35b_on_a_300/