PulseAugur
EN
LIVE 02:02:31

User questions viability of dual RX 9060xt for large language models

A user on the r/LocalLLaMA subreddit is inquiring about the viability of using two RX 9060xt graphics cards, each with 16GB of VRAM, for running large language models like Qwen 3.6 27B. The user is seeking to improve generation and prefill speeds for a coding agent application, as their current laptop setup with 64GB RAM is providing only 3-4 tokens/second for generation and an unusable 50 tokens/second for prefill. AI

RANK_REASON User-generated content on Reddit asking about hardware for running LLMs, not a primary source release or significant industry event.

Read on r/LocalLLaMA →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

User questions viability of dual RX 9060xt for large language models

COVERAGE [1]

  1. r/LocalLLaMA TIER_1 English(EN) · /u/RKlehm ·

    2x RX 9060xt 16gb, is it worth it?

    <!-- SC_OFF --><div class="md"><p>I'm planning to buy 2x RX 9060xt with 16gb each to run Qwen 3.6 27B and alike. Would it be a good investment? How much tk/s should i expect in generation and prefill? I'm planning to use this as a coding agent in a large codebase.</p> <p>Currentl…