I've been playing with this setup for a week: Qwen-3.6-35B-MXFP8 with MoE architecture for speed, OMLX for hot/cold prompt caching, and PiAgent as a lean…
A user has spent a week experimenting with a local AI setup built around the Qwen-3.6-35B-MXFP8 model, chosen for the speed of its MoE architecture. The setup also uses OMLX for hot/cold prompt caching and PiAgent as a harness. The user was surprised by how well it works: while not yet commercial-grade, it is the first time a local model has felt genuinely usable to them for basic agentic tasks.
IMPACT: Demonstrates the increasing viability of local models for agentic tasks, potentially reducing reliance on cloud-based solutions.