OpenMythos benchmarks released, highlights Qwen 3.6 discrepancies

By PulseAugur Editorial · [2 sources] · 2026-06-23 18:56

The OpenMythos model has released its benchmarks, showcasing its performance across SWE-bench Pro, CyberGym, and cybench. While the model performs well for its size and cybersecurity focus, there's potential for further improvement. The release also highlighted discrepancies in Qwen 3.6 27B's SWE-bench results compared to official numbers, attributed to differences in evaluation harnesses and problem filtering. AI

IMPACT Provides performance data for the OpenMythos model and highlights potential issues with benchmark reporting for other models.

RANK_REASON The cluster reports on the release of benchmarks for a specific model, OpenMythos, and discusses its performance relative to other models on various benchmarks.

Read on r/LocalLLaMA →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

OpenMythos benchmarks released, highlights Qwen 3.6 discrepancies

COVERAGE [2]

r/LocalLLaMA TIER_1 English(EN) · /u/RealKingNish · 2026-06-23 19:04

OpenMythos Benchmarks

<table> <tr><td> <a href="https://www.reddit.com/r/LocalLLaMA/comments/1udq9p6/openmythos_benchmarks/"> <img alt="OpenMythos Benchmarks" src="https://preview.redd.it/p1ghh67py29h1.png?width=640&crop=smart&auto=webp&s=a7277828dcd6e5fd5d0be6dec3246ff60d63cf40" title="Op…
r/LocalLLaMA TIER_1 English(EN) · /u/RealKingNish · 2026-06-23 18:56

OpenMythos benchmarks

<table> <tr><td> <a href="https://www.reddit.com/r/LocalLLaMA/comments/1udq2ac/openmythos_benchmarks/"> <img alt="OpenMythos benchmarks" src="https://preview.redd.it/z7q7df2aw29h1.png?width=640&crop=smart&auto=webp&s=cd790d8d81e0d1f2268182c79f1eeef13b4b5b84" title="Op…

COVERAGE [2]

OpenMythos Benchmarks

OpenMythos benchmarks

RELATED ENTITIES

RELATED TOPICS