PulseAugur
EN
LIVE 21:08:57

Gemma 4 12B tool-calling bugs fixed with custom chat template

A Reddit user has shared a workaround for issues with Google's Gemma 4 12B model, specifically concerning its tool-calling capabilities. The problem, which caused failures in coding tasks and harnesses like OpenCode, can be resolved by using a specific chat template file. This fix, available via a GitHub gist, requires compiling llama.cpp from source and then running the model with the custom template file, enabling a more accurate evaluation of its coding abilities. AI

IMPACT Enables more accurate evaluation of Gemma 4 12B's coding capabilities by fixing tool-calling bugs.

RANK_REASON User-shared workaround for an existing model's functionality.

Read on r/LocalLLaMA →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. r/LocalLLaMA TIER_1 English(EN) · /u/boutell ·

    PSA: Gemma 4 12B is NOT completely broken for coding and tool calling, you need a special chat template

    <!-- SC_OFF --><div class="md"><p>This is a PSA for people like me who tried it and hit the wall with tool calls failing left and right, so much so that harnesses like OpenCode just didn't work:</p> <p>There is a fix for that. You need to pass a better chat template file, <a href…