Codex CLI integrates local Gemma-4 LLM via llama.cpp on WSL2

By PulseAugur Editorial · [1 sources] · 2026-06-11 03:16

This guide details how to set up the Codex CLI to interact with a local LLM, specifically Gemma-4, using llama.cpp on Windows Subsystem for Linux (WSL2). The process involves installing Codex, configuring it to use llama.cpp as a model provider, and then running the llama.cpp server with the Gemma-4 model. The author shares specific commands and configuration file examples, including troubleshooting a context size error. AI

IMPACT Enables developers to run LLM tools locally, reducing reliance on cloud services and potentially improving privacy.

RANK_REASON The article describes a technical setup guide for integrating existing tools, not a new product or model release.

Read on dev.to — LLM tag →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Codex CLI integrates local Gemma-4 LLM via llama.cpp on WSL2

COVERAGE [1]

dev.to — LLM tag TIER_1 English(EN) · 0xkoji · 2026-06-11 03:16

Run Codex CLI with Local LLM - Gemma4 with llama.cpp on WSL2

<h2> requirements </h2> <ul> <li>llama.cpp</li> <li>nodejs (if you use <code>npm</code>)</li> </ul> <p><strong>I'm using NVIDIA GeForce RTX 3070.</strong></p> <h2> Step 1. Install codex </h2> <p>First install <code>codex</code> on WSL.<br /> If Node.js isn’t installed yet, I reco…

COVERAGE [1]

Run Codex CLI with Local LLM - Gemma4 with llama.cpp on WSL2

RELATED ENTITIES

RELATED TOPICS