User combines Gemma 4 and Nvidia Parakeet for local speech translation

By PulseAugur Editorial · [1 sources] · 2026-06-06 06:04

A user has combined Google's Gemma 4 model with Nvidia's Parakeet models to create a self-hosted, local speech-to-text translation pipeline. This pipeline, implemented in approximately 300 lines of Python code, can run on a GPU or a CPU, with acceptable performance on both. The project is available on Codeberg for others to use. AI

IMPACT Enables local, self-hosted speech translation pipelines, reducing reliance on cloud services.

RANK_REASON User-created tool combining existing models.

Read on Mastodon — fosstodon.org →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] · 2026-06-06 06:04

Because I am fan of responsible # AI use and my needs, I have glued together # gemma4 and Nvidia # parakeet models into about 300 lines of # python completely s

Because I am fan of responsible # AI use and my needs, I have glued together # gemma4 and Nvidia # parakeet models into about 300 lines of # python completely self-hosted local live speech # translation pipeline (to text). It easily runs on decent GPU but also tested on Ryzen CPU…

LINKS codeberg.org/…/live_translator

COVERAGE [1]

Because I am fan of responsible # AI use and my needs, I have glued together # gemma4 and Nvidia # parakeet models into about 300 lines of # python completely s

RELATED ENTITIES

RELATED TOPICS