PulseAugur
EN
LIVE 19:32:21

Gemma 4 E2B variants show improved safety, some boost reasoning

A comprehensive analysis of 13 modified versions of Google's Gemma 4 E2B model revealed that while all variants significantly improved safety by increasing the refusal rate, some also enhanced reasoning capabilities. Specifically, two variants, coder3101 and llmfan46, outperformed the base model on the GSM8K math benchmark. However, more aggressive modifications led to a notable decrease in language modeling performance and reasoning efficiency, with some variants showing significantly higher perplexity and empty responses. AI

IMPACT Demonstrates that model fine-tuning can improve specific capabilities like safety and reasoning, but aggressive methods risk degrading core performance.

RANK_REASON Analysis of multiple fine-tuned variants of an existing open-source model. [lever_c_demoted from research: ic=1 ai=1.0]

Read on r/LocalLLaMA →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. r/LocalLLaMA TIER_1 English(EN) · /u/nathandreamfast ·

    13 abliterated Gemma 4 E2B variants, 44 GPU hours, Benchmark and Comparison - Abliterlitics

    <!-- SC_OFF --><div class="md"><p>I compared 13 abliterated variants of Gemma 4 E2B across weight analysis, KL divergence, HarmBench safety, and 8 benchmark tasks. 44 GPU hours on a single RTX 5090. Here is what actually works and what destroys capabilities.</p> <p>coder3101's va…