Gemma 2 9B

PulseAugur coverage of Gemma 2 9B: every cluster mentioning Gemma 2 9B across labs, papers, and developer communities, ranked by signal.

Total · 30d: 14 (14 over 90d)

Releases · 30d: 0 (0 over 90d)

Papers · 30d: 12 (12 over 90d)

RELATIONSHIPS

competes with qwen2.5:7b 50%

SENTIMENT · 30D

3 days with sentiment data

RECENT · PAGE 1/1 · 14 TOTAL

TOOL · CL_113993 · Jun 27 · 21:05

Gemma 2 9B FP8 quantization shows prefill tax but faster generation

A benchmark evaluation of the self-hosted Gemma 2 9B model, particularly its FP8 quantized variant, revealed trade-offs when compared to frontier APIs. While FP8 quantization significantly increases the time to first to…

SIGNIFICANT · CL_100834 · Jun 19 · 15:02

Google's Gemma 2 models achieve high performance with efficient architecture

Google's new Gemma 2 models, particularly the 27B parameter version, are demonstrating significant performance gains through architectural innovations rather than just increased size. These models utilize a hybrid atten…

TOOL · CL_65544 · Jun 2 · 04:00

AI safety alignment fails in low-resource languages due to calibration

Researchers have found that AI models trained for safety in high-resource languages like English struggle to apply these safety measures to low-resource languages such as Swahili or Burmese. Despite the models retaining…

TOOL · CL_56171 · May 28 · 04:00

New ReSAE Method Enhances Transformer Model Interventions

Researchers have developed Residualized Sparse Autoencoders (ReSAEs) to improve multi-layer interventions in transformer models. Unlike traditional methods that train layers independently, ReSAEs account for the strong …

TOOL · CL_51194 · May 26 · 04:00

New protocol detects LLM provider model substitutions

A new research paper proposes a commit-open protocol to detect when hosted large language model providers substitute cheaper models for advertised ones. The protocol uses Merkle trees to commit to sparse autoencoder (SA…

RESEARCH · CL_51036 · May 26 · 04:00

New AI text detector READER outperforms larger models

Researchers have developed READER, a novel system for detecting AI-generated text that outperforms larger models by incorporating a reasoning-based approach. This system, fine-tuned on a curated dataset of rationales an…

RESEARCH · CL_48843 · May 21 · 21:00

New method enhances multilingual LLM control with sparse autoencoders

Researchers have developed a new method for improving multilingual language control in large language models using sparse autoencoders (SAEs). Their approach involves training SAEs on multilingual data to enhance cross-…

RESEARCH · CL_41786 · May 20 · 05:20

New RL methods tackle LLM training issues

Two new research papers introduce methods to improve the training of large language models using reinforcement learning. One paper addresses the issue of "advantage collapse" in Group Relative Policy Optimization (GRPO)…

RESEARCH · CL_29382 · May 12 · 08:39

LLMs evaluated for air traffic safety analysis

Researchers are exploring the use of large language models (LLMs) for enhancing safety in air traffic control (ATC) and around non-towered airports. One study proposes a vision-language model approach to analyze radio c…

RESEARCH · CL_27585 · May 10 · 16:23

LLMs show promise and pitfalls for mental health screening

Researchers have developed an agentic LLM framework designed for large-scale mental health screening, which uses a policy-guided evaluation system to ensure trustworthiness and adaptability in clinical settings. A separ…

TOOL · CL_22450 · May 8 · 04:00

AI safety research reveals regional LLM bias disparities

A new research paper introduces a causal analysis framework to audit Large Language Model (LLM) safety mechanisms, moving beyond observational bias measurements. The study applies Pearl's do-operator to isolate the caus…

RESEARCH · CL_09806 · Apr 29 · 16:32

New MoRFI method identifies latent directions causing LLM hallucinations

Researchers have developed MoRFI (Monotonic Sparse Autoencoder Feature Identification) to better understand how large language models hallucinate. By fine-tuning models like Llama 3.1 8B and Gemma 2 9B on new knowledge,…

RESEARCH · CL_36289 · May 28 · 00:00

LLM inference and reasoning techniques advance with new research and hardware

Researchers are exploring novel methods to enhance the efficiency and reasoning capabilities of large language models (LLMs). Google Research is developing techniques to train LLMs to reason in a Bayesian manner, improv…

RESEARCH · CL_01620 · Oct 10 · 00:00

Google DeepMind releases T5Gemma encoder-decoder LLMs adapted from Gemma

Google DeepMind has introduced T5Gemma, a new family of encoder-decoder large language models derived from their existing Gemma 2 models. This adaptation technique allows for flexible combinations of encoder and decoder…