Gemma 2-2B
PulseAugur coverage of Gemma 2-2B — every cluster mentioning Gemma 2-2B across labs, papers, and developer communities, ranked by signal.
5 day(s) with sentiment data
-
Google's Gemma 2 models achieve high performance with efficient architecture
Google's new Gemma 2 models, particularly the 27B parameter version, are demonstrating significant performance gains through architectural innovations rather than just increased size. These models utilize a hybrid atten…
-
New research identifies actionable directions to mitigate AI model misalignment
Researchers have identified a method to detect and mitigate emergent misalignment in language models by analyzing activation directions. This approach, tested across four model families including Qwen2.5-1.5B, Gemma-2-2…
-
New method improves neural network interpretability by addressing dense activations
Researchers have proposed a new method to improve the interpretability of neural networks by questioning the assumption that all activation content can be sparsely decomposed. They hypothesize that activations contain a…
-
Specialized AI judge fails to cut audit costs, offers limited help
A researcher explored using a lightweight, specialized judge model (Gemma 2-2B) to assist AI agents in identifying misalignment within audits. While the judge was consistently used by the agents, it only proved helpful …
-
Transformer residual streams show geometry of time, concentrate context
Researchers have discovered that the residual stream in transformers, often likened to working memory, exhibits a distinct geometry related to time. By analyzing the Gemma-2-2B model, they found that information persist…
-
LLM Vulnerability Detection Relies on Safety Patterns, Not Direct Signatures
Researchers have employed mechanistic interpretability to analyze how Large Language Models (LLMs) detect software vulnerabilities, focusing on the Gemma-2-2b model. Their study revealed that the model primarily identif…
-
Decision Trees Enhance LLMs for Molecular Property Prediction
Researchers have developed a new method called TreeKD to improve the accuracy of large language models (LLMs) in molecular property prediction, a crucial task in drug discovery. TreeKD works by distilling knowledge from…
-
AI models detect PCOS, eating disorders with explainability
Researchers have developed open-source language models to detect a triple burden of polycystic ovary syndrome (PCOS), body image distress, and disordered eating in social media posts. Using a dataset of 1,000 PCOS-relat…
-
New FiPS framework compresses transformer models with minimal accuracy loss
Researchers have developed a new framework called Fine-grained Parameter Sharing (FiPS) to compress large transformer models. FiPS combines cross-block parameter sharing, low-rank factorization, and sparsity within a si…
-
New protocol detects LLM provider model substitutions
A new research paper proposes a commit-open protocol to detect when hosted large language model providers substitute cheaper models for advertised ones. The protocol uses Merkle trees to commit to sparse autoencoder (SA…
-
LLM analysis method reveals training data secrets and ethical risks
Researchers have developed a method using singular value decomposition (SVD) of a large language model's weight matrix to reveal interpretable semantic subspaces. This technique, requiring minimal code and no model infe…
-
CorrSteer method enhances LLM steering using correlated sparse autoencoder features
Researchers have developed CorrSteer, a novel method for steering large language models (LLMs) during generation using features extracted from Sparse Autoencoders (SAEs). This technique correlates sample correctness wit…
-
DB-KSVD algorithm offers scalable approach to disentangling high-dimensional embedding spaces
Researchers have introduced DB-KSVD, a novel dictionary learning algorithm designed to disentangle high-dimensional embedding spaces in large transformer models. This method adapts the classic KSVD algorithm to scale ef…
-
LLM jailbreaks linked to mid-to-late layer feature vulnerabilities
Researchers have developed a method to identify specific internal features within large language models that contribute to their vulnerability to jailbreaking attacks. By analyzing the Gemma-2-2B model using the BeaverT…
-
Google Research unveils CTCL for privacy-preserving synthetic data generation
Google Research has developed a new privacy-preserving synthetic data generation algorithm called CTCL, designed for resource-constrained AI applications. Unlike previous methods that require fine-tuning large language …
-
Google releases Gemma 2 2B, ShieldGemma, and Gemma Scope
Google has announced updates to its Gemma family of models, including the release of Gemma 2 2B. This new iteration is designed for efficiency and accessibility, aiming to empower developers with powerful yet lightweigh…
-
Google unveils Simula and CTCL for advanced synthetic data generation
Google Research has introduced Simula, a framework that treats synthetic data generation as a mechanism design problem. This approach allows for fine-grained control over dataset characteristics like coverage, complexit…
-
Google DeepMind releases T5Gemma encoder-decoder LLMs adapted from Gemma
Google DeepMind has introduced T5Gemma, a new family of encoder-decoder large language models derived from their existing Gemma 2 models. This adaptation technique allows for flexible combinations of encoder and decoder…