Gemma 2B
PulseAugur coverage of Gemma 2B — every cluster mentioning Gemma 2B across labs, papers, and developer communities, ranked by signal.
3 天有情绪数据
-
语言模型中的概念几何源于词语共现
一篇新的研究论文提出了一个分布理论,解释了像“is-a”关系这样的分层概念如何在语言模型中以几何方式表示。研究表明,词语共现统计数据的谱组织自然地导致了嵌入中的分层分裂几何。这种现象在word2vec嵌入中被观察到,并且也扩展到了Gemma 2B的解嵌入,这表明复杂的概念层次结构可以从基本的统计模式中涌现,而无需专门的机制。
-
Gemma 2B local LLM proves capable in week-long wilderness test
A user tested the Gemma 2B local LLM over a week, finding it to be a "pretty damn decent" tool for the task. The evaluation suggests the model performs well in a practical, real-world scenario.
-
Small Gemma 2B model shows promise in AI alignment audits
Researchers have explored the use of a small, specialized Gemma 2B model as a judge for auditing AI alignment. This model, trained on specific code examples, demonstrated an ability to identify out-of-domain misalignmen…
-
Google I/O: Gemini 1.5 Pro, Gemma 2, and Genkit framework unveiled
Google has unveiled a suite of AI tools and models at its I/O 2024 conference, aiming to simplify AI development. The company introduced Gemini 1.5 Pro with a 2 million token context window, enabling reasoning over vast…
-
PERSA管道使用RLHF使大型语言模型反馈与教师风格保持一致
研究人员开发了PERSA,这是一种使用人类反馈强化学习(RLHF)来调整大型语言模型以生成个性化教育反馈的新方法。该方法专门针对将大型语言模型的反馈风格与特定教师的风格保持一致,同时不损害诊断准确性。通过仅更新顶部的Transformer块及其投影,PERSA增强了风格可控性,同时保持内容正确性,在代码反馈基准测试中取得了高分。
-
Researchers develop SNMF for interpretable LLM feature analysis
Researchers have developed a new method for understanding the internal workings of large language models by decomposing MLP activations. This technique, semi-nonnegative matrix factorization (SNMF), identifies interpret…
-
AI safety research probes jailbreak success and emergent misalignment in LLMs
Two new research papers explore the underlying causes of AI safety failures in large language models. One paper introduces LOCA, a method to provide local, causal explanations for why specific jailbreak prompts succeed,…
-
New research identifies 'override gap' as key failure in LLM adaptation
Researchers have identified a knowledge conflict failure in hypernetwork-based methods for adapting large language models, where accuracy drops significantly when new information contradicts pre-existing knowledge. This…
-
研究人员开发新方法来消除大型语言模型(LLM)奖励模型的偏差并改进其性能
研究人员开发了新的方法来提高用于对齐大型语言模型(LLM)的奖励模型(RM)的可靠性和可解释性。一种方法引入了因果驱动的干预技术,以在推理时减轻 RM 中的各种偏差,显示出对虚假特征的敏感性降低,而没有性能权衡。另一项开发是“reward-lens”库,它将机制可解释性工具应用于 RM,揭示线性归因并不总是能预测因果打补丁的效果。此外,一种称为时间连贯奖励建模(TCRM)的新方法将 RM 视为价值函数,从而能够进行可解释的 token…
-
Google DeepMind发布T5Gemma编码器-解码器LLM,改编自Gemma
Google DeepMind推出了T5Gemma,这是一个新的编码器-解码器大型语言模型系列,源自其现有的Gemma 2模型。这种改编技术允许灵活组合编码器和解码器的大小,从而在模型质量和推理效率之间取得更好的平衡。实验表明,T5Gemma模型在各种基准测试中的表现与同类仅解码器的Gemma模型相当或更优,在数学推理和阅读理解等任务中提供了显著的速度和准确性优势。