Gemma-2-2B-it
PulseAugur coverage of Gemma-2-2B-it — every cluster mentioning Gemma-2-2B-it across labs, papers, and developer communities, ranked by signal.
2 天有情绪数据
-
SANA-WM 模型生成时长一分钟的 720p 视频
研究人员发布了 SANA-WM,一个能够生成时长一分钟、分辨率为 720p 的视频的开源世界模型。该扩散 Transformer 模型采用了混合线性注意力机制和双分支架构来实现精确的相机控制。该模型还包含一个两阶段生成流程,并使用精炼器来增强质量和时间一致性,它使用具有度量尺度 6-DoF 相机姿态的强大标注流程进行训练。
-
New method simplifies language model interpretability
Researchers have introduced Exemplar Partitioning (EP), a new method for mechanistic interpretability in language models that offers a more streamlined approach than existing dictionary-learning techniques like sparse a…
-
New methods enhance LLM control without sacrificing performance or reasoning
Researchers have developed new methods for steering large language model (LLM) behaviors at inference time without sacrificing generation quality. One approach, Prompt-only SV (PrOSV), intervenes only on prompt tokens, …
-
New methods enhance sparse autoencoder interpretability and stability
Researchers have developed new methods to address limitations in sparse autoencoders (SAEs), which are used to interpret the internal representations of large language models. One paper introduces adaptive elastic net S…