实体 Gemma

Gemma

PulseAugur coverage of Gemma — every cluster mentioning Gemma across labs, papers, and developer communities, ranked by signal.

总计 · 30天

46

90 天内 46

发布 · 30天

0

90 天内 0

论文 · 30天

13

90 天内 13

层级分布 · 90 天

frontier release 1
significant 4
research 8
tool 29
commentary 2
meme 2

关系

情绪 · 30 天

15 天有情绪数据

最近 · 第 3/3 页 · 共 46 条

TOOL · CL_29335 · Jul 10 · 19:20

Cactus launches open-source AI engine for mobile devices

Cactus has released an open-source AI engine designed for mobile devices and wearables, prioritizing low latency and reduced RAM usage. The engine supports multimodal capabilities, including speech, vision, and language…
TOOL · CL_47694 · Apr 17 · 00:00

Together AI 发布持续 LLM 微调平台

Together AI 推出了新的微调平台，允许用户持续改进开放权重语言模型。该平台现已支持偏好优化和持续训练，使模型能够根据用户反馈和新数据进行调整。新的 Web UI 简化了该过程，允许开发人员直接从浏览器管理数据集、指定参数和监控实验。
TOOL · CL_17594 · Jan 22 · 17:40

BrowserAI 通过 WebGPU 加速实现在本地运行 LLM

BrowserAI 是一个开源项目，它允许大型语言模型直接在网页浏览器中运行，并利用 WebGPU 进行加速。这种方法确保了 100% 的隐私，因为所有处理都在本地进行，消除了服务器成本并启用了离线功能。该 SDK 支持多种引擎和流行模型，并提供文本生成、语音识别、文本转语音和音频源分离等功能。
RESEARCH · CL_01364 · Jul 31 · 00:00

Google releases Gemma 2 2B, ShieldGemma, and Gemma Scope

Google has announced updates to its Gemma family of models, including the release of Gemma 2 2B. This new iteration is designed for efficiency and accessibility, aiming to empower developers with powerful yet lightweigh…
FRONTIER RELEASE · CL_01252 · Jun 28 · 06:21

Gemma 3n 在开源生态系统中全面可用！

Google DeepMind 已全面发布 Gemma 3n，这是一款专为设备端应用设计的移动优先多模态模型。这种新架构支持图像、音频、视频和文本输入，以及文本输出，并针对效率进行了优化，提供有效参数为 2B 和 4B 的版本，模仿了传统 2B 和 4B 模型的内存占用。Gemma 3n 引入了 MatFormer 等新组件以提高灵活性，以及 Per Layer Embeddings 以提高内存效率，在多语言、数学、编码和推理方面取得…
RESEARCH · CL_01283 · Dec 5 · 00:00

研究人员揭示提高 LLM 推理速度和效率的新方法

Google Research 推出了“投机级联”（speculative cascades），一种通过将投机解码与标准级联相结合来提高大型语言模型（LLM）效率的新颖方法。这种混合方法旨在降低计算成本和推理延迟，同时不损害输出质量。通过策略性地使用较小的模型来预测 token，然后用较大的模型进行验证，投机级联与单独使用任一技术相比，提供了更好的成本-质量权衡，Gemma 和 T5 模型已证明了这一点。