Español(ES) Llamion Technical Report

Llamion 语言模型将 Orion-14B 转换为 Llama 架构

作者 PulseAugur 编辑部 · [2 个来源] · 2026-05-25 10:27

研究人员推出了一系列名为 Llamion 的新型 140 亿参数开放权重语言模型。这些模型通过一种称为高效知识保留转换（KEPT）的技术，将 Orion-14B 模型转换为 Llama 架构。该方法结合了参数映射和跨架构知识蒸馏，以保留 Orion 的行为。Llamion 模型在 KoMMLU 等基准测试中表现出色，超越了现有模型，并保留了 Python 编程和处理 200K token 上下文等能力。 AI

影响引入了一种将现有大型语言模型高效转换为新架构的方法，可能促进更广泛的应用和定制。

排序理由该集群描述了一篇关于新语言模型系列创建和性能的最新研究论文。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CL TIER_1 Español(ES) · Kisu Yang, Yoonna Jang, Hyeonseok Moon, Hwanseok Jang, Taewoo Lee, Hyungjin Lee, Jeseung Lee, Juhyoung Park, Heuiseok Lim · 2026-05-26 04:00

Llamion Technical Report

arXiv:2605.25676v1 Announce Type: new Abstract: We release Llamion, a family of 14B-parameter open-weight language models obtained by transforming Orion-14B into the standardized Llama-family architecture. The transformation is performed by Efficient Knowledge Preservation for Tr…
arXiv cs.CL TIER_1 Español(ES) · Heuiseok Lim · 2026-05-25 10:27

Llamion Technical Report

We release Llamion, a family of 14B-parameter open-weight language models obtained by transforming Orion-14B into the standardized Llama-family architecture. The transformation is performed by Efficient Knowledge Preservation for Transformation (KEPT), a recipe that combines (i) …

报道来源 [2]

Llamion Technical Report

Llamion Technical Report

相关实体

相关话题