PulseAugur

Llamion language models transform Orion-14B into Llama architecture

Researchers have introduced Llamion, a family of 14B-parameter open-weight language models created by transforming Orion-14B into the standardized Llama-family architecture. The transformation uses a recipe called Efficient Knowledge Preservation for Transformation (KEPT), which combines parameter mapping with cross-architecture knowledge distillation to preserve Orion's behavior. Llamion models reportedly exceed existing entries on benchmarks such as KoMMLU and retain Orion capabilities such as Python programming and handling a 200K-token context.
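To make the distillation half of that recipe concrete, here is a minimal sketch of a generic knowledge-distillation loss in plain Python. It is an illustration under stated assumptions, not the KEPT objective: the summary does not say which loss the authors use, so the temperature-softened KL divergence, the function names `log_softmax` and `distillation_loss`, and the default temperature are all hypothetical choices made here for clarity.

```python
import math

def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax over a list of logits."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    log_z = peak + math.log(sum(math.exp(x - peak) for x in scaled))
    return [x - log_z for x in scaled]

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) over temperature-softened next-token distributions.

    Assumption: a generic distillation loss, not the loss from the Llamion report.
    The T^2 factor is the usual scaling that keeps gradient magnitudes comparable
    across temperatures.
    """
    log_p = log_softmax(teacher_logits, temperature)  # teacher, e.g. the source model
    log_q = log_softmax(student_logits, temperature)  # student, e.g. the converted model
    kl = sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))
    return kl * temperature ** 2

# Hypothetical logits over a three-token vocabulary.
matching = distillation_loss([2.0, 1.0, 0.1], [2.0, 1.0, 0.1])
diverging = distillation_loss([2.0, 1.0, 0.1], [0.1, 1.0, 2.0])
print(matching, diverging)
```

The loss is zero when the student reproduces the teacher's distribution and grows as the two diverge, which is the sense in which distillation can "preserve behavior" after an architecture change.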

IMPACT Introduces a method for efficiently transforming existing LLMs into new architectures, potentially enabling broader adoption and customization.

RANK_REASON The cluster describes a new research paper detailing the creation and performance of a new language model family.


AI-generated summary · Google Gemini · from 2 sources.


COVERAGE [2]

  1. arXiv cs.CL TIER_1 Español(ES) · Kisu Yang, Yoonna Jang, Hyeonseok Moon, Hwanseok Jang, Taewoo Lee, Hyungjin Lee, Jeseung Lee, Juhyoung Park, Heuiseok Lim

    Llamion Technical Report

    arXiv:2605.25676v1. Abstract: We release Llamion, a family of 14B-parameter open-weight language models obtained by transforming Orion-14B into the standardized Llama-family architecture. The transformation is performed by Efficient Knowledge Preservation for Tr…

  2. arXiv cs.CL TIER_1 Español(ES) · Heuiseok Lim

    Llamion Technical Report

    We release Llamion, a family of 14B-parameter open-weight language models obtained by transforming Orion-14B into the standardized Llama-family architecture. The transformation is performed by Efficient Knowledge Preservation for Transformation (KEPT), a recipe that combines (i) …