PulseAugur
EN
LIVE 22:18:04

InfiGFusion fuses LLMs using graph-on-logits distillation

Researchers have developed InfiGFusion, a novel framework for merging heterogeneous open-source large language models. This method uses a Graph-on-Logits Distillation (GLD) loss to model semantic dependencies between tokens, which previous methods overlooked. InfiGFusion significantly improves fusion quality and stability, outperforming state-of-the-art baselines on 11 benchmarks, particularly in complex reasoning tasks. AI

IMPACT Introduces a new method for improving the performance of fused LLMs, especially in complex reasoning tasks.

RANK_REASON The cluster contains an academic paper detailing a new method for model fusion. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv cs.CL TIER_1 English(EN) · Yuanyi Wang, Zhaoyi Yan, Yiming Zhang, Qi Zhou, Yanggan Gu, Fei Wu, Hongxia Yang ·

    InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion

    arXiv:2505.13893v2 Announce Type: replace Abstract: Recent advances in large language models (LLMs) have intensified efforts to fuse heterogeneous open-source models into a unified system that inherits their complementary strengths. Existing logit-based fusion methods maintain in…