PulseAugur
实时 22:48:49

Tropical geometry reveals sparsity is combinatorial depth in MoE models

A new paper introduces a theoretical framework for understanding Mixture-of-Experts (MoE) models using tropical geometry. The research establishes that the routing mechanism in MoE architectures is equivalent to a specific tropical polynomial, which partitions the input space and quantifies model expressivity. This analysis reveals that sparsity in MoE models contributes to their combinatorial depth and geometric capacity, offering 'Combinatorial Resilience' against capacity collapse on low-dimensional data, unlike dense networks. AI

影响 Provides a novel geometric lens for analyzing MoE architectures, potentially guiding future model design and understanding their expressivity.

排序理由 This is a theoretical computer science paper published on arXiv. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →

Tropical geometry reveals sparsity is combinatorial depth in MoE models

报道来源 [1]

  1. arXiv cs.LG TIER_1 English(EN) · Ye Su, Huayi Tang, Zixuan Gong, Yong Liu ·

    Sparsity is Combinatorial Depth: Quantifying MoE Expressivity via Tropical Geometry

    arXiv:2602.03204v2 Announce Type: replace Abstract: While Mixture-of-Experts (MoE) architectures define the state-of-the-art, their theoretical success is often attributed to heuristic efficiency rather than geometric expressivity. In this work, we present the first analysis of M…