PulseAugur

AeSlides framework uses verifiable rewards to improve LLM slide generation aesthetics

Researchers have introduced AeSlides, a reinforcement learning framework for improving the aesthetic quality of slides generated by large language models. The framework uses verifiable metrics to quantify and supervise slide layout, addressing the gap between text-centric generation and visual quality. By optimizing directly for aesthetic coherence, AeSlides improves aspect ratio compliance, reduces whitespace and element collisions, and improves overall visual balance. In evaluations, AeSlides outperforms existing methods and, in human assessments, surpasses models such as Claude-Sonnet-4.5.
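To make the idea of a "verifiable" layout metric concrete, here is a minimal Python sketch of how collisions, whitespace, and aspect ratio could be checked programmatically from element bounding boxes. It is an illustration only: the `Box` type, the function names, the grid-sampling approach, the 16:9 target, and the way the terms are combined into `layout_reward` are all assumptions made for this example, not the metrics or reward defined in the AeSlides paper.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Box:
    """Hypothetical axis-aligned bounding box of one slide element (pixels)."""
    x: float
    y: float
    w: float
    h: float


def overlap_area(a: Box, b: Box) -> float:
    """Area shared by two boxes; 0.0 if they only touch or are disjoint."""
    dx = min(a.x + a.w, b.x + b.w) - max(a.x, b.x)
    dy = min(a.y + a.h, b.y + b.h) - max(a.y, b.y)
    return dx * dy if dx > 0 and dy > 0 else 0.0


def collision_count(boxes: list[Box]) -> int:
    """Number of element pairs whose boxes overlap."""
    return sum(1 for a, b in combinations(boxes, 2) if overlap_area(a, b) > 0)


def whitespace_ratio(boxes: list[Box], slide_w: float, slide_h: float,
                     grid: int = 100) -> float:
    """Approximate fraction of the slide not covered by any element,
    estimated by sampling the centers of a grid x grid lattice of cells."""
    empty = 0
    for i in range(grid):
        for j in range(grid):
            px = (i + 0.5) * slide_w / grid
            py = (j + 0.5) * slide_h / grid
            covered = any(b.x <= px < b.x + b.w and b.y <= py < b.y + b.h
                          for b in boxes)
            if not covered:
                empty += 1
    return empty / (grid * grid)


def aspect_ratio_ok(slide_w: float, slide_h: float,
                    target: float = 16 / 9, tol: float = 0.01) -> bool:
    """Check the rendered slide against an assumed 16:9 target ratio."""
    return abs(slide_w / slide_h - target) <= tol


def layout_reward(boxes: list[Box], slide_w: float, slide_h: float) -> float:
    """Illustrative scalar reward in [0, 1]; higher means a cleaner layout.
    The combination rule here is an assumption, not the paper's formula."""
    if not aspect_ratio_ok(slide_w, slide_h):
        return 0.0
    penalty = collision_count(boxes) + whitespace_ratio(boxes, slide_w, slide_h)
    return 1.0 / (1.0 + penalty)
```

Because each term is computed deterministically from the rendered geometry, a score like this can be checked automatically rather than judged by a model, which is the property that makes a reward "verifiable" in a reinforcement learning loop.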

Impact: Enhances LLM capabilities in visual presentation generation, potentially improving tools for content creation and communication.

Ranking rationale: This is a research paper detailing a new framework for improving LLM-based slide generation.

Read on arXiv cs.CV →

AI-generated summary · Google Gemini · from 1 source. How we write summaries →


Sources [1]

  1. arXiv cs.CV · Tier 1 · English (EN) · Yiming Pan, Chengwei Hu, Xuancheng Huang, Can Huang, Mingming Zhao, Yuean Bi, Xiaohan Zhang, Aohan Zeng, Linmei Hu

    AeSlides: Incentivizing Aesthetic Layout in LLM-Based Slide Generation via Verifiable Rewards

    arXiv:2604.22840v1. Abstract: Large language models (LLMs) have demonstrated strong potential in agentic tasks, particularly in slide generation. However, slide generation poses a fundamental challenge: the generation process is text-centric, whereas its quality…