PulseAugur
实时 11:54:38

Researchers develop AesRM to improve video aesthetics with expert feedback

Researchers have developed AesRM, a new family of reward models designed to improve the aesthetic quality of generated videos. This system breaks down video aesthetics into three dimensions: Visual Aesthetics, Visual Fidelity, and Visual Plausibility, with 15 specific criteria. AesRM utilizes expert feedback from a dataset of 2500 video pairs to train models that can predict preferences and generate interpretable reasoning. The models were trained through a three-stage process, including atomic aesthetic capability learning and reinforcement learning, and have shown improved performance and robustness compared to existing methods. Additionally, AesRM has been used to enhance the video generation model Wan2.2, resulting in noticeable aesthetic improvements. AI

影响 Introduces a new framework and models for evaluating and improving video generation aesthetics, potentially impacting content creation tools.

排序理由 This is a research paper detailing a new model and benchmark for video aesthetics.

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

Researchers develop AesRM to improve video aesthetics with expert feedback

报道来源 [2]

  1. arXiv cs.CV TIER_1 English(EN) · Yujin Han, Yujie Wei, Yefei He, Xinyu Liu, Tianle Li, Zichao Yu, Andi Han, Shiwei Zhang, Tingyu Weng, Difan Zou ·

    AesRM: Improving Video Aesthetics with Expert-Level Feedback

    arXiv:2604.28078v1 Announce Type: new Abstract: Despite rapid advances in photorealistic video generation, real-world applications such as filmmaking require video aesthetics, e.g., harmonious colors and cinematic lighting, beyond visual fidelity. Prior work on visual aesthetics …

  2. arXiv cs.CV TIER_1 English(EN) · Difan Zou ·

    AesRM: Improving Video Aesthetics with Expert-Level Feedback

    Despite rapid advances in photorealistic video generation, real-world applications such as filmmaking require video aesthetics, e.g., harmonious colors and cinematic lighting, beyond visual fidelity. Prior work on visual aesthetics largely focuses on images, often reducing aesthe…