PulseAugur
LIVE 08:24:44
research · [2 sources] ·
0
research

Researchers develop AesRM to improve video aesthetics with expert feedback

Researchers have developed AesRM, a new family of reward models designed to improve the aesthetic quality of generated videos. This system breaks down video aesthetics into three dimensions: Visual Aesthetics, Visual Fidelity, and Visual Plausibility, with 15 specific criteria. AesRM utilizes expert feedback from a dataset of 2500 video pairs to train models that can predict preferences and generate interpretable reasoning. The models were trained through a three-stage process, including atomic aesthetic capability learning and reinforcement learning, and have shown improved performance and robustness compared to existing methods. Additionally, AesRM has been used to enhance the video generation model Wan2.2, resulting in noticeable aesthetic improvements. AI

Summary written by gemini-2.5-flash-lite from 2 sources. How we write summaries →

IMPACT Introduces a new framework and models for evaluating and improving video generation aesthetics, potentially impacting content creation tools.

RANK_REASON This is a research paper detailing a new model and benchmark for video aesthetics.

Read on arXiv cs.CV →

COVERAGE [2]

  1. arXiv cs.CV TIER_1 · Yujin Han, Yujie Wei, Yefei He, Xinyu Liu, Tianle Li, Zichao Yu, Andi Han, Shiwei Zhang, Tingyu Weng, Difan Zou ·

    AesRM: Improving Video Aesthetics with Expert-Level Feedback

    arXiv:2604.28078v1 Announce Type: new Abstract: Despite rapid advances in photorealistic video generation, real-world applications such as filmmaking require video aesthetics, e.g., harmonious colors and cinematic lighting, beyond visual fidelity. Prior work on visual aesthetics …

  2. arXiv cs.CV TIER_1 · Difan Zou ·

    AesRM: Improving Video Aesthetics with Expert-Level Feedback

    Despite rapid advances in photorealistic video generation, real-world applications such as filmmaking require video aesthetics, e.g., harmonious colors and cinematic lighting, beyond visual fidelity. Prior work on visual aesthetics largely focuses on images, often reducing aesthe…