PulseAugur
EN
LIVE 21:12:19

Researchers develop AesRM to improve video aesthetics with expert feedback

Researchers have developed AesRM, a new family of reward models designed to improve the aesthetic quality of generated videos. This system breaks down video aesthetics into three dimensions: Visual Aesthetics, Visual Fidelity, and Visual Plausibility, with 15 specific criteria. AesRM utilizes expert feedback from a dataset of 2500 video pairs to train models that can predict preferences and generate interpretable reasoning. The models were trained through a three-stage process, including atomic aesthetic capability learning and reinforcement learning, and have shown improved performance and robustness compared to existing methods. Additionally, AesRM has been used to enhance the video generation model Wan2.2, resulting in noticeable aesthetic improvements. AI

IMPACT Introduces a new framework and models for evaluating and improving video generation aesthetics, potentially impacting content creation tools.

RANK_REASON This is a research paper detailing a new model and benchmark for video aesthetics.

Read on arXiv cs.CV →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

Researchers develop AesRM to improve video aesthetics with expert feedback

COVERAGE [2]

  1. arXiv cs.CV TIER_1 English(EN) · Yujin Han, Yujie Wei, Yefei He, Xinyu Liu, Tianle Li, Zichao Yu, Andi Han, Shiwei Zhang, Tingyu Weng, Difan Zou ·

    AesRM: Improving Video Aesthetics with Expert-Level Feedback

    arXiv:2604.28078v1 Announce Type: new Abstract: Despite rapid advances in photorealistic video generation, real-world applications such as filmmaking require video aesthetics, e.g., harmonious colors and cinematic lighting, beyond visual fidelity. Prior work on visual aesthetics …

  2. arXiv cs.CV TIER_1 English(EN) · Difan Zou ·

    AesRM: Improving Video Aesthetics with Expert-Level Feedback

    Despite rapid advances in photorealistic video generation, real-world applications such as filmmaking require video aesthetics, e.g., harmonious colors and cinematic lighting, beyond visual fidelity. Prior work on visual aesthetics largely focuses on images, often reducing aesthe…