PulseAugur
实时 21:28:20
English(EN) Geo-Align: Video Generation Alignment via Metric Geometry Reward

Geo-Align 框架通过 RL 增强视频重渲染

研究人员开发了 Geo-Align,一个用于相机控制视频重渲染的新型强化学习框架。该方法解决了现有方法依赖合成数据且难以泛化到真实世界视频的局限性。Geo-Align 利用了尺度感知感知奖励机制和度量 3D 估计器,以确保精确的相机轨迹提取和物理尺度的遵守,在可控性和视觉保真度方面优于监督学习基线。 AI

影响 为视频重渲染引入了一种新的强化学习方法,提高了真实世界应用的泛化能力和相机控制能力。

排序理由 该集群包含一篇详细介绍视频生成新研究框架的学术论文。

在 Hugging Face Daily Papers 阅读 →

AI 生成摘要 · Google Gemini · 来自 4 个来源。 我们如何撰写摘要 →

报道来源 [4]

  1. Hugging Face Daily Papers TIER_1 English(EN) ·

    Geo-Align:通过度量几何奖励实现视频生成对齐

    Geo-Align presents a reinforcement learning framework for camera-controlled video re-rendering that improves generalization through scale-aware perceptual rewards and metric 3D estimation for camera trajectory extraction.

  2. arXiv cs.CV TIER_1 English(EN) · Manjin Kim, Suha Kwak, Minsu Cho ·

    Tempered Self-Similarity Alignment for Physically Plausible Video Generation

    arXiv:2605.24962v1 Announce Type: new Abstract: Despite remarkable advances in video generative models, they still struggle to generate physically realistic videos, frequently exhibiting appearance drift, implausible motion, and temporal inconsistencies. In this work, we address …

  3. arXiv cs.CV TIER_1 English(EN) · Zizun Li, Haoyu Guo, Runzhe Teng, Chunhua Shen, Tong He ·

    Geo-Align:通过度量几何奖励实现视频生成对齐

    arXiv:2605.23903v1 Announce Type: new Abstract: Camera-controlled video generation has achieved remarkable progress in recent years. However, existing video-to-video re-rendering methods primarily rely on Supervised Fine-Tuning using synthetic datasets. At present, there is an ex…

  4. arXiv cs.CV TIER_1 English(EN) · Tong He ·

    Geo-Align:通过度量几何奖励实现视频生成对齐

    Camera-controlled video generation has achieved remarkable progress in recent years. However, existing video-to-video re-rendering methods primarily rely on Supervised Fine-Tuning using synthetic datasets. At present, there is an extreme scarcity of synchronized, multi-view real-…