PulseAugur
实时 12:52:50
English(EN) Pantheon360: Taming Digital Twin Generation via 3D-Aware 360° Video Diffusion

新AI模型Paris 2.0和Pantheon360推动视频生成技术发展

研究人员推出了Paris 2.0,一个利用去中心化计算进行训练的新型视频生成模型。该模型建立在Paris 1.0的原理之上,通过将视频生成质量的Frechet Video Distance (FVD)相比于单体训练方法降低约一半,显著提高了视频生成质量。另外,Pantheon360框架通过整合3D感知扩散和几何缓存系统,确保时空一致性,从而为数字孪生应用生成高保真360°视频。 AI

影响 这些在去中心化和3D感知视频生成方面的进展,可能带来更高效的大模型训练以及数字孪生应用中更逼真的效果。

排序理由 该集群包含两篇详细介绍新型AI视频生成模型的独立研究论文。

在 Hugging Face Daily Papers 阅读 →

AI 生成摘要 · Google Gemini · 来自 4 个来源。 我们如何撰写摘要 →

报道来源 [4]

  1. arXiv cs.LG TIER_1 English(EN) · Ali Rouzbayani, Bidhan Roy, Marcos Villagra, Zhiying Jiang ·

    Paris 2.0: A Decentralized Diffusion Model for Video Generation

    arXiv:2605.26064v1 Announce Type: cross Abstract: We present Paris 2.0, the first video generation model pre-trained through decentralized computation. Its training recipe builds upon Paris 1.0 (arXiv:2510.03434), the first ever open-weight Decentralized Diffusion Model (DDM), wh…

  2. arXiv cs.LG TIER_1 English(EN) · Zhiying Jiang ·

    Paris 2.0: A Decentralized Diffusion Model for Video Generation

    We present Paris 2.0, the first video generation model pre-trained through decentralized computation. Its training recipe builds upon Paris 1.0 (arXiv:2510.03434), the first ever open-weight Decentralized Diffusion Model (DDM), which showed that image generation can be trained wi…

  3. Hugging Face Daily Papers TIER_1 English(EN) ·

    Pantheon360: Taming Digital Twin Generation via 3D-Aware 360° Video Diffusion

    Pantheon360 enables high-fidelity 360° video generation for digital twins by combining 3D-aware diffusion with explicit geometric caching to ensure spatial-temporal consistency.

  4. arXiv cs.CV TIER_1 English(EN) · Ting-Hsuan Chen, Ying-Huan Chen, Tao Tu, Jie-Ying Lee, Cho-Ying Wu, Fangzhou Lin, Hengyuan Zhang, David Paz, Xinyu Huang, Yuliang Guo, Yu-Lun Liu, Yue Wang, Liu Ren ·

    Pantheon360: Taming Digital Twin Generation via 3D-Aware 360{\deg} Video Diffusion

    arXiv:2605.25449v1 Announce Type: new Abstract: Generating complete digital twins from videos requires precise camera control, global scene coverage, and strict spatial-temporal consistency constraints that remain challenging for perspective video generators due to their limited …