English(EN) GeoT2V-Bench: Benchmarking 3D Consistency in Text-to-Video Models via 3D Reconstruction

新基准评估文本到视频模型中的3D一致性

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-23 17:12

研究人员推出了GeoT2V-Bench，一个旨在评估文本到视频（T2V）模型3D一致性的新基准。该基准评估T2V模型的视频输出是否能够支持静态场景的准确3D重建。GeoT2V-Bench分析了生成视频的各个方面，包括相机运动、静态渲染错误以及灵活和静态场景拟合之间的差异，以识别标准视觉可信度检查可能遗漏的故障模式。 AI

影响该基准通过突出文本到视频模型在3D场景重建能力方面的不足，有望推动其改进。

排序理由该集群描述了一个用于评估AI模型的新基准，该基准在一篇学术论文中提出。

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CV TIER_1 English(EN) · Chenrui Fan, Paolo Favaro · 2026-06-24 04:00

GeoT2V-Bench：通过3D重建对文本到视频模型中的3D一致性进行基准测试

arXiv:2606.24829v1 Announce Type: new Abstract: Camera-prompted text-to-video (T2V) models are increasingly used to synthesize virtual camera captures, such as orbiting objects or moving through static scenes. For these outputs, visual plausibility is insufficient: the generated …
arXiv cs.CV TIER_1 English(EN) · Paolo Favaro · 2026-06-23 17:12

GeoT2V-Bench：通过3D重建对文本到视频模型中的3D一致性进行基准测试

Camera-prompted text-to-video (T2V) models are increasingly used to synthesize virtual camera captures, such as orbiting objects or moving through static scenes. For these outputs, visual plausibility is insufficient: the generated frames should also provide coherent multi-view e…

报道来源 [2]

GeoT2V-Bench：通过3D重建对文本到视频模型中的3D一致性进行基准测试

GeoT2V-Bench：通过3D重建对文本到视频模型中的3D一致性进行基准测试

相关实体

相关话题