WorldMark: A Unified Benchmark Suite for Interactive Video World Models

By PulseAugur Editorial · [4 sources] · 2024-02-15 08:00

OpenAI has unveiled Sora, a video generation model that can produce up to a minute of high-fidelity video. It uses a diffusion transformer architecture that operates on spacetime patches of video and image data, which lets it handle variable durations, resolutions, and aspect ratios. OpenAI frames the work as a step toward general-purpose simulators of the physical world. Separately, a new benchmark suite called WorldMark has been introduced to standardize the evaluation of interactive video world models such as Genie, YUME, HY-World, and Matrix-Game. Until now, each of these models has been evaluated on its own benchmark with private scenes and trajectories, which made fair cross-model comparison impossible.
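
As a rough illustration of the spacetime-patch idea, the sketch below cuts a video array into fixed-size blocks spanning a few frames and a small pixel window, then flattens each block into one token. It is a minimal sketch under stated assumptions, not OpenAI's implementation: the function name `spacetime_patches`, the patch sizes, and the use of raw pixels as input are all hypothetical choices made for this example.

```python
import numpy as np


def spacetime_patches(video, pt=2, ph=4, pw=4):
    """Split a (T, H, W, C) array into flattened spacetime patches.

    Illustrative only: pt/ph/pw (frames, height, width per patch) are
    assumed values, not taken from any published model.
    Returns an array of shape (num_patches, pt * ph * pw * C).
    """
    t, h, w, c = video.shape
    if t % pt or h % ph or w % pw:
        raise ValueError("video dimensions must be divisible by the patch size")
    # Expose the patch grid and the within-patch offsets as separate axes.
    x = video.reshape(t // pt, pt, h // ph, ph, w // pw, pw, c)
    # Bring the grid axes to the front, the within-patch axes to the back.
    x = x.transpose(0, 2, 4, 1, 3, 5, 6)
    # One row per patch: a sequence of tokens a transformer could consume.
    return x.reshape(-1, pt * ph * pw * c)


# Two clips with different durations, resolutions, and aspect ratios.
clip_a = np.random.rand(8, 32, 48, 3).astype(np.float32)
clip_b = np.random.rand(4, 16, 16, 3).astype(np.float32)

tokens_a = spacetime_patches(clip_a)
tokens_b = spacetime_patches(clip_b)
print(tokens_a.shape, tokens_b.shape)
```

Both clips yield tokens of the same width but sequences of different lengths, which is one way to see how a patch-based model can accept inputs of varying size.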

AI-generated summary · Google Gemini · from 4 sources.

COVERAGE [4]

OpenAI News TIER_1 English(EN) · 2024-02-15 08:00

Video generation models as world simulators

We explore large-scale training of generative models on video data. Specifically, we train text-conditional diffusion models jointly on videos and images of variable durations, resolutions and aspect ratios. We leverage a transformer architecture that operates on spacetime patche…

Hugging Face Daily Papers TIER_1 English(EN) · 2026-04-23 13:50

WorldMark: A Unified Benchmark Suite for Interactive Video World Models

Interactive video generation models such as Genie, YUME, HY-World, and Matrix-Game are advancing rapidly, yet every model is evaluated on its own benchmark with private scenes and trajectories, making fair cross-model comparison impossible. Existing public benchmarks offer useful…

Synced Review TIER_1 English(EN) · Synced · 2025-05-28 09:31

Adobe Research Unlocking Long-Term Memory in Video World Models with State-Space Models

By combining State-Space Models (SSMs) for efficient long-range dependency modeling with dense local attention for coherence, and using training strategies like diffusion forcing and frame local attention, researchers from Adobe Research successfully overcome the long-standing…

arXiv cs.CV TIER_1 English(EN) · Yongtao Ge · 2026-04-23 13:50

WorldMark: A Unified Benchmark Suite for Interactive Video World Models

Interactive video generation models such as Genie, YUME, HY-World, and Matrix-Game are advancing rapidly, yet every model is evaluated on its own benchmark with private scenes and trajectories, making fair cross-model comparison impossible. Existing public benchmarks offer useful…

