The Latent Space podcast released part one of its ICML 2024 recordings, focusing on generative video, vision, and robotics. The episode features discussions on OpenAI's Sora, Google DeepMind's award-winning Genie and VideoPoet models, and the broader field of generative video world simulators. It also touches upon diffusion models and the future of video generation beyond just scaling data. AI
Summary written by None from 1 source. How we write summaries →
RANK_REASON The cluster discusses research papers and models presented at ICML 2024, including award-winning work from Google DeepMind.