V-JEPA
PulseAugur coverage of V-JEPA — every cluster mentioning V-JEPA across labs, papers, and developer communities, ranked by signal.
5 day(s) with sentiment data
-
Drive-JEPA framework advances end-to-end autonomous driving with novel video pretraining
Researchers have introduced Drive-JEPA, a novel framework that combines Video Joint-Embedding Predictive Architecture (V-JEPA) with multimodal trajectory distillation for end-to-end autonomous driving. This approach ada…
-
New framework learns implicit 3D physics from video
Researchers have developed a self-supervised framework called Neural Voxel Dynamics that learns implicit 3D physics directly from video. This method addresses limitations in current generative video models by predicting…
-
Chinese AI startup Fysics AI launches physics-based world model
Shanghai-based Fysics AI has launched Fysiverse, a new AI world model that incorporates real-world physical laws directly into its code. This approach differs from the data-driven methods used by companies like OpenAI a…
-
New Clin-JEPA framework enables joint-embedding predictive pretraining on EHR data
Researchers have developed Clin-JEPA, a novel multi-phase co-training framework designed for joint-embedding predictive pretraining on electronic health records (EHR). This framework addresses the challenge of creating …
-
Video foundation models show emergent intuitive physics understanding
A new research paper investigates whether video foundation models possess an understanding of intuitive physics. The study probes frozen representations of models like V-JEPA, VideoMAE, and LTX-Video using benchmarks su…
-
TrAction uses sparse trajectories for efficient action recognition
Researchers have developed TrAction, a novel transformer architecture for action recognition using sparse point trajectories instead of dense video. This method aims to reduce biases found in traditional models that rel…
-
VISTA system wins Ego4D challenge with object interaction anticipation
Researchers have developed VISTA, a novel system designed for anticipating human-object interactions in egocentric videos. VISTA integrates spatial object detection with temporal context from a frozen V-JEPA 2.1 model t…
-
ZhuoYu transitions to physical AI, seeing it as a survival imperative
Zhuo Yu, a company specializing in intelligent vehicles, is shifting its focus to "physical AI," a move Vice President Yu Beibei describes as a survival imperative rather than a market trend. The company has developed a…