UrbanFusion: Stochastic Multimodal Fusion for Contrastive Learning of Robust Spatial Representations
Researchers have developed UrbanFusion, a novel spatial representation model designed to integrate diverse geospatial data for urban phenomenon forecasting. The model utilizes modality-specific encoders and a Transformer-based fusion module to learn unified representations from various inputs like street view imagery, remote sensing data, and points of interest. Extensive evaluations across numerous cities demonstrate UrbanFusion's superior generalization and predictive performance compared to existing GeoAI models, enabling flexible use of available modalities during both pretraining and inference. AI
IMPACT Enhances GeoAI capabilities by enabling more robust integration of diverse urban data for improved forecasting.