New GeoDisaster Benchmark Tests AI Agents in Disaster Response

By PulseAugur Editorial · [2 sources] · 2026-06-15 19:46

Researchers have introduced GeoDisaster, a new benchmark designed to evaluate and improve the capabilities of orchestrated agents in operational disaster geo-intelligence. This benchmark includes 2,921 instances across five task families, integrating diverse Earth observation and GIS data for tasks like hazard detection and damage assessment. The accompanying multi-agent framework utilizes a novel alignment technique called Role-Contract Expectation Alignment (RCEA) to enhance tool use and decision-making in disaster response scenarios. AI

IMPACT This benchmark could drive advancements in AI agent capabilities for real-world applications like disaster response and geo-intelligence.

RANK_REASON The cluster describes a new academic benchmark and associated framework for evaluating AI agents, published on arXiv.

Read on arXiv cs.CV →

paper
other

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

COVERAGE [2]

arXiv cs.MA (Multiagent) TIER_1 English(EN) · Biplab Banerjee · 2026-06-15 19:46

GeoDisaster: Benchmarking Orchestrated Agents for Operational Disaster Geo-Intelligence

Remote-sensing vision-language models (RS-VLMs) have advanced Earth-observation analysis toward visual interpretation and instruction-following, yet fall short of operational geo-intelligence, which demands tool-grounded spatial reasoning and structured, evidence-backed decisions…
arXiv cs.CV TIER_1 English(EN) · Maram Hasan, Aman Verma, Savitra Roy, Hariseetharam Gunduboina, Daksh Jain, Muhammad Haris Khan, Subhasis Chaudhuri, Biplab Banerjee · 2026-06-17 04:00

GeoDisaster: Benchmarking Orchestrated Agents for Operational Disaster Geo-Intelligence

arXiv:2606.17246v1 Announce Type: new Abstract: Remote-sensing vision-language models (RS-VLMs) have advanced Earth-observation analysis toward visual interpretation and instruction-following, yet fall short of operational geo-intelligence, which demands tool-grounded spatial rea…

COVERAGE [2]

GeoDisaster: Benchmarking Orchestrated Agents for Operational Disaster Geo-Intelligence

GeoDisaster: Benchmarking Orchestrated Agents for Operational Disaster Geo-Intelligence

RELATED ENTITIES

RELATED TOPICS