Researchers have developed DiagramBank, a new dataset containing over 57,000 schematic diagrams extracted from AI and ML papers hosted on OpenReview. This dataset meticulously links each diagram to its source paper's title, abstract, caption, and in-text references, providing valuable context. DiagramBank is designed to support advancements in scientific document understanding, diagram retrieval, and the creation of new benchmarks, with a reported precision of 93.67% based on a manual audit. AI
IMPACT Provides a structured resource to improve AI model understanding of scientific diagrams and their context.
RANK_REASON The cluster describes the release of a new dataset for AI/ML research, including its methodology and audit results. [lever_c_demoted from research: ic=1 ai=1.0]
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →