Researchers have introduced SciRisk-Bench, a new benchmark for assessing the safety of AI models used in scientific applications (AI4Science). It evaluates how well models recognize and avoid risks across scientific disciplines and specific risk dimensions. SciRisk-Bench covers 7 disciplines, 31 subdisciplines, and 10 distinct risk dimensions, providing a more detailed analysis of AI safety in science than previous datasets.
IMPACT Enhances safety evaluations for AI models deployed in scientific research, potentially leading to more reliable and secure AI4Science applications.
RANK_REASON The cluster describes a new academic benchmark for AI safety research.
- AI4Science
- LLMs
- SciRisk-Bench
- alphaXiv
- arXiv
- CatalyzeX
- Connected Papers
- CORE Recommender
- DagsHub
- Gotit.pub
- Hugging Face
- Litmaps
- ScienceCast
- scite Smart Citations
AI-generated summary · Google Gemini · from 3 sources.