Researchers have introduced OneFocus, a novel vision-language model designed to enhance X-ray security screening capabilities. To address the scarcity of relevant training data, they developed MMXray, a benchmark dataset comprising over 52,000 image-caption pairs of X-ray contraband, alongside CleanDET and AnyContraSyn for synthetic data generation. OneFocus is built to perform multiple tasks including visual question answering, contraband localization, and classification, aiming to improve generalization and understanding in security applications. AI
IMPACT This research could lead to more effective automated contraband detection systems, improving security in logistics and transportation.
RANK_REASON The cluster describes a new research paper published on arXiv detailing a novel vision-language model and associated datasets for X-ray security screening. [lever_c_demoted from research: ic=1 ai=1.0]
- alphaXiv
- AnyContraSyn
- arXiv
- CatalyzeX
- CleanDET
- CORE Recommender
- DagsHub
- Gotit.pub
- Hugging Face
- MMXray
- OneFocus
- OnePipe
- ScienceCast
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →