Brief

last 24h

[2/2] 222 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · arXiv cs.CV English(EN) · 5d

Revisiting Integration of Image and Metadata for DICOM Series Classification: Cross-Attention and Dictionary Learning

Researchers have developed a new multimodal framework for classifying DICOM image series, integrating both image content and acquisition metadata. This approach uses a bi-directional cross-modal attention mechanism and a metadata encoder that handles missing or inconsistent data without imputation. The system is designed to manage variable series lengths and image dimensions, demonstrating superior performance over existing methods on both in-domain and out-of-domain evaluations. AI

IMPACT This new framework could improve the accuracy and efficiency of medical image analysis pipelines.
TOOL · arXiv cs.CV English(EN) · 5d

Towards Selection of Large Multimodal Models as Engines for Burned-in Protected Health Information Detection in Medical Images

Researchers evaluated large multimodal models (LMMs) like GPT-4o and Gemini 2.5 Flash for detecting protected health information (PHI) in medical images. While LMMs showed improved text recognition (lower Word Error Rate) compared to traditional OCR methods, this did not always translate to higher overall PHI detection accuracy. The study found that LMMs were most effective on complex imprint patterns and offered recommendations for selecting and deploying these models in healthcare settings. AI

IMPACT LMMs show potential for improving PHI detection in medical images, particularly for complex cases, guiding future healthcare AI deployments.

Brief

Revisiting Integration of Image and Metadata for DICOM Series Classification: Cross-Attention and Dictionary Learning

Towards Selection of Large Multimodal Models as Engines for Burned-in Protected Health Information Detection in Medical Images