PulseAugur
EN
LIVE 06:58:05

New AI framework translates radiologist speech to MRI tumor segmentation

Researchers have developed LoGSAM, a novel framework designed for parameter-efficient segmentation of brain tumors in MRI scans. This system transforms radiologist dictations into text prompts that guide foundation models for localization and segmentation. By leveraging pre-trained models like Whisper ASR, Grounding DINO, and MedSAM with minimal parameter updates, LoGSAM achieves a Dice score of 80.32% on the BRISC 2025 dataset, nearing the performance of fully fine-tuned models while using significantly fewer parameters. The pipeline also demonstrated high accuracy in extracting tumor-specific information from German dictations. AI

IMPACT This research demonstrates a novel approach to medical image segmentation by leveraging foundation models and speech-to-text technology, potentially streamlining diagnostic workflows.

RANK_REASON The cluster contains a research paper detailing a new AI model and framework. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CV →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

New AI framework translates radiologist speech to MRI tumor segmentation

COVERAGE [1]

  1. arXiv cs.CV TIER_1 English(EN) · Mohammad Robaitul Islam Bhuiyan, Sheethal Bhat, Melika Qahqaie ·

    LoGSAM: Parameter-Efficient Cross-Modal Grounding for MRI Segmentation

    arXiv:2603.17576v3 Announce Type: replace Abstract: Precise localization and delineation of brain tumors using magnetic resonance imaging (MRI) are essential for planning therapy and guiding surgical decisions. To address this, we propose LoGSAM, a parameter-efficient, detection-…