Researchers have developed LoGSAM, a novel framework designed for parameter-efficient segmentation of brain tumors in MRI scans. This system transforms radiologist dictations into text prompts that guide foundation models for localization and segmentation. By leveraging pre-trained models like Whisper ASR, Grounding DINO, and MedSAM with minimal parameter updates, LoGSAM achieves a Dice score of 80.32% on the BRISC 2025 dataset, nearing the performance of fully fine-tuned models while using significantly fewer parameters. The pipeline also demonstrated high accuracy in extracting tumor-specific information from German dictations. AI
IMPACT This research demonstrates a novel approach to medical image segmentation by leveraging foundation models and speech-to-text technology, potentially streamlining diagnostic workflows.
RANK_REASON The cluster contains a research paper detailing a new AI model and framework. [lever_c_demoted from research: ic=1 ai=1.0]
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →