Researchers have developed a zero-shot, agentic workflow using open-source Large Language Models (LLMs) to extract crucial information from lung pathology reports. This method aims to automate the population of 13 College of American Pathologists synoptic fields, a task traditionally requiring manual effort and prone to errors. While a supervised baseline achieved a Micro-F1 score of 0.960, the best performing zero-shot LLM, GPT OSS 20B, reached a Micro-F1 of 0.893, demonstrating its capability to accurately extract complex relations without specific training. AI
IMPACT This research suggests open-source LLMs can provide a cost-effective solution for automating critical data extraction in medical pathology.
RANK_REASON Research paper detailing a novel application of LLMs for information extraction from clinical narratives.
- alphaXiv
- arXiv
- CatalyzeX
- College of American Pathologists
- Connected Papers
- CORE Recommender
- DagsHub
- GatorTron
- Gotit.pub
- GPT OSS 20B
- Hugging Face
- Litmaps
- ScienceCast
- scite Smart Citations
AI-generated summary · Google Gemini · from 2 sources. How we write summaries →