Weakly Supervised NLP Automates Diagnosis Identification from Discharge Letters

By PulseAugur Editorial · [1 sources] · 2026-06-15 04:00

Researchers have developed a weakly supervised Natural Language Processing (NLP) pipeline to automatically identify patient diagnoses from hospital discharge letters. This method avoids the need for extensive manual annotation by using a transformer model to generate semantic embeddings and a two-level clustering procedure to create weak labels for training a classifier. In a case study on bronchiolitis, the best weakly supervised model achieved an AUROC of 77.68%, demonstrating its potential for scalable disease identification from clinical text. AI

IMPACT This NLP technique could significantly reduce manual annotation time for clinical research, enabling faster and more scalable disease identification from large datasets.

RANK_REASON The cluster describes a research paper detailing a new NLP method for medical diagnosis identification. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

arXiv cs.CL TIER_1 English(EN) · Vittorio Torri, Elisa Barbieri, Anna Cantarutti, Carlo Giaquinto, Francesca Ieva · 2026-06-15 04:00

Automatic identification of diagnosis from hospital discharge letters via weakly supervised Natural Language Processing

arXiv:2410.15051v3 Announce Type: replace Abstract: Identifying patient diagnoses from hospital discharge letters is essential for large-scale cohort selection and epidemiological research, but traditional supervised approaches require extensive manual annotation, which is often …

COVERAGE [1]

Automatic identification of diagnosis from hospital discharge letters via weakly supervised Natural Language Processing

RELATED ENTITIES

RELATED TOPICS