Learning Preference-Based Objectives from Clinical Narratives for Dynamic Sepsis Treatment
Researchers have developed a new framework called Clinical Narrative-informed Preference Rewards (CN-PR) to learn reward functions for reinforcement learning in healthcare. This method extracts trajectory-level preferences from clinical narratives, such as discharge summaries, to supervise the learning process. The CN-PR framework was evaluated in dynamic sepsis treatment, showing that policies learned with this approach led to improved recovery outcomes and comparable mortality rates to existing methods. AI
IMPACT This framework could enable more nuanced and personalized treatment strategies in dynamic healthcare scenarios by leveraging unstructured clinical data.