PulseAugur
实时 12:32:23
English(EN) Food Noise & False Safety: A Systematic Evaluation of How LLMs Fail to Adapt to Eating Disorder Queries with Clinician Feedback

研究发现大型语言模型未能适应饮食失调查询

一项新的研究论文评估了大型语言模型(LLMs)如何回应与饮食失调相关的查询。该研究在临床专家的参与下进行,识别出用户提示中的特定语言线索,这些线索会增加不安全或有害的LLM回应的可能性。研究人员发现,LLMs可能会不加批判地适应并助长危险的用户输入,对寻求支持的个人构成风险。 AI

影响 强调了大型语言模型与弱势群体互动时的关键安全问题,有必要为敏感查询改进安全防护措施。

排序理由 学术论文,通过专家反馈评估大型语言模型的安全性。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

报道来源 [2]

  1. arXiv cs.AI TIER_1 English(EN) · Giulia Pucci, Emily Hemendinger, Ruizhe Li, Gavin Abercrombie, Tanvi Dinkar, Arabella Sinclair ·

    Food Noise & False Safety: A Systematic Evaluation of How LLMs Fail to Adapt to Eating Disorder Queries with Clinician Feedback

    arXiv:2606.02444v1 Announce Type: new Abstract: Recent evidence shows that people with eating disorders (EDs) are increasingly seeking guidance, advice, and emotional support from Large Language Model (LLM)-based chat systems. Although these systems are not designed to provide cl…

  2. arXiv cs.AI TIER_1 English(EN) · Arabella Sinclair ·

    Food Noise & False Safety: A Systematic Evaluation of How LLMs Fail to Adapt to Eating Disorder Queries with Clinician Feedback

    Recent evidence shows that people with eating disorders (EDs) are increasingly seeking guidance, advice, and emotional support from Large Language Model (LLM)-based chat systems. Although these systems are not designed to provide clinical advice, their perceived expertise, neutra…