Researchers have developed a Multi-Dimensional Credibility Assessment (MDCA) framework to evaluate the trustworthiness of AI-generated radiology reports. The study focused on enhancing LLM-generated liver MRI reports and explored prompt optimization techniques. Several advanced LLMs, including Kimi-K2-Instruct-0905, Qwen3-235B-A22B-Instruct-2507, DeepSeek-V3, and ByteDance-Seed-OSS-36B-Instruct, were evaluated using the SiliconFlow platform. AI
IMPACT Establishes a framework for evaluating AI-generated medical reports, potentially improving diagnostic accuracy and trust in AI tools within healthcare.
RANK_REASON The cluster contains an academic paper detailing a new framework and evaluation of existing models. [lever_c_demoted from research: ic=1 ai=1.0]
- ByteDance-Seed-OSS-36B-Instruct
- DeepSeek-V3
- Kimi-K2-Instruct-0905
- Qiuli Wang
- Qwen3-235B-A22B-Instruct-2507
- SiliconFlow
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →