English(EN) SICI: A Semantic-Pragmatic Complexity Index Reveals Regime Shifts in LLM Stance Detection

新的SICI指数揭示大型语言模型立场检测的复杂度转变

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-11 10:58

研究人员开发了SICI，一个衡量文本对大型语言模型（LLM）立场检测的语义-语用复杂度的七维指数。该指数比现有方法更能预测LLM的准确性，并揭示了LLM的错误会随着复杂度的增加而可预测地转变，从过度归因转向弃权。研究发现，包括GPT-3.5、GPT-4o-mini、DeepSeek-V3和GPT-4o在内的模型，常见的干预措施（如提示和检索）并未完全克服这种高复杂度瓶颈。 AI

影响这项研究为评估LLM在复杂任务上的性能提供了一个新指标，可能指导未来的模型开发和微调策略。

排序理由这是一篇研究论文，详细介绍了一个新指数以及关于LLM行为的发现。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CL TIER_1 English(EN) · Fuqiang Niu, Bowen Zhang · 2026-06-12 04:00

SICI: A Semantic-Pragmatic Complexity Index Reveals Regime Shifts in LLM Stance Detection

arXiv:2606.13189v1 Announce Type: new Abstract: Prompt-based LLMs are increasingly used for stance detection, but harder examples are not always repaired by clearer instructions, reasoning prompts, retrieval, or debate. We introduce SICI (Stance Inference Complexity Index), a sev…
arXiv cs.CL TIER_1 English(EN) · Bowen Zhang · 2026-06-11 10:58

SICI: A Semantic-Pragmatic Complexity Index Reveals Regime Shifts in LLM Stance Detection

Prompt-based LLMs are increasingly used for stance detection, but harder examples are not always repaired by clearer instructions, reasoning prompts, retrieval, or debate. We introduce SICI (Stance Inference Complexity Index), a seven-dimensional diagnostic measure of the semanti…

报道来源 [2]

SICI: A Semantic-Pragmatic Complexity Index Reveals Regime Shifts in LLM Stance Detection

SICI: A Semantic-Pragmatic Complexity Index Reveals Regime Shifts in LLM Stance Detection

相关实体

相关话题