实体 arXiv cs.CL

arXiv cs.CL

PulseAugur coverage of arXiv cs.CL — every cluster mentioning arXiv cs.CL across labs, papers, and developer communities, ranked by signal.

总计 · 30天

3

90 天内 3

发布 · 30天

0

90 天内 0

论文 · 30天

3

90 天内 3

层级分布 · 90 天

主题

情绪 · 30 天

3 天有情绪数据

最近 · 第 1/1 页 · 共 3 条

RESEARCH · CL_82112 · Jun 9 · 01:40

New framework enhances LLM output diversity

Researchers have developed a new framework to analyze and improve the diversity of outputs generated by large language models. The framework categorizes methods based on where diversity is introduced during the generati…
RESEARCH · CL_79571 · Jun 8 · 11:05

LLM多跳推理失败与预训练数据相关

一篇新的研究论文探讨了大型语言模型（LLM）为何在多跳推理方面存在困难，即使它们拥有所需的单个事实。研究发现，模型在组合来自不同事实的信息以回答新问题时会失败，例如从两个相关信息推断出生日期。这种失败归因于预训练阶段缺乏对组合式上下文的暴露，而不是知识的缺失。
RESEARCH · CL_72524 · Jun 4 · 00:00

研究发现LLM立场模拟对上下文敏感

研究人员开发了一个新的框架来审计大型语言模型（LLM）在在线讨论中模拟用户立场的方式。该框架测试了LLM模拟对对话上下文变化的敏感性，包括模因等模态元素。研究发现，LLM可以根据修订后的上下文有效地改变模拟立场，这既突显了使用这些模型模仿在线意见动态的潜力，也揭示了其风险。