Entity: large-language models

large-language models

PulseAugur coverage of large-language models — every cluster mentioning large-language models across labs, papers, and developer communities, ranked by signal.


Total · 30 days

490

Within 90 days: 490

Releases · 30 days

Within 90 days: 0

Papers · 30 days

378

Within 90 days: 378

Tier distribution · 90 days

significant 4
research 137
tool 266
commentary 76
meme 7

Relationships

Timeline

2026-05-25 research_milestone A study found that large language models exhibit persistent biases when providing guidance on religious conversions. Source
2026-05-22 research_milestone A study evaluated LLM performance in psychiatric screening, finding varying accuracy and a tendency to discount symptom evidence in certain contexts. Source
2026-05-21 research_milestone A new framework was proposed to improve cross-lingual cultural knowledge alignment in LLMs. Source
2026-05-18 research_milestone A paper was published detailing multilingual jailbreaking vulnerabilities in LLMs using low-resource languages.
2026-05-18 research_milestone A study found that LLMs corrupt document content in delegated workflows. Source
2026-05-18 research_milestone Large language models demonstrated zero-shot goal recognition capabilities in a new study.
2026-05-16 research_milestone A new benchmark and dataset are introduced for evaluating LLMs on legal precedent classification.
2026-05-15 research_milestone A new paper proposes using LLMs for data augmentation to improve cognitive score prediction from speech. Source
2026-05-15 research_milestone A study was published on arXiv evaluating LLM reasoning in tax law and proposing neuro-symbolic alternatives. Source
2026-05-15 research_milestone Development of a new framework for AI value alignment and introduction of the DailyDilemmas test by Cornell University. Source
2026-05-15 research_milestone Researchers identified an implementation fidelity gap in LLMs, showing they can understand algorithms but struggle to code in unseen languages. Source
2026-05-13 research_milestone LLMs demonstrated superior accuracy, speed, and cost-effectiveness in transcribing historical handwriting compared to specialized software. Source
2026-05-13 research_milestone A new method for LLM adaptation using active information seeking was published on arXiv. Source
2026-05-12 research_milestone A research paper demonstrates that LLMs exhibit bias towards sponsored products, but this can be mitigated with specific user prompts. Source
2026-05-12 research_milestone A new paper proposes a behavior-based approach for federated fine-tuning of LLMs. Source

Sentiment · 30 days

25 days with sentiment data

Recent · Page 4 of 10 · 200 items total


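Large language models (LLMs) tested for automated assessment of students' programming explanations

New framework addresses symmetry problems in LLM theorem provers

New method steers LLM attention to correct reasoning errors

New benchmark reveals LLM reasoning failures and Claude's evasive behavior

NaviAgent improves LLM tool orchestration through bilevel planning

AI agents show promise in program verification and theorem proving

TingIS system uses LLMs to discover critical incidents in real time

New MTR-Bench evaluates the multi-turn reasoning ability of large language models

New benchmark reveals significant framing bias in LLM news summarization

Large language models quantify internal narratives to characterize depressive states

New GCPO framework improves LLM post-training through geometry-aware uncertainty

Large language models estimate mutual information using the new PromptNCE method

New MCTS method enhances interpretability and efficiency

New decoding method improves LLM factual accuracy and efficiency

New VCR-Agent framework uses large language models to enhance biological discovery

New benchmark evaluates LLMs on interactive scientific code generation

New framework uses LLMs for advanced time-series forecasting

Large language models improve outpatient referrals through dynamic questioning

New framework quantifies uncertainty in LLM survey simulation

AI framework predicts public opinion trends from survey data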