实体 large-language models

large-language models

PulseAugur coverage of large-language models — every cluster mentioning large-language models across labs, papers, and developer communities, ranked by signal.

Show in brief

总计 · 30天

491

90 天内 491

发布 · 30天

90 天内 0

论文 · 30天

378

90 天内 378

层级分布 · 90 天

significant 4
research 137
tool 267
commentary 76
meme 7

关系

时间线

2026-05-25 research_milestone A study found that large language models exhibit persistent biases when providing guidance on religious conversions. 来源
2026-05-22 research_milestone A study evaluated LLM performance in psychiatric screening, finding varying accuracy and a tendency to discount symptom evidence in certain contexts. 来源
2026-05-21 research_milestone A new framework was proposed to improve cross-lingual cultural knowledge alignment in LLMs. 来源
2026-05-18 research_milestone A paper was published detailing multilingual jailbreaking vulnerabilities in LLMs using low-resource languages.
2026-05-18 research_milestone A study found that LLMs corrupt document content in delegated workflows. 来源
2026-05-18 research_milestone Large language models demonstrated zero-shot goal recognition capabilities in a new study.
2026-05-16 research_milestone A new benchmark and dataset are introduced for evaluating LLMs on legal precedent classification.
2026-05-15 research_milestone A new paper proposes using LLMs for data augmentation to improve cognitive score prediction from speech. 来源
2026-05-15 research_milestone A study was published on arXiv evaluating LLM reasoning in tax law and proposing neuro-symbolic alternatives. 来源
2026-05-15 research_milestone Development of a new framework for AI value alignment and introduction of the DailyDilemmas test by Cornell University. 来源
2026-05-15 research_milestone Researchers identified an implementation fidelity gap in LLMs, showing they can understand algorithms but struggle to code in unseen languages. 来源
2026-05-13 research_milestone LLMs demonstrated superior accuracy, speed, and cost-effectiveness in transcribing historical handwriting compared to specialized software. 来源
2026-05-13 research_milestone A new method for LLM adaptation using active information seeking was published on arXiv. 来源
2026-05-12 research_milestone A research paper demonstrates that LLMs exhibit bias towards sponsored products, but this can be mitigated with specific user prompts. 来源
2026-05-12 research_milestone A new paper proposes a behavior-based approach for federated fine-tuning of LLMs. 来源

情绪 · 30 天

26 天有情绪数据

最近 · 第 10/10 页 · 共 200 条

large-language models

新的评分标准评估大语言模型生成的法律命题

AI拥护者忽视了隐藏的成本和低效率

大型语言模型生成性别化行为，影响智能体的信任校准

LLM通过结构感知文本嵌入增强图异常检测

AI代理获得物理理解以用于CAD工程设计

大型语言模型在低资源语音识别错误纠正方面展现出潜力

AI网关：2026年管理LLM的必备工具

CodePercept 利用代码而非仅靠推理来提升 LLM 的视觉感知能力

跨模态技能注入可高效提升VLM能力

AI生成内容与人类写作难以区分

新框架SciCustom为科学任务定制化LLM评估

新框架SciCustom为科学任务定制化LLM评估

新的PAVE架构使生成式代理能够证明违规行为的合理性

针对可解释的错误信息检测对大型语言模型进行微调

研究发现：跨语言大型语言模型解释可能缺乏忠实性

新的DECOR框架使用信息操纵理论审计LLM欺骗

Google的TurboQuant大幅削减LLM内存需求，影响芯片股

论文：LLM不确定性量化是错误的无监督聚类

New LLM defense rewrites training data to combat poisoning attacks

GRASP框架增强LLM论点评估一致性