人工智能模型在文本和视频中处理幽默生成和理解

作者 PulseAugur 编辑部 · [4 个来源] · 2026-06-02 04:00

研究人员正在开发新的方法来评估和生成人工智能幽默，解决什么使事物有趣的主观性质。一种方法涉及使用偏好建模的“生成多个，选择最佳”策略，该策略在 SemEval-2026 幽默生成任务中排名很高。另一项研究分析了 YouTube Shorts 上的黑暗幽默和常规幽默，发现黑暗幽默会引起更多混合甚至有毒的观众反应。此外，还创建了一个名为 v-HUB 的新基准来评估多模态 LLM 理解视频中幽默的能力，并强调包含音频可以提高理解能力。 AI

影响新的基准和方法正在出现，以评估人工智能对幽默的细微理解和生成能力，从而推动多模态和约束式人工智能能力的边界。

排序理由多篇研究论文介绍了使用人工智能评估和生成幽默的新基准、数据集和方法。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 4 个来源。我们如何撰写摘要 →

报道来源 [4]

arXiv cs.AI TIER_1 English(EN) · Alexey Tikhonov, Alexey Ivanov · 2026-06-02 04:00

lmfaoooo at SemEval-2026 Task 1: Humor Is an Audience. Preference Modeling for Constrained Humor Generation

arXiv:2606.00022v1 Announce Type: cross Abstract: Humor generation remains difficult not only because producing fluent, novel jokes is hard, but because "funny" is audience-dependent and supervision is noisy -- preferences vary with audience, context, and culture, and annotator a…
arXiv cs.AI TIER_1 English(EN) · Sydney Johns, Sanjeev Parthasarathy, Shantnu Bhalla, Vaibhav Garg · 2026-06-02 04:00

When Jokes Cross the Line: Analyzing Regular Humor and Dark Humor in YouTube Shorts

arXiv:2606.00046v1 Announce Type: cross Abstract: Video platforms such as YouTube have reshaped how users engage with entertainment and information, emphasizing brief, highly engaging content such as Shorts. Within this ecosystem, certain content occupies a gray area where it rem…
arXiv cs.AI TIER_1 English(EN) · Zhengpeng Shi, Yanpeng Zhao, Jianqun Zhou, Yuxuan Wang, Qinrong Cui, Wei Bi, Songchun Zhu, Bo Zhao, Zilong Zheng · 2026-06-02 04:00

v-HUB: A Benchmark for Video Humor Understanding from Vision and Sound

arXiv:2509.25773v3 Announce Type: replace-cross Abstract: AI models capable of comprehending humor hold real-world promise -- for example, enhancing engagement in human-machine interactions. To gauge and diagnose the capacity of multimodal large language models (MLLMs) for humor …
arXiv cs.CL TIER_1 English(EN) · Edward Ajayi, Prasenjit Mitra · 2026-06-02 04:00

HumorRank: A Tournament-Based Leaderboard for Evaluating Humor Generation in Large Language Models

arXiv:2604.19786v2 Announce Type: replace Abstract: Humor remains difficult to evaluate in large language models (LLMs) because what makes a response funny is subjective, comparative, and shaped by interacting comedic mechanisms rather than a single scalar property. Existing humo…

报道来源 [4]

lmfaoooo at SemEval-2026 Task 1: Humor Is an Audience. Preference Modeling for Constrained Humor Generation

When Jokes Cross the Line: Analyzing Regular Humor and Dark Humor in YouTube Shorts

v-HUB: A Benchmark for Video Humor Understanding from Vision and Sound

HumorRank: A Tournament-Based Leaderboard for Evaluating Humor Generation in Large Language Models

相关实体

相关话题