How LLMs See Creativity: Zero-Shot Scoring of Visual Creativity with Interpretable Reasoning

Study finds large language models can score visual creativity zero-shot

By the PulseAugur editorial team · [2 sources] · 2026-06-29 00:39

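A new research paper examines whether multimodal large language models (LLMs) can evaluate visual creativity zero-shot, that is, without any task-specific training. The study tested six LLMs, including Gemini 3 Flash, Gemma 4-31B-it, and GPT-5.4 Mini, on AI-generated images and human sketches. The models' scores aligned with human creativity ratings, with correlations ranging from .29 to .68. The models' step-by-step reasoning made their evaluation criteria interpretable, for example how they balance originality against quality, but this reasoning did not improve their agreement with human judgments.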
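The agreement figures in the summary are correlations between model scores and human ratings. As a rough illustration of how such a figure can be computed, here is a minimal sketch using the Pearson coefficient. The summary does not say which correlation statistic the paper uses, so the choice of Pearson is an assumption. The function name `pearson_r` and the ratings are made up for illustration and are not data from the study.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation between two equal-length sequences of numbers."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of co-deviations and sums of squared deviations from each mean.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant ratings")
    return cov / sqrt(var_x * var_y)


# Hypothetical ratings for five images (illustration only, not from the paper).
human_ratings = [1.0, 2.0, 3.0, 4.0, 5.0]
model_ratings = [2.0, 1.0, 4.0, 3.0, 5.0]

r = pearson_r(human_ratings, model_ratings)
print(f"r = {r:.2f}")  # prints "r = 0.80"
```

A value near 1 means the model ranks and spaces the images much as the human raters do. The reported range of .29 to .68 would correspond to weaker agreement than in this toy example.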

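Impact: Multimodal large language models show promise for zero-shot assessment of visual creativity, offering interpretable reasoning about AI-generated artwork and sketches.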

Ranking rationale: An academic paper detailing research findings on the capabilities of large language models.

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 2 sources.

Sources [2]

arXiv cs.CL TIER_1 English(EN) · William Orwig, Roger E. Beaty · 2026-06-30 04:00

How LLMs See Creativity: Zero-Shot Scoring of Visual Creativity with Interpretable Reasoning

arXiv:2606.29672v1 · Abstract: Evaluating the originality of visual images poses enduring challenges for creativity assessment. Automated scoring using AI models has proven effective in the verbal domain, yet key questions remain about evaluating visual creativit…

arXiv cs.CL TIER_1 English(EN) · Roger E. Beaty · 2026-06-29 00:39

How LLMs See Creativity: Zero-Shot Scoring of Visual Creativity with Interpretable Reasoning

Evaluating the originality of visual images poses enduring challenges for creativity assessment. Automated scoring using AI models has proven effective in the verbal domain, yet key questions remain about evaluating visual creativity and understanding how models arrive at their r…
