CreativeBench benchmark evaluates and enhances machine creativity

By PulseAugur Editorial · [1 sources] · 2026-06-12 04:00

Researchers have introduced CreativeBench, a new benchmark designed to evaluate and enhance machine creativity, particularly in code generation. The benchmark utilizes a cognitive framework and includes two subsets, CreativeBench-Combo and CreativeBench-Explore, to assess combinatorial and exploratory creativity. A key finding is that while scaling models improves combinatorial creativity, it can lead to diminished returns in exploration and a tendency towards "convergence-by-scaling," making models more correct but less divergent. To address this, the paper proposes EvoRePE, a strategy to improve machine creativity by incorporating evolutionary search patterns. AI

IMPACT Introduces a new benchmark for evaluating machine creativity, potentially guiding future model development towards more divergent and exploratory capabilities.

RANK_REASON The cluster contains a research paper introducing a new benchmark and evaluation methodology for machine creativity. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

CreativeBench benchmark evaluates and enhances machine creativity

COVERAGE [1]

arXiv cs.AI TIER_1 English(EN) · Zi-Han Wang, Lam Nguyen, Zhengyang Zhao, Mengyue Yang, Chengwei Qin, Yujiu Yang, Linyi Yang · 2026-06-12 04:00

CreativeBench: Benchmarking and Enhancing Machine Creativity via Self-Evolving Challenges

arXiv:2603.11863v2 Announce Type: replace Abstract: The saturation of high-quality pre-training data has shifted research focus toward evolutionary systems capable of continuously generating novel artifacts, leading to the success of AlphaEvolve. However, the progress of such sys…

COVERAGE [1]

CreativeBench: Benchmarking and Enhancing Machine Creativity via Self-Evolving Challenges

RELATED ENTITIES

RELATED TOPICS