Entity
ClassEval-Pro
PulseAugur coverage of ClassEval-Pro — every cluster mentioning ClassEval-Pro across labs, papers, and developer communities, ranked by signal.
Total · 30 days: 2 (2 in the last 90 days)
Releases · 30 days: 0 (0 in the last 90 days)
Papers · 30 days: 2 (2 in the last 90 days)
Tier distribution · 90 days
Recent · Page 1 of 1 · 2 items
- ClassEval-Pro benchmark reveals LLMs struggle with class-level code generation
Researchers have introduced ClassEval-Pro, a new benchmark designed to evaluate the class-level code generation capabilities of large language models. This benchmark consists of 300 tasks across 11 domains, created usin…
- LLM research explores new methods for training, evaluation, and understanding model behavior
Researchers are developing new methods to improve LLM capabilities in various domains. One study introduces MemCoE, a cognition-inspired framework for LLM agents to learn how to organize and update long-term user memory…