PulseAugur
实时 23:31:41
实体 ClassEval-Pro

ClassEval-Pro

PulseAugur coverage of ClassEval-Pro — every cluster mentioning ClassEval-Pro across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
2
90 天内 2
发布 · 30天
0
90 天内 0
论文 · 30天
2
90 天内 2
层级分布 · 90 天
最近 · 第 1/1 页 · 共 2 条
  1. RESEARCH · CL_14647 ·

    ClassEval-Pro benchmark reveals LLMs struggle with class-level code generation

    Researchers have introduced ClassEval-Pro, a new benchmark designed to evaluate the class-level code generation capabilities of large language models. This benchmark consists of 300 tasks across 11 domains, created usin…

  2. RESEARCH · CL_00255 ·

    LLM research explores new methods for training, evaluation, and understanding model behavior

    Researchers are developing new methods to improve LLM capabilities in various domains. One study introduces MemCoE, a cognition-inspired framework for LLM agents to learn how to organize and update long-term user memory…