CoSPlay: Cooperative Self-Play at Test-Time with Self-Generated Code and Unit Test
Researchers have developed CoSPlay, a framework for improving LLM code generation without relying on ground-truth unit tests. The approach is training-free: at test time, it uses cooperative self-play to iteratively refine both the generated code and the self-generated unit tests that accompany it. Using execution signals, CoSPlay prunes weak code candidates and refreshes unreliable tests, which is reported to improve both code generation accuracy and test quality.
AI IMPACT: This framework offers a scalable inference-time strategy for competitive code generation, potentially reducing reliance on costly ground-truth data.
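
The prune-and-refresh loop described above can be illustrated with a small sketch. This is not CoSPlay's actual implementation: the scoring rule (count of tests passed), the agreement threshold, and the names `pass_matrix`, `self_play_round`, `keep_codes`, and `min_agreement` are all assumptions made for illustration, and candidates are plain Python functions rather than LLM outputs.

```python
from typing import Callable, List, Tuple

Candidate = Callable[[int], int]   # hypothetical stand-in for a generated program
Test = Tuple[int, int]             # (input, expected output), also self-generated


def pass_matrix(codes: List[Candidate], tests: List[Test]) -> List[List[bool]]:
    """matrix[i][j] is True when candidate i passes test j (a crash counts as a fail)."""
    matrix = []
    for code in codes:
        row = []
        for arg, expected in tests:
            try:
                row.append(code(arg) == expected)
            except Exception:
                row.append(False)
        matrix.append(row)
    return matrix


def self_play_round(codes, tests, keep_codes=2, min_agreement=0.5):
    """One illustrative round: prune weak code, flag unreliable tests.

    Assumes at least one candidate and keep_codes >= 1.
    Returns (surviving code indices, reliable test indices, test indices to refresh).
    """
    matrix = pass_matrix(codes, tests)
    # Assumed execution signal: a candidate's score is the number of tests it passes.
    scores = [sum(row) for row in matrix]
    ranked = sorted(range(len(codes)), key=lambda i: scores[i], reverse=True)
    survivors = sorted(ranked[:keep_codes])
    # Assumed reliability rule: a test is kept if enough survivors pass it.
    reliable, to_refresh = [], []
    for j in range(len(tests)):
        agreement = sum(matrix[i][j] for i in survivors) / len(survivors)
        (reliable if agreement >= min_agreement else to_refresh).append(j)
    return survivors, reliable, to_refresh


# Toy task: square an integer. Candidate 2 is wrong; test 2 has a wrong expected value.
codes = [lambda x: x * x, lambda x: x ** 2, lambda x: x + x]
tests = [(2, 4), (3, 9), (3, 10), (0, 0)]
print(self_play_round(codes, tests))  # ([0, 1], [0, 1, 3], [2])
```

In a full loop, the tests flagged for refresh would be regenerated by the model and the round repeated; that regeneration step is omitted here because the summary does not describe how it is done.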