A user found that Anthropic's Claude Code frequently failed to follow Test-Driven Development (TDD), often writing the source code before the corresponding test. An audit of 30 days of commits showed that in six out of ten features, the test file was committed after the source file, sometimes by as much as 23 minutes. The behavior stems from the model's next-token prediction: its training data predominantly shows implementation preceding tests, making the AI
Summary written by gemini-2.5-flash-lite from 1 source.