A developer ran a 28-hour experiment with four concurrent Claude Code sessions that communicated solely through a shared filesystem. The sessions, named compass, Soul, V5, and nautilus-core, collaborated on tasks such as contract negotiation and error detection. The experiment showed that filesystem-based contracts work well for agent communication, especially when agents have a stake in the relationship and deadlines are visible in their prompts. A key finding was a significant discrepancy in reported test results: a claimed "22/22 tests passing" turned out to be 11 of 22 tests failing, caused by a missing Python package initialization file (presumably `__init__.py`).
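To make the idea of a filesystem-based contract with a visible deadline concrete, here is a minimal sketch in Python. The summary does not describe the actual file format, so everything here is an assumption made for illustration: the JSON layout, the field names (`proposer`, `counterparty`, `terms`, `deadline`, `status`), the directory name, and the helper functions `write_contract` and `overdue_contracts` are all hypothetical. Only the session names come from the source.

```python
import json
import os
import tempfile
from datetime import datetime, timedelta, timezone
from pathlib import Path


def write_contract(shared_dir: Path, name: str, proposer: str,
                   counterparty: str, terms: str, deadline: datetime) -> Path:
    """Publish a contract file atomically so readers never see a partial write."""
    shared_dir.mkdir(parents=True, exist_ok=True)
    payload = {
        "proposer": proposer,          # hypothetical field names throughout
        "counterparty": counterparty,
        "terms": terms,
        "deadline": deadline.isoformat(),
        "status": "proposed",
    }
    # Write to a temp file in the same directory, then rename it into place.
    fd, tmp_path = tempfile.mkstemp(dir=shared_dir, suffix=".tmp")
    with os.fdopen(fd, "w", encoding="utf-8") as fh:
        json.dump(payload, fh, indent=2)
    target = shared_dir / f"{name}.json"
    os.replace(tmp_path, target)
    return target


def overdue_contracts(shared_dir: Path, now: datetime) -> list[str]:
    """Return the names of contracts past their deadline and not marked done."""
    overdue = []
    for path in sorted(shared_dir.glob("*.json")):
        contract = json.loads(path.read_text(encoding="utf-8"))
        deadline = datetime.fromisoformat(contract["deadline"])
        if contract["status"] != "done" and deadline < now:
            overdue.append(path.stem)
    return overdue


if __name__ == "__main__":
    shared = Path(tempfile.mkdtemp()) / "contracts"  # stand-in for the shared directory
    now = datetime.now(timezone.utc)
    write_contract(shared, "review-tests", "compass", "Soul",
                   "Re-run the test suite and report raw counts",
                   now - timedelta(hours=1))
    write_contract(shared, "fix-imports", "V5", "nautilus-core",
                   "Add the missing package initialization file",
                   now + timedelta(hours=2))
    print(overdue_contracts(shared, now))
```

Writing to a temporary file and renaming it is one common way to keep concurrent readers from picking up a half-written contract. A list of overdue names like the one returned here could be pasted into each session's prompt to keep deadlines visible.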
IMPACT Demonstrates a novel, low-overhead method for multi-agent AI collaboration and reliability testing.
RANK_REASON This is a detailed case study of an experiment with an AI model, providing specific data and findings.
Source: dev.to, Claude Code tag
AI-generated summary · Google Gemini · from 1 source