The math of multi-model consensus: when 3 cheap reviews beat 1 expensive one
Using multiple smaller AI models can be more effective than a single large one for tasks like code review, according to mathematical analysis. The key is that the smaller models should have uncorrelated errors, meaning their mistakes do not overlap. This approach, similar to RAID for disks or ensemble classifiers, can achieve higher accuracy rates than a single, more powerful model, often at a lower cost and with parallel processing benefits. AI
IMPACT This approach could lead to more cost-effective and robust AI systems for tasks like code review and quality assurance.