English(EN) Karpathy’s 90-Second Time Machine Through 33 Years of Neural Networks

Karpathy 重访 1989 年神经网络，用现代 AI 技术降低错误率

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-06 13:01

Andrej Karpathy 重建了一个 1989 年的神经网络，通过应用现代深度学习技术，错误率降低了 60%。他展示了使用交叉熵损失（而非均方误差）、AdamW 优化器以及数据增强（特别是图像移位）等创新方法，显著提高了模型的性能。Karpathy 还表明，即使使用原始的 1989 年方法，仅将数据集大小从 7,291 张图像增加到 50,000 张，也能大幅降低错误率。 AI

影响展示了基础 AI 技术和数据扩展如何在历史模型上持续带来显著改进。

排序理由文章详细介绍了使用现代技术复制和改进历史 AI 研究论文的实验。[lever_c_demoted from research: ic=1 ai=1.0]

在 Towards AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

Towards AI TIER_1 English(EN) · monica · 2026-06-06 13:01

Karpathy 的 33 年神经网络时间机器（90 秒速览）

<h4>What one of the world’s best AI educators discovered by recreating a 1989 neural network on his MacBook — and what it tells us about 2055.</h4><p>90 seconds. That’s how long it took Andrej Karpathy to train a neural network that originally needed 3 full days on a 1989 SUN-4 w…

报道来源 [1]

Karpathy 的 33 年神经网络时间机器（90 秒速览）

相关实体

相关话题