model collapse
PulseAugur coverage of model collapse — every cluster mentioning model collapse across labs, papers, and developer communities, ranked by signal.
4 天有情绪数据
-
文化演化理论解释模型崩溃
研究人员将模型崩溃(大型语言模型在训练自身输出来进行训练时会退化)这一现象重新解读为一种文化演化过程。通过应用迭代学习理论,他们使用LLaMA-2-7B和Mistral-7B模型在多种语言上推导并测试了五个预测。一个关键发现是,在未经筛选的自训练过程中,组合性最初会增加然后减少,这种模式即使在正则化数据下也持续存在,并且只有通过任务基础的筛选才能缓解。
-
Model collapse in interactive AI learning environments analyzed
Researchers have developed a new framework to understand model collapse in structured interactive learning environments. Their work addresses the challenges posed by generative AI models being trained on synthetic data …
-
ML systems fail in production due to infrastructure, not models
A recent article highlights the critical difference between testing an ML model in isolation and testing the entire production system. It details a scenario where a recommendation model, performing well in offline evalu…
-
AI models degrade due to 'data cannibalism' from synthetic training
Model collapse, also termed "data cannibalism," describes a degradation in AI model performance. This occurs when models are trained repeatedly on synthetic data generated by other AI systems, rather than on novel human…
-
Model collapse threatens AI democratization, disproportionately harming low-resource communities
A new position paper argues that model collapse, where generative models trained on prior models' outputs degrade in performance, poses a significant threat to low-resource communities. This phenomenon exacerbates data …
-
Researchers mathematically prove AI cannot self-improve to superintelligence
Researchers have mathematically demonstrated that artificial intelligence cannot achieve superintelligence through recursive self-improvement. Instead of advancing towards artificial general intelligence, AI models are …