OpenAI research reveals deep double descent phenomenon in neural networks

By PulseAugur Editorial · [1 sources] · 2019-12-05 08:00

OpenAI researchers have identified a phenomenon called "deep double descent" in various deep learning models, including CNNs, ResNets, and transformers. This occurs when models are not carefully regularized, causing performance to initially improve, then worsen, and then improve again as model size, data, or training time increases. The research indicates that in certain regimes, larger models can perform worse, more training data can be detrimental, and extended training can paradoxically reverse overfitting. AI

RANK_REASON The cluster describes a research paper published by OpenAI detailing a phenomenon observed in deep learning models.

Read on OpenAI News →

paper
other

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

OpenAI research reveals deep double descent phenomenon in neural networks

COVERAGE [1]

OpenAI News TIER_1 English(EN) · 2019-12-05 08:00

Deep double descent

We show that the double descent phenomenon occurs in CNNs, ResNets, and transformers: performance first improves, then gets worse, and then improves again with increasing model size, data size, or training time. This effect is often avoided through careful regularization. While t…

COVERAGE [1]

Deep double descent

RELATED TOPICS