PulseAugur
实时 14:11:39
English(EN) Learning with not Enough Data Part 3: Data Generation

Google AI 推出嵌套学习;OpenAI 在元学习和 AI 安全方面取得进展

Google Research 推出了“嵌套学习”(Nested Learning),这是一种新颖的机器学习范式,旨在解决持续学习中的灾难性遗忘问题。该方法将模型视为相互关联的优化问题,使它们能够在不丧失先前任务熟练度的情况下获取新知识。一个名为“Hope”的概念验证架构已通过该范式在语言建模和长上下文记忆管理方面展示了卓越的性能。OpenAI 还发布了关于元学习算法的研究,包括 Reptile,该算法专注于学习如何高效地为新任务学习,以及一种分层强化学习算法,通过将复杂问题分解为高级动作来加快任务完成速度。 AI

排序理由 该集群包含来自主要 AI 实验室的关于新机器学习范式和算法的多个研究论文和博客文章。

在 Lil'Log (Lilian Weng) 阅读 →

AI 生成摘要 · Google Gemini · 来自 13 个来源。 我们如何撰写摘要 →

Google AI 推出嵌套学习;OpenAI 在元学习和 AI 安全方面取得进展

报道来源 [13]

  1. Google AI / Research TIER_1 English(EN) ·

    Introducing Nested Learning: A new ML paradigm for continual learning

    Algorithms & Theory

  2. OpenAI News TIER_1 English(EN) ·

    The power of continuous learning

    Lilian Weng works on Applied AI Research at OpenAI.

  3. OpenAI News TIER_1 English(EN) ·

    On first-order meta-learning algorithms

  4. OpenAI News TIER_1 English(EN) ·

    Reptile: A scalable meta-learning algorithm

    We’ve developed a simple meta-learning algorithm called Reptile which works by repeatedly sampling a task, performing stochastic gradient descent on it, and updating the initial parameters towards the final parameters learned on that task. Reptile is the application of the Shorte…

  5. OpenAI News TIER_1 English(EN) ·

    Learning a hierarchy

    We’ve developed a hierarchical reinforcement learning algorithm that learns high-level actions useful for solving a range of tasks, allowing fast solving of tasks requiring thousands of timesteps. Our algorithm, when applied to a set of navigation problems, discovers a set of hig…

  6. Hugging Face Blog TIER_1 English(EN) ·

    An Introduction to Q-Learning Part 2/2

  7. Hugging Face Blog TIER_1 English(EN) ·

    An Introduction to Q-Learning Part 1

  8. Lil'Log (Lilian Weng) TIER_1 English(EN) ·

    Learning with not Enough Data Part 3: Data Generation

    <p>Here comes the Part 3 on learning with not enough data (Previous: <a href="https://lilianweng.github.io/posts/2021-12-05-semi-supervised/">Part 1</a> and <a href="https://lilianweng.github.io/posts/2022-02-20-active-learning/">Part 2</a>). Let’s consider two approaches for gen…

  9. Lil'Log (Lilian Weng) TIER_1 English(EN) ·

    Learning with not Enough Data Part 2: Active Learning

    <!-- The performance of supervised learning tasks improves with more high-quality labels available. However, it is expensive to collect a large number of labeled samples. Active learning is one paradigm to deal with not enough labeled data, when there are resources for labeling m…

  10. Lil'Log (Lilian Weng) TIER_1 English(EN) ·

    Learning with not Enough Data Part 1: Semi-Supervised Learning

    <!-- The performance of supervised learning tasks improves with more high-quality labels available. However, it is expensive to collect a large number of labeled samples. There are several paradigms in machine learning to deal with the scenario when the labels are scarce. Semi-su…

  11. Lil'Log (Lilian Weng) TIER_1 English(EN) ·

    Meta-Learning: Learning to Learn Fast

    <!-- Meta-learning, also known as "learning to learn", intends to design models that can learn new skills or adapt to new environments rapidly with a few training examples. There are three common approaches: 1) learn an efficient distance metric (metric-based); 2) use (recurrent)…

  12. arXiv cs.LG TIER_1 English(EN) · Noor Islam S. Mohammad, Md Muntaqim Meherab ·

    Regularized Meta-Learning for Improved Generalization

    arXiv:2602.12469v2 Announce Type: replace Abstract: Deep ensemble methods often improve predictive performance, yet they suffer from three practical limitations: redundancy among base models that inflates computational cost and degrades conditioning, unstable weighting under mult…

  13. Eugene Yan TIER_1 English(EN) ·

    NLP for Supervised Learning - A Brief Survey

    Examining the broad strokes of NLP progress and comparing between models