Pulse

last 48h

[7/2007] 98 sources

What AI is actually talking about — clusters surfacing on Bluesky, Reddit, HN, Mastodon and Lobsters, re-ranked to elevate originality and crush noise.

RESEARCH · Mastodon — sigmoid.social English(EN) · 117mo · [43 sources] · LOBSTERSMASTOBLOG

Title: P9: Survey of Open-Source Machine Learning and Data Sciecne in [2024-10-03 Thu] a python library for bandit algorithms and off-policy evaluation 8) AIRI

OpenAI has released Triton 1.0, an open-source programming language designed to make GPU programming more accessible for researchers. Triton allows users to write efficient GPU code, comparable to expert-level performance, with significantly less code than traditional methods. This release aims to simplify the development of complex neural network operations and improve performance by automating low-level GPU optimizations. AI
RESEARCH · Eugene Yan English(EN) · 117mo · [4 sources] · BLOG

How Prototyping Can Help You to Get Buy-In

Eugene Yan details a multi-part process for building a product classification API, emphasizing the importance of prototyping to gain stakeholder buy-in. He explains how to acquire and prepare data, including cleaning titles and handling encoding issues, before training a machine learning model. The series also covers developing the API itself and demonstrates image search capabilities, though the API was later discontinued due to cloud costs. AI

IMPACT Provides a practical guide to end-to-end data product development, useful for engineers building similar classification systems.
RESEARCH · Eugene Yan English(EN) · 120mo · BLOG

Thoughts on Functional Programming in Scala Course (Coursera)

Eugene Yan shares his experience taking a Coursera course on functional programming in Scala, taught by the language's designer, Martin Odersky. The six-week course covered Scala fundamentals, functional programming concepts, and emphasized software engineering practices like unit testing with ScalaTest. Yan found that while he may not frequently use recursive solutions in his data science work, the course improved his understanding of Scala and problem-solving through tail recursion, ultimately making his code more robust and efficient. AI
RESEARCH · Practical AI English(EN) · 121mo · [36 sources] · BLOG

Generative models: exploration to deployment

Researchers are developing new methods to improve LLM capabilities in various domains. One study introduces MemCoE, a cognition-inspired framework for LLM agents to learn how to organize and update long-term user memory, enhancing personalization. Another paper, ReLay, explores personalized LLM-generated summaries, finding that while personalization improves comprehension, it also introduces risks of bias and hallucinations. Additionally, a new benchmark called ClassEval-Pro has been created to evaluate LLMs on class-level code generation, revealing significant performance gaps among current frontier models. AI

IMPACT Advances in LLM memory, personalization, and code generation benchmarks will drive further research and development in AI agents and software engineering.
RESEARCH · OpenAI News English(EN) · 122mo · [800 sources] · MASTOBLOGX

RL²: Fast reinforcement learning via slow reinforcement learning

OpenAI has published a series of research papers detailing advancements in reinforcement learning. These include achieving superhuman performance in Dota 2 with OpenAI Five, developing benchmarks for safe exploration in RL, and quantifying generalization capabilities with the CoinRun environment. The company also explored novel methods like prediction-based rewards for curiosity-driven exploration, learning policy representations in multiagent systems, and an experimental metalearning approach called Evolved Policy Gradients for faster training on new tasks. Further research addresses variance reduction in policy gradients and the equivalence between policy gradients and soft Q-learning, alongside challenging robotics environments for multi-goal RL. AI

IMPACT Demonstrates significant progress in RL capabilities, including superhuman performance, safety, generalization, and exploration, pushing the boundaries of AI.
RESEARCH · Practical AI English(EN) · 126mo · [20 sources] · BLOGREDDIT

Learning to learn deep learning 📖

Google AI has introduced Test-Time Diffusion Deep Researcher (TTD-DR), a novel framework that mimics human research processes by iteratively drafting and revising reports using retrieved information. This approach models report writing as a diffusion process, refining initial drafts through a denoising mechanism powered by search. OpenAI has also published several articles detailing techniques for training large neural networks, including data, pipeline, and tensor parallelism, as well as exploring the nonlinear computational properties of deep linear networks due to floating-point arithmetic. Additionally, OpenAI discussed infrastructure considerations for deep learning and a reparameterization technique called weight normalization to accelerate training. AI
RESEARCH · Eugene Yan English(EN) · 133mo · BLOG

DataScience SG Meetup - How we got top 3% in Kaggle

Eugene Yan shared insights from his experience placing in the top 3% of a Kaggle competition at a DataScience SG Meetup. The presentation covered various aspects of the competition, including evaluation metrics, feature engineering, machine learning techniques, and ensembling methods. The talk, held at SMU, drew a large audience interested in practical data science applications. AI

Pulse

Title: P9: Survey of Open-Source Machine Learning and Data Sciecne in [2024-10-03 Thu] a python library for bandit algorithms and off-policy evaluation 8) AIRI

How Prototyping Can Help You to Get Buy-In

Thoughts on Functional Programming in Scala Course (Coursera)

Generative models: exploration to deployment

RL²: Fast reinforcement learning via slow reinforcement learning

Learning to learn deep learning 📖

DataScience SG Meetup - How we got top 3% in Kaggle