Pulse

last 48h

[5/5] 89 sources

What AI is actually talking about — clusters surfacing on Bluesky, Reddit, HN, Mastodon and Lobsters, re-ranked to elevate originality and crush noise.

RESEARCH · X — SemiAnalysis · 13h · X

Cerebras — Faster Tokens Please

Cerebras Systems has announced a new wafer-scale engine designed to accelerate AI model training and inference. The company claims this new hardware significantly reduces the time required for processing tokens, a key metric in large language model performance. This advancement aims to address the growing computational demands of complex AI workloads. AI

IMPACT This new hardware could significantly speed up AI training and inference, potentially lowering costs and enabling more complex models.
RESEARCH · X — SemiAnalysis · 1d · X

THE MORE U BUY, THE MORE U SAVE: By ganging up multiple B200 8-GPU machines together over RoCEv2 CX-7 ethernet with Tomahawk switches with an inference optimiza

Nvidia's B200 GPUs are being deployed in large clusters, utilizing RoCEv2 Ethernet and Tomahawk switches for efficient inference. This setup allows for significant cost savings as more machines are added, indicating a trend towards scaled-out AI infrastructure. AI

IMPACT Highlights cost-saving strategies for large-scale AI inference deployments using advanced hardware.
RESEARCH · X — SemiAnalysis · 1d · [11 sources] · X

Here is where things have split:

Recent geopolitical events and trade tensions are reshaping the semiconductor supply chain, particularly for photoresist materials. Korean companies are increasing reliance on domestic suppliers and have secured waivers to import Russian naphtha, while Japanese firms, adhering to G7 restrictions, face disruptions due to their just-in-time inventory models. This divergence creates opportunities for Korean suppliers and risks for Japanese ones in the critical photoresist production process. AI

IMPACT Shifts in photoresist supply chain due to geopolitical tensions could impact semiconductor manufacturing capacity and costs.
RESEARCH · OpenAI News · 52mo · [289 sources] · MASTOBLOGX

RL²: Fast reinforcement learning via slow reinforcement learning

OpenAI has published a series of research papers detailing advancements in reinforcement learning (RL). These include achieving superhuman performance in the game Dota 2 using large-scale deep RL, developing benchmarks for safe exploration in RL environments, and quantifying generalization capabilities with a new environment called CoinRun. The research also explores novel methods like Random Network Distillation for curiosity-driven exploration, Evolved Policy Gradients for faster learning on new tasks, and variance reduction techniques for policy gradients. Additionally, OpenAI is investigating policy representations in multiagent systems and the theoretical equivalence between policy gradients and soft Q-learning. AI

IMPACT These advancements in reinforcement learning, particularly in generalization, safety, and exploration, could accelerate the development of more capable AI agents for complex real-world tasks.
RESEARCH · OpenAI News · 97mo · [739 sources] · HNLOBSTERSMASTOBLOGREDDITX

AI and compute

Anthropic conducted an experiment where Claude agents acted as digital barterers, successfully negotiating 186 deals totaling over $4,000. Participants found the deals fair, with nearly half expressing willingness to pay for such a service. The experiment highlighted that while model quality, such as Opus versus Haiku, significantly impacted deal outcomes, human participants did not perceive this difference. AI

IMPACT Demonstrates potential for AI agents in complex negotiation and commerce, suggesting future market viability.

Pulse

Cerebras — Faster Tokens Please

THE MORE U BUY, THE MORE U SAVE: By ganging up multiple B200 8-GPU machines together over RoCEv2 CX-7 ethernet with Tomahawk switches with an inference optimiza

Here is where things have split:

RL²: Fast reinforcement learning via slow reinforcement learning

AI and compute