Brief

last 24h

[2/2] 221 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.
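The brief names four scoring signals but does not say how they are combined. As a minimal sketch only, one plausible reading is a weighted sum of the three content signals multiplied by an exponential time decay. The weights, the 48-hour half-life, and the function name `score_story` are all assumptions made for illustration, not the feed's actual formula.

```python
# Hypothetical weights: the brief lists the signals but not their weighting.
WEIGHTS = {"authority": 0.4, "cluster_strength": 0.3, "headline_signal": 0.3}
HALF_LIFE_HOURS = 48.0  # assumed: a story loses half its score every 48 hours


def score_story(authority, cluster_strength, headline_signal, age_hours):
    """Return a 0-100 score from three signals in [0, 1] and an age in hours."""
    signals = {
        "authority": authority,
        "cluster_strength": cluster_strength,
        "headline_signal": headline_signal,
    }
    for name, value in signals.items():
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must be in [0, 1], got {value}")
    if age_hours < 0:
        raise ValueError("age_hours must be non-negative")

    # Weighted sum of the content signals, then exponential decay with age.
    base = sum(WEIGHTS[name] * value for name, value in signals.items())
    decay = 0.5 ** (age_hours / HALF_LIFE_HOURS)
    return round(100.0 * base * decay, 1)


# A story with perfect signals, scored fresh and again two days later.
print(score_story(1.0, 1.0, 1.0, age_hours=0.0))
print(score_story(1.0, 1.0, 1.0, age_hours=48.0))
```

Multiplying by the decay term, rather than adding it as a fourth weighted signal, keeps an old story from outranking a fresh one on authority alone. That is a design choice of this sketch, not something the brief states.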

RESEARCH · arXiv cs.AI English (EN) · 4d · [4 sources]

Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention

NVIDIA has introduced Gated DeltaNet-2, a linear attention layer designed to improve memory editing in recurrent neural networks. The layer uses distinct channel-wise gates to separate erasing old information from writing new information, addressing a limitation of earlier delta-rule architectures. A 1.3-billion-parameter model trained on 100 billion tokens outperforms existing models such as Mamba-2 and KDA, particularly on long-context retrieval tasks.

IMPACT Enhances long-context processing in recurrent models, potentially improving performance on complex language tasks.
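To make the erase/write split concrete, here is a toy sketch of a delta-rule memory update with separate per-channel gates. It is not NVIDIA's formulation: the summary above does not give the equations, so the gate shapes, where the gates are applied, and the names `read` and `delta_update` are assumptions chosen for illustration. When both gates are set to the same scalar, the step reduces to the classic delta rule, in which one strength controls erasing and writing together.

```python
def read(state, key):
    """Retrieve the value currently stored under `key` (state @ key)."""
    return [sum(s * k for s, k in zip(row, key)) for row in state]


def delta_update(state, key, value, erase_gate, write_gate):
    """One recurrent step with independent erase and write gates.

    state: d_v x d_k matrix as a list of rows.
    key: length d_k, assumed to have unit norm.
    value, erase_gate, write_gate: length d_v, with gates in [0, 1].
    """
    old = read(state, key)  # what the memory currently returns for this key
    new_state = []
    for row, o, v, e, w in zip(state, old, value, erase_gate, write_gate):
        # Remove a gated share of the old value and add a gated share of the
        # new one, each with its own per-channel gate (assumed form).
        delta = w * v - e * o
        new_state.append([s + delta * k for s, k in zip(row, key)])
    return new_state


state = [[0.0, 0.0], [0.0, 0.0]]
key = [1.0, 0.0]

# Store [2, 3] under the key with both gates fully open.
state = delta_update(state, key, [2.0, 3.0], [1.0, 1.0], [1.0, 1.0])

# Overwrite channel 0 only; channel 1 has both gates closed and is untouched.
state = delta_update(state, key, [5.0, 9.0], [1.0, 0.0], [1.0, 0.0])
print(read(state, key))  # [5.0, 3.0]
```

Opening only the write gate on a channel would add the new value on top of the old one, and opening only the erase gate would clear it. A single shared gate cannot express either case.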
RESEARCH · Latent Space (swyx) English (EN) · 3d

[AINews] New AI Infra unicorns: Exa, Modal, TurboPuffer

Several AI infrastructure companies reported major funding and revenue milestones: Turbopuffer reached $100M ARR and profitability, Exa secured $250M in a Series C round valuing it at $2.2B, and Modal raised $355M at a $4.7B valuation. The AINews digest also highlighted advances in model research, including RAEv2 for unified vision understanding and generation, NVIDIA's Gated DeltaNet-2 for improved language modeling, and a study questioning the necessity of subword tokenization. Discussions also touched on mechanistic interpretability and the potential for AI to drive breakthroughs in mathematics research, with some skepticism about specific claims.

IMPACT Major funding rounds for AI infrastructure companies signal continued investment and growth in the sector, potentially accelerating development and deployment of AI technologies.
- OpenAI
- NVIDIA
- Modal
- Latent Space
- Exa
- Turbopuffer
- RAEv2
- Gated DeltaNet-2
