Brief

last 24h

[3/3] 221 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

RESEARCH · TLDR AI English(EN) · 1w

Qwen 3.7 🤖, Cursor Composer 2.5 👨‍💻, Anthropic acquires Stainless 🛠️

Qwen has released version 3.7 of its language model, which features a specific circuit for political censorship that can be modified without losing factual knowledge. NVIDIA's Cosmos Predict 2.5 model can now be fine-tuned for robot video generation using efficient LoRA/DoRA methods. Additionally, the new HRM-Text model offers a more accessible and cost-effective approach to pre-training foundation models. AI

IMPACT New model releases and fine-tuning techniques offer improved control and accessibility for AI development.
- Anthropic
- NVIDIA
- xAI
- Langchain
- Grok
- Qwen
- LoRA
- DoRA
- Cosmos Predict 2.5
- Qwen 3.7
- HRM-Text
TOOL · arXiv cs.CL English(EN) · 6d

HRM-Text: Efficient Pretraining Beyond Scaling

Researchers have developed HRM-Text, a novel Hierarchical Recurrent Model that significantly reduces the computational resources and training data required for pretraining large language models. By decoupling computation into strategic and execution layers and training exclusively on instruction-response pairs, a 1B-parameter model achieved competitive performance on several benchmarks with a fraction of the tokens and compute used by standard models. This approach makes foundational LLM research more accessible by lowering the barrier to entry for pretraining from scratch. AI

IMPACT Enables more researchers to train foundational models from scratch, potentially accelerating innovation.
RESEARCH · Transformers — Releases English(EN) · 1mo · [10 sources]

Patch release: v5.5.2

Hugging Face's `transformers` library has seen a series of releases and patches, introducing new models and fixing various bugs. Notably, version 5.9.0 added Cohere's Command A+ (Cohere2Moe) and HRM-Text, while also improving audio support and generation capabilities. Earlier releases, such as v5.8.0, integrated models like DeepSeek-V4, Gemma 4 Assistant, GraniteSpeechPlus, Granite4Vision, EXAONE 4.5, and PP-FormulaNet. Several patch releases have addressed specific issues, including problems with DeepSeek V4 integration, flash attention, Qwen MoE models with FP8, and Gemma4 device map support. AI

IMPACT New model integrations and bug fixes in a widely used library accelerate research and development across the AI ecosystem.

Brief

Qwen 3.7 🤖, Cursor Composer 2.5 👨‍💻, Anthropic acquires Stainless 🛠️

HRM-Text: Efficient Pretraining Beyond Scaling

Patch release: v5.5.2