Brief

last 24h

[7/7] 221 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · r/LocalLLaMA English(EN) · 7h

(Yet Another) KV cache calculator - kvanta.vcerny.cz

KVANTA is a new web-based tool for calculating KV cache sizes for large language models. The developer built it after finding existing calculators inadequate. It is designed to support any model available on Hugging Face and is open source under the Apache 2.0 license.

IMPACT Gives users running local LLMs a new utility for estimating KV cache memory requirements.
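
For context, the quantity such a calculator estimates can be sketched in a few lines. This is a minimal sketch assuming standard multi-head or grouped-query attention, where each layer caches one key and one value vector per token per KV head. It is not KVANTA's code: the function name, its parameters, and the example configuration are illustrative assumptions, not taken from the tool.

```python
def kv_cache_bytes(num_layers, num_kv_heads, head_dim, seq_len,
                   batch_size=1, bytes_per_element=2):
    """Estimate KV cache size in bytes for standard (MHA/GQA) attention.

    Assumes every layer caches keys and values for every token; models with
    sliding-window, linear, or latent attention need a different formula.
    """
    # Factor of 2: one tensor for keys, one for values.
    return (2 * num_layers * num_kv_heads * head_dim
            * seq_len * batch_size * bytes_per_element)


# Hypothetical configuration (not a specific model): 32 layers, 8 KV heads
# of dimension 128, an 8192-token context, and 16-bit cache entries.
size = kv_cache_bytes(num_layers=32, num_kv_heads=8, head_dim=128, seq_len=8192)
print(f"{size / 2**30:.2f} GiB")
```

Under these assumptions the cache grows linearly with context length and batch size, which is why long contexts can dominate memory use even when the weights fit comfortably.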

TOOL · dev.to — LLM tag English(EN) · 3d

Qwen3-Coder-Next: 80B total, 3B active, 70.6 on SWE-Bench

Alibaba's Qwen3-Coder-Next, an 80-billion-parameter model with 3 billion active parameters, scored 70.6 on the SWE-Bench Verified benchmark. The result is notable because it rivals top closed-source models while the weights are downloadable under the Apache 2.0 license. The model uses a sparse Mixture-of-Experts architecture and a hybrid attention mechanism that combines linear attention for long contexts with standard attention for global context reconstruction.

IMPACT Sets a new SOTA for open-source coding models on SWE-Bench, making advanced coding assistance more accessible.
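
The headline figures imply a gap between what must be stored and what is computed per token. As a rough sketch: in a sparse Mixture-of-Experts model all parameters have to be held in memory, while only the active ones take part in each token's forward pass. The helper below is illustrative (its name and the precision choices are assumptions), it counts weights only, and the figure of 2 FLOPs per active parameter is a common rule of thumb rather than a measurement for this model.

```python
def moe_footprint(total_params, active_params, bits_per_weight=16):
    """Rough weight-memory and per-token compute estimate for a sparse MoE."""
    weight_gb = total_params * bits_per_weight / 8 / 1e9   # decimal gigabytes
    active_fraction = active_params / total_params
    # Rule of thumb: about 2 FLOPs (multiply + add) per active weight per token.
    flops_per_token = 2 * active_params
    return weight_gb, active_fraction, flops_per_token


# Figures from the headline: 80B total, 3B active.
for bits in (16, 8, 4):
    gb, frac, flops = moe_footprint(80e9, 3e9, bits_per_weight=bits)
    print(f"{bits}-bit weights: ~{gb:.0f} GB, "
          f"{frac:.2%} of parameters active, ~{flops:.0e} FLOPs/token")
```

At 16-bit precision the weights alone come to roughly 160 GB, while only 3.75% of the parameters are active for any given token.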

SIGNIFICANT · Towards AI English(EN) · 4d

Qwen 3.6 Reviewed: The Open-Weight Coder That Just Crashed the Frontier Party

Alibaba's Qwen 3.6 model family, particularly the 27B dense variant, has demonstrated performance competitive with frontier models such as GPT-5.4 and Claude 4.6 on coding tasks. The open-weight model runs on consumer hardware with a modest GPU and has drawn considerable attention in the AI community for its accessibility and capability. The lineup includes several variants; the 27B model is released under the Apache 2.0 license, which permits broad commercial use.

IMPACT Accelerates the trend of powerful open-weight models running on consumer hardware, challenging frontier API dominance for coding tasks.

TOOL · Mastodon — fosstodon.org 한국어(KO) · 5d

opendataloader-pdf is an open-source PDF parser that extracts Markdown/JSON (bounding box) and HTML, and handles complex tables, formulas, and scanned documents with hybrid AI mode and built-in OCR (80+ languages). It mass-generates Tagged PDFs for screen readers with automatic tagging (Apache-2.0).

Sayzard has released opendataloader-pdf, an open-source tool designed to parse PDF documents. It can extract content into Markdown, JSON with bounding boxes, and HTML formats. The tool incorporates a hybrid AI mode and built-in OCR supporting over 80 languages, enabling it to handle complex tables, mathematical formulas, and scanned documents.

IMPACT Enables extraction of complex data from PDFs, potentially improving AI data ingestion pipelines.

RESEARCH · r/MachineLearning English(EN) · 3d · [2 sources]

NuExtract3 released: open-weight 4B VLM for Markdown, OCR and structured extraction (self-hostable) [P]

Numind has released NuExtract3, an open-weight 4B visual language model designed for extracting information from complex documents. Built on Qwen3.5-4B and licensed under Apache-2.0, this model can convert document images to Markdown, extract structured data into JSON templates, and handle various visual inputs. It is designed to be self-hostable with minimal VRAM requirements and offers multiple weight formats for broad compatibility.

IMPACT Provides a self-hostable, open-weight alternative for document information extraction tasks.

SIGNIFICANT · Mastodon — mastodon.social 日本語(JA) · 5d · [6 sources]

Cohere releases Command A+, an MoE multimodal AI built for agent tasks, a high-performance open-source model for enterprises that can be deployed in their own environments https://fed.brid.gy/r/https://gigazine.net/news/20260522-cohere-command-a-p

Cohere has released Command A+, an open-source, multimodal AI model designed for enterprise use and agentic tasks. This new model integrates reasoning, vision, and multilingual capabilities, supporting 48 languages and offering significant improvements in speed and efficiency over previous versions. Command A+ is available on Hugging Face with various quantization options, including W4A4, which drastically reduces serving footprint with minimal performance loss, making it suitable for on-premises deployment.

IMPACT Accelerates enterprise adoption of advanced AI agents by providing a powerful, efficient, and customizable open-source model.

TOOL · Hugging Face Trending Models Deutsch(DE) · 6d

zhifeixie/Mega-ASR

Researchers have developed Mega-ASR, an automatic speech recognition system designed to perform robustly in challenging real-world audio conditions. It uses a Qwen3-ASR-1.7B backbone and an audio quality router that switches between a robust recognition path and a standard path. The goal is to maintain high accuracy on clean speech while significantly improving performance on degraded audio, such as audio with heavy noise or reverberation.

IMPACT Enhances speech-to-text capabilities in challenging real-world scenarios, potentially improving accessibility and usability of voice interfaces.
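
The routing idea can be illustrated with a toy example. This is only a sketch of the general pattern (score the input, then pick a path). The summary does not say how Mega-ASR's router scores audio, so the quality scorer, the 0.5 threshold, and both recognizer stand-ins below are hypothetical placeholders.

```python
def make_router(score_quality, standard, robust, threshold=0.5):
    """Build a transcribe function that picks a recognition path per clip.

    score_quality: callable returning a quality score in [0, 1] (assumed).
    standard / robust: callables that turn a clip into text.
    """
    def transcribe(clip):
        # Clean audio goes to the standard path, degraded audio to the robust one.
        path = standard if score_quality(clip) >= threshold else robust
        return path(clip)
    return transcribe


# Stand-ins so the sketch runs; real components would be models.
router = make_router(
    score_quality=lambda clip: clip["quality"],  # placeholder for a learned estimator
    standard=lambda clip: "standard path",
    robust=lambda clip: "robust path",
)
print(router({"quality": 0.9}))  # a clean clip
print(router({"quality": 0.2}))  # a degraded clip
```

Keeping the router separate from both recognizers means clean speech never passes through the robust path, which matches the stated goal of preserving accuracy on clean audio.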
