PulseAugur / Brief
EN
LIVE 01:46:07

Brief

last 24h
[1/1] 224 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

  1. mudler/Qwen3.6-35B-A3B-Claude-4.7-Opus-Reasoning-Distilled-APEX-MTP-GGUF just released !

    A new quantized model, Qwen3.6-35B-A3B-Claude-4.7-Opus-Reasoning-Distilled-APEX-MTP-GGUF, has been released by mudler. This model is based on the APEX (Adaptive Precision for Expert Models) quantization technique and includes a multi-token prediction (MTP) head for self-speculative decoding. The MTP head is bundled directly into the GGUF file, simplifying its use with recent versions of llama.cpp. AI

    IMPACT Enables local execution of advanced reasoning models with speculative decoding.