Horizon 2020
PulseAugur coverage of Horizon 2020 — every cluster mentioning Horizon 2020 across labs, papers, and developer communities, ranked by signal.
No coverage in the last 90 days.
1 day with sentiment data
-
LoKA framework enables low-precision FP8 for large recommendation models
Researchers have developed LoKA, a framework designed to make low-precision arithmetic, specifically FP8, practical for large recommendation models (LRMs). Unlike previous attempts that often degraded model quality, LoK…
-
Superhuman and Databricks build 200K QPS AI inference platform
Superhuman and Databricks engineers collaborated to build a high-throughput inference platform capable of handling over 200,000 queries per second. This joint effort modernized Superhuman's serving stack, migrating from…
-
LLM Study Diary #3: PyTorch tensors, float types, and training infrastructure
This LLM study diary entry focuses on PyTorch fundamentals for training large language models. It details tensor basics, exploring various floating-point data types like FP32, BF16, and FP8 for efficiency and stability…
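The dtype trade-off the diary walks through is easy to see directly. A minimal sketch, assuming PyTorch 2.1+ (where the float8_e4m3fn dtype landed); the tensor and shapes are illustrative:

```python
import torch

# Cast the same values across precisions and compare each format's range,
# epsilon, and round-trip error against the FP32 original.
x = torch.randn(1024, dtype=torch.float32)

for dtype in (torch.float32, torch.bfloat16, torch.float8_e4m3fn):
    y = x.to(dtype)
    info = torch.finfo(dtype)
    err = (y.float() - x).abs().max().item()
    print(f"{str(dtype):24} bits={info.bits:2d} eps={info.eps:.1e} "
          f"max={info.max:.1e} roundtrip_err={err:.1e}")
```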
-
Chinese chipmakers adopt DeepSeek's V4 AI model, boosting domestic hardware
Chinese technology firms, including Huawei and Cambricon, are rapidly adopting DeepSeek's new V4 AI model. This integration is happening across various hardware architectures within China, driven partly by geopolitical …
-
SnapMLA paper details hardware-aware FP8 quantized pipelining for efficient long-context MLA decoding
Researchers have developed SnapMLA, a new framework designed to enhance the efficiency of long-context decoding in Multi-head Latent Attention (MLA) architectures. This approach utilizes hardware-aware FP8 quantization …
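The summary doesn't spell out SnapMLA's quantization recipe, so as orientation only: a common per-tensor FP8 baseline (an assumption here, not necessarily the paper's scheme) scales by the tensor's absolute maximum so values fit the e4m3 range:

```python
import torch

F8 = torch.float8_e4m3fn
F8_MAX = torch.finfo(F8).max  # 448.0 for e4m3

def quantize_fp8(t: torch.Tensor):
    # One fp32 scale per tensor, chosen so the largest value maps to F8_MAX.
    scale = t.abs().amax().clamp(min=1e-12) / F8_MAX
    return (t / scale).to(F8), scale

def dequantize_fp8(q: torch.Tensor, scale: torch.Tensor) -> torch.Tensor:
    return q.to(torch.float32) * scale

kv = torch.randn(8, 64)  # hypothetical stand-in for a latent KV block
q, s = quantize_fp8(kv)
print((dequantize_fp8(q, s) - kv).abs().max().item())  # quantization error
```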
-
NVIDIA launches Nemotron 3 Nano Omni, unifying multimodal AI for efficiency
NVIDIA has released Nemotron 3 Nano Omni, an open multimodal model capable of processing text, images, audio, and video. This model aims to unify these modalities into a single architecture, improving efficiency and ena…
-
No Jensen, Not All Compute is Created Equal
Nvidia CEO Jensen Huang suggested China could overcome restrictions on advanced chips by using larger numbers of less advanced chips for AI training. However, this perspective overlooks the critical differences in chip capabili…
-
TACO framework boosts LLM training throughput by 1.87X with tensor compression
Researchers have introduced TACO, a novel framework designed to enhance the efficiency of training large-scale tensor-parallel Large Language Models (LLMs). TACO addresses communication overhead by employing an FP8-base…
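The summary is cut off before the scheme's details, but the bandwidth arithmetic motivating FP8 compression of tensor-parallel traffic is straightforward. A back-of-the-envelope sketch with hypothetical sizes (not TACO's reported configuration):

```python
# Bytes moved per layer when activations/gradients cross GPUs in a
# tensor-parallel step: FP32 vs FP8 plus one fp32 scale per tensor.
batch, seq, hidden = 8, 4096, 8192
elements = batch * seq * hidden

fp32_bytes = elements * 4
fp8_bytes = elements * 1 + 4  # payload + per-tensor scale

print(f"fp32: {fp32_bytes / 2**20:.0f} MiB, "
      f"fp8: {fp8_bytes / 2**20:.0f} MiB, "
      f"ratio: {fp32_bytes / fp8_bytes:.1f}x")  # ~4x less on the wire
```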
-
Qwen3.6-35B quantization tests show FP8 quality worse than INT8; poster calls NVFP4 'a lie'
A user on Reddit's LocalLLaMA community shared findings on the Qwen3.6-35B model, focusing on Kullback-Leibler divergence (KLD) metrics for different quantization formats like INT8, FP8, and NVFP4. The analysis, conduct…
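For readers unfamiliar with the metric: KLD here measures how far a quantized model's next-token distribution drifts from the full-precision reference, averaged over positions. A minimal sketch of how such a comparison is typically computed (the poster's actual harness isn't shown; the shapes and noise stand-in are made up):

```python
import torch
import torch.nn.functional as F

def mean_kld(logits_ref: torch.Tensor, logits_q: torch.Tensor) -> float:
    # Mean KL(P_ref || P_quant) over token positions, from raw logits.
    log_p = F.log_softmax(logits_ref.float(), dim=-1)
    log_q = F.log_softmax(logits_q.float(), dim=-1)
    return F.kl_div(log_q, log_p, reduction="batchmean", log_target=True).item()

ref = torch.randn(32, 50_000)               # [positions, vocab], reference model
quant = ref + 0.05 * torch.randn_like(ref)  # stand-in for quantized-model logits
print(mean_kld(ref, quant))
```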
-
AI safety research proposes formal framework for computational substrates
This series of posts explores the concept of 'substrates' in AI: the computational context layers needed to implement AI systems. The authors argue that current AI safety research lacks a clear fr…
-
DeepSeek V4 models offer high performance with reduced inference costs and NPU support
DeepSeek has released its V4 family of open-weight large language models, featuring a 1.6 trillion parameter model and a smaller 284 billion parameter Flash MoE model. The new models reportedly rival top proprietary LLM…
-
SpikingBrain2.0 model offers efficient long-context and cross-platform AI inference
Researchers have introduced SpikingBrain2.0 (SpB2.0), a 5 billion parameter model designed for efficient long-context processing and cross-platform inference. The model features a novel Dual-Space Sparse Attention mecha…