AVX2
PulseAugur coverage of AVX2 — every cluster mentioning AVX2 across labs, papers, and developer communities, ranked by signal.
-
StateSMix compressor uses Mamba SSMs and n-grams for online lossless compression
Researchers have developed StateSMix, a novel lossless compression algorithm that utilizes Mamba-style State Space Models (SSMs) combined with sparse n-gram context mixing. This system trains token-by-token on the data …
-
llama.cpp and ik_llama.cpp add FP4 inference support for VRAM savings
The llama.cpp and ik_llama.cpp projects have both integrated support for FP4 (4-bit floating-point) inference, a significant advancement for model quantization. llama.cpp now includes NVFP4, an Nvidia-specific format, w…
-
PHP-ORT brings machine learning inference to PHP developers
A new infrastructure project called PHP-ORT aims to bring machine learning inference capabilities directly to PHP, the server-side language used by a significant portion of the web. This development seeks to empower mil…