ONNX
PulseAugur coverage of ONNX — every cluster mentioning ONNX across labs, papers, and developer communities, ranked by signal.
-
New steganography technique hides messages in ONNX model weights
A researcher has developed a novel steganography technique that embeds hidden messages within the least significant bits of fine-tuned ONNX model weights. This method aims to conceal data by modifying only the weights t…
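As a rough illustration of least-significant-bit embedding in model weights, the sketch below hides a byte string in the lowest mantissa bit of float32 values. It is not the researcher's implementation: the function names `embed_bits` and `extract_bits` are hypothetical, a plain NumPy array stands in for a weight tensor, and the choice of one bit per weight in float32 is an assumption.

```python
import numpy as np

def embed_bits(weights: np.ndarray, message: bytes) -> np.ndarray:
    """Return a float32 copy of `weights` carrying `message` in mantissa LSBs.

    Hypothetical helper: one message bit per weight, lowest bit only.
    """
    bits = np.unpackbits(np.frombuffer(message, dtype=np.uint8))
    flat = weights.astype(np.float32).ravel().copy()
    if bits.size > flat.size:
        raise ValueError("message too long for this tensor")
    raw = flat.view(np.uint32)  # reinterpret the float bits as integers
    # Clear the lowest bit of each carrier weight, then set it to the message bit.
    raw[:bits.size] = (raw[:bits.size] & np.uint32(0xFFFFFFFE)) | bits.astype(np.uint32)
    return raw.view(np.float32).reshape(weights.shape)

def extract_bits(weights: np.ndarray, n_bytes: int) -> bytes:
    """Read `n_bytes` back out of the lowest mantissa bits."""
    raw = weights.astype(np.float32).ravel().view(np.uint32)
    bits = (raw[: n_bytes * 8] & np.uint32(1)).astype(np.uint8)
    return np.packbits(bits).tobytes()

# Stand-in "weights": random values, not a real ONNX initializer.
rng = np.random.default_rng(0)
w = rng.standard_normal((4, 16)).astype(np.float32)
msg = b"hi ONNX"
stego = embed_bits(w, msg)
recovered = extract_bits(stego, len(msg))
```

Flipping only the lowest mantissa bit changes each weight by at most one unit in the last place, which is why this kind of change is hard to notice in the values themselves.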
-
Author builds hybrid search engine combining vector and keyword search
The author details their experience building a hybrid search engine as part of the LLM Zoomcamp 2026. They explain the fundamental differences between traditional keyword search and vector search, emphasizing that vecto…
-
Simon Willison ports Moebius image model to browser using Claude Code
Simon Willison detailed his side project of porting the Moebius image inpainting model to ONNX format. This conversion was performed using Claude Code, with the goal of enabling the model to run entirely within a web b…
-
Moebius image inpainting model ported to browser using Claude Code
Simon Willison successfully ported the Moebius 0.2B image inpainting model to run in a web browser using Claude Code. The process involved converting the model to ONNX format and leveraging WebGPU for browser-based exec…
-
Researcher seeks library to release new optimization algorithm
A researcher is seeking recommendations for the best library to release their newly developed QQN Quadratic Quasi-Newton optimization algorithm. They have existing implementations in Rust, Java, and JavaScript but want …
-
Humorous NFT minting device built on Raspberry Pi sells for $9.92
A digital entrepreneur has developed a humorous, portable device that mints NFTs using a Raspberry Pi. The contraption, whose model was trained on an M3 MacBook, can generate a 128-pixel square NFT in three seconds. The creator, identi…
-
GAN deployed on Raspberry Pi 4 creates physical NFT minting device
A user has successfully deployed a Generative Adversarial Network (GAN) on a Raspberry Pi 4, creating a physical device that mints non-fungible tokens (NFTs). The GAN, trained on a dataset of faces, generates hybrid fac…
-
New RAMS system adapts YOLOv8 tiers for edge AI perception
Researchers have developed RAMS, a novel runtime controller designed for embedded edge perception systems. RAMS dynamically switches between different tiers of YOLOv8 models based on real-time device resource monitoring…
-
Cursor users report integration issues with Moonshine Tiny voice model
A user on Reddit's r/cursor subreddit detailed a specific failure mode encountered while attempting to integrate a quantized Moonshine Tiny ONNX model into the Foursquare voice service. The user outlined the correct fil…
-
Offline voice loop for local AI models runs entirely on CPU
A developer has created a fully offline voice interaction loop for local AI models, utilizing only CPU resources and ensuring all data remains on the user's machine. This system integrates Silero VAD for voice activity …
-
AI models converted to ONNX for real-time DJ software use
Anmol Mishra presented a method for converting source separation AI models into the ONNX format. This conversion enables real-time application within DJ software. The presentation, part of the Audio Developer Conference…
-
Sixfab launches AI HAT+ for Raspberry Pi 5, offering 25 TOPS
Sixfab has released the AI HAT+, an NPU accelerator for the Raspberry Pi 5, priced around $100. This HAT connects via PCIe and GPIO, with a simple setup of about 15 minutes in which the NPU is recognized automatically. It su…
-
Run PyTorch and ONNX models on Triton Inference Server without GPU
This article details how to run both PyTorch and ONNX models simultaneously on a single inference server using NVIDIA's Triton Inference Server. The process is demonstrated on a local Mac environment without requiring a…
-
SynaptoRoute optimizes AI agent routing with local semantic processing
A new semantic routing system called SynaptoRoute has been developed to address bottlenecks in LLM-based routing for AI agents. This system aims to reduce latency and token costs by performing routing locally using vect…
-
AI source separation models converted for real-time DJ software use
Anmol Mishra presented a method for converting source separation AI models into the ONNX format. This conversion enables real-time application of these models within DJ software. The presentation, part of the Audio Deve…
-
ONNX framework speeds up Sentence-BERT inference
This article explores how the ONNX framework can accelerate inference times for Sentence-BERT (SBERT) models, which are commonly used for generating sentence embeddings. The author demonstrates this by converting the `a…
-
Supertone ships Supertonic 3 TTS with 31 languages, inline expressiveness
Supertone has launched Supertonic 3, an updated on-device text-to-speech model that now supports 31 languages, a significant increase from its previous five. This new version boasts improved accuracy, reduced errors lik…
-
IBM releases open multilingual embedding models with 32K context on Hugging Face
IBM has released Granite Embedding Multilingual R2, a suite of open-source multilingual embedding models, on Hugging Face. The release includes a 97M-parameter compact model that leads in retrieval quality among open models under…
-
ONNX format powers Microsoft AI, enabling cross-framework compatibility
The article discusses the often-overlooked AI format known as ONNX (Open Neural Network Exchange). ONNX serves as a crucial intermediary, enabling interoperability between different AI frameworks and hardware accelerato…
-
Source Separation Models Converted to ONNX for DJ Software
Anmol Mishra presented on converting source separation models to ONNX format for real-time use in DJ software. The presentation, shared via YouTube links across various Mastodon instances, focused on the technical aspec…