ONNX Runtime
PulseAugur coverage of ONNX Runtime — every cluster mentioning ONNX Runtime across labs, papers, and developer communities, ranked by signal.
7 day(s) with sentiment data
-
Kuma project compiles PyTorch models for browser execution via WebGPU
A new project called Kuma aims to compile PyTorch models into self-contained WebGPU executables. This approach would allow models to run directly in the browser without needing Python or a server-side runtime. The proje…
-
PaddleOCR releases PP-OCRv6 with 50-language support on Hugging Face
PaddleOCR has released PP-OCRv6, an updated suite of universal OCR models available on Hugging Face. This new generation offers improved text detection and recognition accuracy, with models ranging from 1.5 million to 3…
-
Microsoft Presidio gains traction on GitHub with new features
Microsoft Presidio, an open-source tool for detecting and protecting sensitive information, has gained significant traction on GitHub, surpassing 9,390 stars. Recent updates to the project include the addition of an ONN…
-
AI LOD framework optimizes game animation with distance-aware model precision
Researchers have introduced a novel framework called AI Level of Detail (AI LOD) to optimize real-time human motion prediction in games. This approach dynamically adjusts the precision of machine learning models based o…
-
ONNX Runtime outperforms HF Transformers in CPU-only speech benchmark
A benchmark comparing ONNX Runtime, Hugging Face Transformers, and GGUF for the Parakeet TDT 0.6B model on CPU-only hardware revealed that ONNX Runtime achieved a 37% faster inference time than Hugging Face Transformers…
-
Python project enables local, GPU-accelerated AI background removal
A new open-source project, bg-vanish-mcp, has been released to enable AI assistants to perform background removal on images locally. This tool leverages Python, the DirectML API via ONNX Runtime, and U2NET models for GP…
-
Meta's EnCodec gets portable C++ implementation
A C++ implementation of Meta's EnCodec audio codec has been developed, aiming for portability and high performance without external machine learning runtimes. This project, available on GitHub, compiles model weights di…
-
Browser-based real-time voice changer released as MVP
A developer has created a real-time voice changer that operates entirely within a web browser. This tool leverages WebAssembly, ONNX Runtime, and WebGPU for its functionality. The creator has released it as a minimum vi…
-
New framework tackles industrial Edge AI deployment challenges
This paper introduces a new systems framework designed to improve the deployment of Edge AI applications on industrial embedded platforms. It argues that treating AI deployment as a systems problem, rather than just a m…
-
New framework optimizes LLM inference energy use on multi-GPU systems
Researchers have developed EnergyLens, a framework designed to optimize the energy consumption of large language models (LLMs) during inference on multi-GPU systems. This tool addresses the challenge of predicting and r…
-
Supertone releases Supertonic 3 TTS with 31 languages
Supertone has released Supertonic 3, an on-device text-to-speech model that now supports 31 languages, a significant expansion from its previous 5-language offering. This updated version boasts improved reading stabilit…
-
Aycromo platform uses deep learning for rapid chromosome detection
Researchers have developed Aycromo, an open-source desktop platform designed to assist in cytogenetic analysis for diagnosing genetic diseases. This platform leverages deep learning models, specifically YOLOv11, to auto…