PulseAugur

WebGPU

PulseAugur coverage of WebGPU — every cluster mentioning WebGPU across labs, papers, and developer communities, ranked by signal.

Total · 30d: 25 (25 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 4 (4 over 90d)
SENTIMENT · 30D

7 days with sentiment data

RECENT · PAGE 1/2 · 25 TOTAL
  1. TOOL · CL_112417 ·

    Kuma project compiles PyTorch models for browser execution via WebGPU

    A new project called Kuma aims to compile PyTorch models into self-contained WebGPU executables. This approach would allow models to run directly in the browser without needing Python or a server-side runtime. The proje…

  2. TOOL · CL_110863 ·

    230M LFM2.5 model runs in-browser at 1,400 tokens/sec

    A 230-million-parameter model, LFM2.5, now runs in a web browser at 1,400 tokens per second. This performance is achieved through custom WebGPU kernels, developed by individuals previously as…

  3. TOOL · CL_110104 ·

    SDXL image generation runs locally in browser via WebGPU

    An open-source browser extension runs the SDXL image generation model locally using WebGPU. This extension allows users to install and run the model directly within their browser without complex set…

  4. TOOL · CL_103941 ·

    Moebius image inpainting model ported to browser using Claude Code

    Simon Willison successfully ported the Moebius 0.2B image inpainting model to run in a web browser using Claude Code. The process involved converting the model to ONNX format and leveraging WebGPU for browser-based exec…

  5. RESEARCH · CL_103994 ·

    New AI frameworks enable safer multi-agent systems and browser-based latent collaboration

    Researchers have developed a new framework for hierarchical multi-agent reinforcement learning that enforces safety constraints while maintaining efficiency. This approach uses a constraint manifold at a low level to en…

  6. TOOL · CL_97443 ·

    Gemma 4-E2B runs in-browser at 255 tok/s with WebGPU kernels

    A demo and WebGPU kernels for Gemma 4-E2B have been released, enabling in-browser operation at approximately 255 tokens per second. The optimization was reportedly aided by Fable 5 before its shutdown. The release inclu…

  7. TOOL · CL_88761 ·

    Browser-based AI controls virtual hand in physics sandbox

    A new AI sandbox called Semantic Hand allows users to control a virtual hand in a browser environment using natural language prompts. The system leverages local AI models like Nemotron 3 Nano 4B or Qwen 3.5-4B, running …

  8. TOOL · CL_71883 ·

    Chrome extension enables local Stable Diffusion 1.5 image generation

    A new Chrome extension allows users to run Stable Diffusion 1.5 locally within their browser using WebGPU. This tool requires Chrome 113+ and approximately 4-6 GB of RAM, with an initial download of about 2.1 GB for the…

  9. TOOL · CL_66475 ·

    Browser-based real-time voice changer released as MVP

    A developer has created a real-time voice changer that runs entirely within a web browser, using WebAssembly, ONNX Runtime, and WebGPU. The creator has released it as a minimum vi…

  10. TOOL · CL_53185 ·

    Chrome proposes on-device Embedding API for web apps

    Google Chrome is proposing a new Embedding API that would allow web developers to generate vector embeddings for content directly on the user's device. This API aims to leverage on-device AI infrastructure to enable fea…

  11. TOOL · CL_53214 ·

    Ollama v0.30.0, Qwen3.5 35B, and 1-bit AI on WebGPU

    Ollama's v0.30.0 pre-release is set to improve llama.cpp interoperability. Separately, a new Qwen3.5 35B model is available in GGUF and GPTQ formats, optimized for local inference on consumer GPUs. Additionally, PrismML…

  12. RESEARCH · CL_53000 ·

    PrismML releases compact 1-bit/ternary image diffusion models

    PrismML has released new text-to-image diffusion transformer models, Binary and Ternary Bonsai Image 4B. These models are significantly smaller than previous versions, with a size of approximately 3 GB, and can run local…

  13. TOOL · CL_52547 ·

    wllama brings GGUF LLMs to browser via WebAssembly and WebGPU

    A new tool called wllama enables users to run GGUF large language models directly within their web browser. Leveraging WebAssembly and WebGPU, wllama bypasses typical browser limitations like the 4GB memory constraint a…

  14. TOOL · CL_41024 ·

    WebLLM brings AI models to browsers via WebGPU

    WebLLM is a new project that enables large language models to run directly within web browsers using WebGPU for hardware acceleration. This client-side execution enhances user privacy and reduces server costs by keeping…

  15. TOOL · CL_40924 ·

    Browser-native GPU architecture enables MRI digital twins

    Researchers have developed a new browser-native GPU architecture for creating interactive MRI digital twins. This decentralized approach bypasses traditional server-side rendering, executing complex 3D simulations direc…

  16. TOOL · CL_25390 ·

    ProxyFace adds local, emotional avatars to AI chats

    ProxyFace is an open-source project that adds a local, expressive avatar to AI interactions. It utilizes a small, on-device emotion model and eye-tracking to make the avatar react to AI output and the user's gaze. The p…

  17. TOOL · CL_15669 ·

    Researchers create high-fidelity mobile avatars using local blendshapes

    Researchers have developed a new method for creating high-fidelity human avatars that can operate on mobile devices. This approach utilizes local linear blendshapes within specific body parts to capture complex appearan…

  18. RESEARCH · CL_15141 ·

    Run LLMs locally with LFM 2 and Transformers.js, using WebGPU

    Thomas Bley has released new slides detailing how to run large language models (LLMs) locally using LFM 2. The presentation also covers using Transformers.js with WebGPU for privacy filters, function calling, and embedd…

  19. RESEARCH · CL_13698 ·

    Born ML framework achieves pure Go GPU acceleration with v0.8.0 update

    Born ML has released version 0.8.0, transitioning its GPU backend from Rust's wgpu-native to a pure Go implementation called gogpu/wgpu. This change eliminates the need for external runtime dependencies like .dll or .so…

  20. RESEARCH · CL_08477 ·

    Nvidia's Nemotron 3 Nano Omni and Llama.cpp enable local LLM execution

    Thomas Bley has released new presentation slides detailing how to run large language models locally. The slides cover Nvidia's Nemotron 3 Nano Omni, built-in tools for Llama.cpp, and the use of Transformers.js with WebG…