Researchers have developed Tessera, an architecture for securely streaming model weights to edge accelerators in Unified Memory Architecture (UMA) systems. It addresses the challenge of protecting proprietary deep neural networks on commodity devices by decrypting weights inline at cache-line granularity. Tessera intercepts memory bursts and decrypts them in parallel with DRAM fetches, streaming plaintext directly into isolated NPU SRAM with minimal bandwidth overhead.
Impact: Enhances security for deploying proprietary models on edge devices by enabling efficient, hardware-backed DRM.
Ranking rationale: Academic paper detailing a new architecture for secure weight streaming on edge accelerators.
AI-generated summary · Google Gemini · from 1 source.
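
The summary above does not say which cipher Tessera uses or how decryption is overlapped with the DRAM fetch. One common way to get that overlap is a counter-style scheme: the pad for a cache line depends only on the key and the line address, so it can be computed while the data is still in flight and then XORed in when the burst arrives. The Python sketch below illustrates that idea only. The 64-byte line size, the HMAC-SHA-256 pad (a stand-in for a hardware block cipher), and the names `line_pad`, `xor_line`, and `transform_region` are all assumptions for illustration, not details from the paper.

```python
import hashlib
import hmac

CACHE_LINE_BYTES = 64  # assumed line size; the summary does not state one


def line_pad(key: bytes, line_addr: int) -> bytes:
    """Derive a per-line pad from the key and line address only.

    HMAC-SHA-256 is a stand-in for a hardware cipher (assumption). Because the
    pad never depends on the data, it can be prepared during the DRAM fetch.
    """
    blocks = []
    for counter in range(CACHE_LINE_BYTES // 32):  # SHA-256 yields 32 bytes
        msg = line_addr.to_bytes(8, "little") + counter.to_bytes(1, "little")
        blocks.append(hmac.new(key, msg, hashlib.sha256).digest())
    return b"".join(blocks)


def xor_line(key: bytes, line_addr: int, line: bytes) -> bytes:
    """Encrypt or decrypt one cache line (XOR is its own inverse)."""
    if len(line) != CACHE_LINE_BYTES:
        raise ValueError("expected exactly one cache line of data")
    pad = line_pad(key, line_addr)
    return bytes(a ^ b for a, b in zip(line, pad))


def transform_region(key: bytes, base_addr: int, data: bytes) -> bytes:
    """Apply xor_line to every cache line of a line-aligned weight region."""
    if len(data) % CACHE_LINE_BYTES:
        raise ValueError("region length must be a multiple of the line size")
    out = bytearray()
    for off in range(0, len(data), CACHE_LINE_BYTES):
        out += xor_line(key, base_addr + off, data[off:off + CACHE_LINE_BYTES])
    return bytes(out)
```

Because each line is handled independently, a single line can be decrypted without touching its neighbours, which is what cache-line granularity requires. A real design would also need integrity protection and key management, which this sketch leaves out.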