Sigma-Branch: Hierarchical Single-Path Network Reconstruction for Dynamic Inference with Reduced Active Parameters
Researchers have introduced Sigma-Branch (SigmaB), a novel framework designed to optimize deep neural networks for memory-constrained edge devices. SigmaB restructures dense networks into a hierarchical tree with shared backbones, routers, and specialized leaves, enabling dynamic inference. This approach significantly reduces the number of active parameters per inference by executing only a single root-to-leaf path, thereby minimizing off-chip weight transfers without sacrificing overall model capacity. AI
IMPACT Reduces per-inference active parameters by up to 60%, enabling more efficient AI deployment on edge devices with limited memory.