Researchers have developed HASTE, a novel framework designed to compress large pre-trained convolutional neural networks (CNNs) without requiring additional training or data access. This plug-and-play module utilizes locality-sensitive hashing to dynamically merge redundant channels during inference, thereby reducing computational costs. Experiments on datasets like CIFAR-10 and ImageNet show significant reductions in FLOPs, such as a 46.2% decrease in ResNet34 with a minimal accuracy drop. AI
IMPACT Enables more efficient deployment of large CNNs on resource-constrained devices without retraining.
RANK_REASON The cluster describes a new research paper detailing a novel framework for CNN compression.
AI-generated summary · Google Gemini · from 2 sources. How we write summaries →