CIFAR-10
PulseAugur coverage of CIFAR-10 — every cluster mentioning CIFAR-10 across labs, papers, and developer communities, ranked by signal.
- instance of CIFAR-100 70%
- instance of Tiny-ImageNet 70%
- used by federated learning 70%
- instance of residual neural network 70%
- instance of Fashion-MNIST 70%
- used by SGD 70%
- used by residual neural network 70%
- instance of ImageNet ILSVRC-2012 70%
- instance of ImageNet-100 70%
- competes with AdamW 70%
- used by Imagenette 70%
- instance of differential privacy 70%
19 day(s) with sentiment data
-
CurvSSL framework enhances self-supervised learning with manifold geometry
Researchers have introduced CurvSSL, a novel self-supervised learning framework that incorporates local manifold geometry into its training process. This method augments standard SSL techniques by adding a curvature-bas…
-
New StAD method speeds up generative model likelihood calculations
Researchers have developed a new method called StAD to improve the speed and accuracy of likelihood calculations in diffusion and flow-based generative models. This technique bypasses the need to compute the Jacobian of…
-
New watermarking embeds signals in generative model dynamics
Researchers have developed a novel watermarking technique for generative models that embeds signals directly into the learned continuous dynamics, specifically the velocity field of flow matching models. This method for…
-
New research advances federated learning with proactive client selection and privacy analysis
Researchers are exploring new methods to improve federated learning, a technique for training models across decentralized data sources while preserving privacy. One approach, "Choose Wisely and Privately," uses mutual i…
-
XOResNet advances deep spiking neural networks with novel residual learning
Researchers have developed XOResNet, a novel architecture for deep spiking neural networks (SNNs) that improves learning and representation capabilities. The design incorporates an OR-ADD shortcut connection to better m…
-
Zebrafish microcircuits inspire energy-efficient and robust AI
Researchers have developed a new method to attribute specific computational functions to microcircuits within biological neural networks, using the zebrafish tectal microcircuit as a model. By analyzing signal propagati…
-
New OUIDecay method adapts CNN regularization layer-by-layer
Researchers have introduced OUIDecay, a novel adaptive weight decay method for convolutional neural networks. This technique dynamically adjusts regularization strength for each layer based on online activation patterns…
-
Muon optimizer fails on convex Lipschitz functions, study finds
A new paper challenges the theoretical underpinnings of the Muon optimization algorithm, demonstrating that it does not converge on convex Lipschitz functions. The research suggests that Muon's practical success likely …
-
New framework corrects target shift in online learning systems
Researchers have developed a new framework to analyze and improve online learning systems that encounter distributional shifts. Their work, focusing on kernel regression, reveals that online learning effectively uses sh…
-
OrScale optimization method improves neural network training
Researchers have introduced OrScale, a novel optimization technique designed to enhance neural network training. OrScale builds upon the Muon method by incorporating layer-wise trust-ratio scaling, which measures the Fr…
-
Optical networks achieve superior image denoising via pre-training
Researchers have developed a novel pre-training method for all-optical image denoising using diffractive networks. This approach involves an initial training phase with a large dataset of 3.45 million images, followed b…
-
Spectral Surgery method rebalances deep network accuracy post-hoc
Researchers have developed a new post-hoc optimization method called Spectral Surgery to improve deep network classification performance. This technique directly perturbs model weights along specific "spike eigenvectors…
-
New STMD method speeds diffusion model inference without teacher
Researchers have developed Stochastic Transition-Map Distillation (STMD), a novel framework designed to accelerate the inference process for diffusion models without requiring a pre-trained teacher model. This method di…
-
New SWAP-Score metric evaluates neural networks without training
Researchers have introduced SWAP-Score, a novel zero-shot metric designed to evaluate neural networks without requiring training. This method measures a network's expressivity using sample-wise activation patterns and d…
-
GONO optimizer adapts Adam's momentum using directional consistency for better convergence
Researchers have introduced the GONO framework, an optimization signal designed to improve deep learning training by addressing the decoupling of directional alignment and loss convergence. Unlike existing optimizers th…
-
New research details efficient data reconstruction techniques for neural networks
Researchers have developed new techniques for data reconstruction attacks on neural networks, aiming to recover sensitive training data. Their unified optimization formulation, based on initial and trained parameter val…
-
New parameter E predicts Mixture-of-Experts model health, preventing dead experts.
Researchers have introduced a new dimensionless control parameter, E = T*H/(O+B), to predict the health of expert ecologies in Mixture-of-Experts (MoE) models. This parameter, derived from four hyperparameters, can prev…
-
Lookahead Drifting Model improves image generation with sequential drifting terms
Researchers have introduced a novel 'lookahead drifting model' for distribution mapping, building upon the existing 'drifting model' paradigm. This new approach computes a sequence of drifting terms at each training ite…
-
New MetaAdamW optimizer uses self-attention for adaptive learning rates
Researchers have developed MetaAdamW, a novel optimizer that enhances adaptive learning rates and weight decay by employing a self-attention mechanism. This Transformer-based approach dynamically adjusts hyperparameters…
-
LLMs accelerate neural architecture search with novel delta-based code generation
Researchers are exploring novel methods for Neural Architecture Search (NAS) using Large Language Models (LLMs). One approach, SPARK, aims to improve LLM knowledge integration by explicitly selecting functional factors …