ImageNet
PulseAugur coverage of ImageNet — every cluster mentioning ImageNet across labs, papers, and developer communities, ranked by signal.
- used by CIFAR-10 70%
- used by Diffusion Models 70%
- instance of Diffusion Models 70%
- used by vision transformer 70%
- used by residual neural network 70%
- instance of Diffusion models of ion-channel gating and the origin of power-law distributions from single-channel recording 70%
- instance of magazine 70%
- used by ConvNeXt 70%
- instance of arXiv 60%
- instance of CIFAR-100 60%
- instance of CNNS 60%
- affiliated with arXiv 50%
12 天有情绪数据
-
New attack framework targets AI models with theoretical guarantees
Researchers have developed a new framework for adversarial attacks on AI models, focusing on hard-label black-box scenarios where only the top prediction is accessible. Their approach introduces a novel zero-query initi…
-
Vision Transformers improved with selective token interaction
Researchers have identified a phenomenon called "semantic diffusion" that degrades the performance of Vision Transformers (ViTs) in dense prediction tasks over time. This occurs when global semantic information spreads …
-
New RBDC protocol slashes vision model training costs by 30%
Researchers have developed a new training protocol called RBDC to make training large vision models more resource-efficient. This method involves recursively coupling independently trained, narrower models in a paramete…
-
DINOv3 vs ImageNet: Transfer learning for industrial vision tasks
A new research paper explores the effectiveness of transfer learning for industrial visual inspection tasks. The study compares DINOv3, a self-supervised model, against traditional ImageNet pretraining for RGB and X-ray…
-
ConvNeXt-FD model enhances biomedical image segmentation
Researchers have developed ConvNeXt-FD, a new deep learning model for segmenting biomedical images. This model utilizes a U-Net-like structure with a ConvNeXt backbone and incorporates a novel loss function that include…
-
New Gaussian Mixture Model improves DDIM sampling quality
Researchers have developed a new method to improve the sampling process in Denoising Diffusion Implicit Models (DDIM). Their approach utilizes a Gaussian Mixture Model (GMM) as the reverse transition operator, which mat…
-
New MDSE attack fools Spiking Neural Networks and traditional models
Researchers have developed a new adversarial attack method called Mixed Dynamic Spiking Estimation (MDSE) specifically for Spiking Neural Networks (SNNs). This attack demonstrates that the effectiveness of white-box adv…
-
TextTeacher uses language embeddings to boost vision model accuracy
Researchers have developed TextTeacher, a novel method to enhance vision model performance by leveraging language embeddings. This technique injects text information from image captions into the training process of visi…
-
FAIR-Pruner framework enables adaptive layer-wise neural network pruning
Researchers have developed FAIR-Pruner, a new framework designed for automatic, layer-wise structured pruning of deep neural networks. This method adaptively allocates sparsity across network layers by using both remova…
-
AI learning rules align with early primate vision, diverge in higher areas
Researchers have published a study comparing how different learning rules in artificial neural networks align with visual processing in both humans and macaques. The study found that early visual cortex alignment was co…
-
New framework boosts medical image classification with dual model approach
Researchers have developed a new deep learning framework for medical image classification that combines self-supervised and transfer learning techniques. The approach utilizes two ConvNeXt-Tiny models, one pre-trained o…
-
New framework analyzes neural network robustness to data shifts
Researchers have developed a new framework to analyze the distributional robustness of deep neural networks, a key challenge for real-world AI deployment. The framework models interactions between layer weights and acti…
-
Fully Ternary Vision Transformer Achieves High Compression for Microcontrollers
Researchers have developed FTerViT, a fully ternary Vision Transformer that compresses all weight matrices and normalization parameters. This approach significantly reduces the model's memory footprint, making it more f…
-
Winfree Oscillatory Neural Network shows parameter efficiency
Researchers have introduced the Winfree Oscillatory Neural Network (WONN), a novel dynamical architecture that leverages generalized Winfree dynamics for computation and representation. This new model evolves representa…
-
Vision models ditch activations for polynomial alternatives
Researchers have developed new activation-free backbone architectures for vision models, utilizing polynomial functions instead of traditional pointwise nonlinearities like ReLU or GELU. These novel modules, integrated …
-
New routing method boosts Diffusion Transformer training efficiency
Researchers have developed Diffusion-Adaptive Routing (DAR), a novel method to improve information flow in Diffusion Transformers (DiTs). By analyzing cross-layer information dynamics, they identified inefficiencies in …
-
SRC-Flow method enhances image generation with compact semantic representations
Researchers have developed SRC-Flow, a new normalizing flow method designed to improve image generation quality. The approach addresses the challenge of normalizing flows struggling with high-dimensional representations…
-
Dual-Rate Diffusion speeds up generative models with interleaved networks
Researchers have developed Dual-Rate Diffusion, a novel technique to speed up the inference process for diffusion models. This method interleaves a computationally intensive context encoder with a lightweight denoising …
-
New image tokenization methods boost MLLM performance
Two new research papers propose novel methods for tokenizing images to improve multimodal large language models (MLLMs). The first paper, VFMTok, uses a frozen vision foundation model as a tokenizer, achieving significa…
-
Neural networks learn image features via Fourier analysis
Researchers have explored the learning dynamics of neural networks through a Fourier perspective, focusing on how they learn simpler features before more complex ones. Their work introduces a synthetic data model for tr…