ImageNet
PulseAugur coverage of ImageNet — every cluster mentioning ImageNet across labs, papers, and developer communities, ranked by signal.
- used by CIFAR-10 70%
- used by Diffusion Models 70%
- instance of Diffusion Models 70%
- used by vision transformer 70%
- used by residual neural network 70%
- instance of Diffusion models of ion-channel gating and the origin of power-law distributions from single-channel recording 70%
- instance of magazine 70%
- used by ConvNeXt 70%
- instance of arXiv 60%
- instance of CIFAR-100 60%
- instance of CNNS 60%
- affiliated with arXiv 50%
12 天有情绪数据
-
New StAD method speeds up generative model likelihood calculations
Researchers have developed a new method called StAD to improve the speed and accuracy of likelihood calculations in diffusion and flow-based generative models. This technique bypasses the need to compute the Jacobian of…
-
Deep learning ensemble boosts plant disease classification accuracy
Researchers have developed AgriMind, an ensemble deep learning framework designed to automate plant disease classification. This system combines three models—ResNet50, EfficientNet-B0, and DenseNet121—trained on over 20…
-
Sparse Autoencoders enable robust CLIP model fine-tuning
Researchers have developed a new method called SAE-FT for fine-tuning large vision-language models like CLIP. This technique uses Sparse Autoencoders to regularize changes in the model's visual representations, preventi…
-
Diffusion Transformers advance image generation and material transfer
Researchers have introduced several advancements in Diffusion Transformer (DiT) architectures for image generation and manipulation. One paper explores the use of register tokens in pixel-space DiTs to improve convergen…
-
Engram module fails to improve autoregressive image generation
Researchers investigated the effectiveness of the Engram module, a memory retrieval system, in autoregressive image generation models. Adapting the module for vision tasks, they found that Engram-augmented models perfor…
-
New AID method improves image inpainting with diffusion models
Researchers have developed a new method called Amortized Inpainting with Diffusion (AID) for image inpainting using pretrained diffusion models. AID trains a small, reusable guidance module offline, which can then be ap…
-
Asymmetric Flow Models achieve state-of-the-art image generation
Researchers have introduced Asymmetric Flow Modeling (AsymFlow), a novel approach to generative models that significantly improves performance in high-dimensional spaces. This method restricts noise prediction to a low-…
-
What-Where Transformer separates object appearance from location
Researchers have introduced the What-Where Transformer (WWT), a novel visual backbone designed to better separate object appearance from spatial location. This new architecture uses a slot-based design where tokens repr…
-
Study evaluates transfer learning for deep neural networks in image classification
Researchers explored how to best select pre-trained deep neural networks for image classification tasks. They adapted eleven models, originally trained on ImageNet, to five distinct target datasets. The study evaluated …
-
Embodied AI redefines computer vision's role at CVPR 2026
Embodied AI is shifting the focus of computer vision research, moving from understanding static images to enabling intelligent agents to interact with and manipulate the real world. This paradigm shift, evident at CVPR …
-
He Kai Ming's team advances flow matching for faster image generation
He Kai Ming's team has published several papers challenging the dominance of diffusion models in image generation, proposing flow matching as a more efficient alternative. Their work introduces methods like JiT, which d…
-
New generative models leverage Wasserstein flows for faster, higher-quality outputs
Researchers are exploring new methods for generative modeling, focusing on Wasserstein gradient flows to improve efficiency and sample quality. One approach, W-Flow, achieves state-of-the-art one-step generation for ima…
-
New theories explore spectral dynamics in deep neural network training
Two new arXiv papers explore the spectral dynamics of deep neural networks during training. One paper introduces "Neural Low-Degree Filtering" (Neural LoFi) as a theoretical framework to understand hierarchical feature …
-
New SWAP-Score metric evaluates neural networks without training
Researchers have introduced SWAP-Score, a novel zero-shot metric designed to evaluate neural networks without requiring training. This method measures a network's expressivity using sample-wise activation patterns and d…
-
FRInGe paper introduces Fisher-Rao Integrated Gradients for improved AI model attribution
Researchers have introduced FRInGe, a novel method for improving gradient-based attribution in machine learning models. FRInGe addresses limitations of existing techniques like Integrated Gradients by defining a referen…
-
Prologue method improves autoregressive image generation quality
Researchers have introduced Prologue, a novel method to enhance autoregressive image generation by decoupling reconstruction and generation tasks. Instead of altering visual tokens, Prologue generates a small set of ini…
-
New chaotic self-supervision boosts medical image classification accuracy
Researchers have developed a new self-supervised learning strategy called the Chaotic Denoising Autoencoder (CDAE) for medical image classification. Unlike methods that use masking, CDAE applies chaotic transformations …
-
Researchers accelerate discrete autoregressive models with Wasserstein flow and Jacobi decoding
Researchers have developed a new method to accelerate the inference of discrete autoregressive normalizing flows, a type of generative model. The proposed technique, Selective Jacobi Decoding, allows for parallel iterat…
-
New methods accelerate visual generation models with variable codebooks and optimized decoding
Researchers have introduced Variable Codebook Size Quantization (VCQ) to address limitations in autoregressive visual generation models. VCQ modifies the codebook size dynamically along the sequence, improving reconstru…
-
TwistNet-2D learns second-order channel interactions for texture recognition
Researchers have developed TwistNet-2D, a novel module designed to enhance texture recognition by capturing second-order channel interactions. This module computes local pairwise channel products with directional spatia…