PulseAugur
ENTITY vision transformer

PulseAugur coverage of the vision transformer: every cluster that mentions it across labs, papers, and developer communities, ranked by signal.

Total · 30d: 55 (55 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 55 (55 over 90d)
TIER MIX · 90D: [chart]
SENTIMENT · 30D: [chart], 2 days with sentiment data

RECENT · PAGE 1/2 · 27 TOTAL
  1. TOOL · CL_29284

    What-Where Transformer separates object appearance from location

    Researchers have introduced the What-Where Transformer (WWT), a novel visual backbone designed to better separate object appearance from spatial location. This new architecture uses a slot-based design where tokens repr…

  2. TOOL · CL_27971

    Diffusion augmentation boosts Bangla character recognition accuracy

    Researchers have developed a confidence-guided diffusion augmentation method to improve the recognition of handwritten Bangla compound characters. This approach uses diffusion models to generate high-quality synthetic c…

  3. TOOL · CL_27505

    Foundation model learns from Dutch satellite data for global benchmarks

    Researchers have developed a new foundation model for high-resolution remote sensing data, specifically trained on satellite images of the Netherlands. This model combines Convolutional Neural Networks and Vision Transf…

  4. TOOL · CL_21919

    Researchers develop robust foundation model for conservation laws using recurrent Vision Transformers

    Researchers have developed a new architecture that enhances Flux Neural Operators (Flux NO) by incorporating context through Recurrent Vision Transformers. This hypernetwork model extracts solution dynamics over time, e…

  5. TOOL · CL_22428

    LC4-DViT uses generative AI and transformers for accurate land-cover mapping

    Researchers have developed LC4-DViT, a novel framework for land-cover classification using a deformable Vision Transformer. This approach combines generative data creation with a deformation-aware backbone to improve ac…

  6. TOOL · CL_22391

    New framework fuses facial and physiological signals for better emotion recognition

    Researchers have developed a new framework for video-based emotion recognition that combines facial expressions with physiological signals from remote photoplethysmography (rPPG). Their method uses prompt tuning to inte…

  7. RESEARCH · CL_20294

    DART vision-language model offers comprehensive rope condition monitoring

    Researchers have developed DART, a vision-language foundation model designed for comprehensive rope condition monitoring. This model integrates a Vision Transformer with Llama-3.2-3B-Instruct to handle the entire inspec…

  8. TOOL · CL_18721

    Hebbian Fast Weights enhance Vision Transformers for few-shot character recognition

    Researchers have developed a new approach to few-shot character recognition by integrating Hebbian Fast-Weight (HFW) modules into Vision Transformer architectures. This method aims to mimic biological neural systems' ab…

  9. RESEARCH · CL_18667

    RD-ViT cuts data needs for segmentation, outperforming standard ViT with fewer parameters

    Researchers have developed RD-ViT, a novel Recurrent-Depth Vision Transformer designed for semantic segmentation tasks. This architecture significantly reduces data dependence by using a single, shared transformer block…

  10. RESEARCH · CL_18682

    OneTrackerV2 unifies multimodal visual tracking with Dual Mixture-of-Experts

    Researchers have developed a new event-based visual object tracking framework that addresses limitations of existing methods by explicitly modeling event density variations across multiple temporal scales. This approach…

  11. TOOL · CL_15745

    Researchers adapt Vision Transformers for fMRI analysis using flat maps

    Researchers have developed a new family of models called CortexMAE, which adapt Vision Transformers for analyzing functional MRI data by projecting 3D volumes into 2D flat maps. This approach, tested on over 2,000 hours…

  12. TOOL · CL_15561

    Deep learning models show promise in predicting cryptocurrency regimes from chart data

    Researchers have conducted a systematic study on using deep learning for cryptocurrency regime prediction based on visual chart representations. They compared various image encoding methods, chart components, and neural…

  13. RESEARCH · CL_15610

    AI models advance plant disease detection with new datasets and efficient distillation

    Researchers have developed new methods for plant leaf disease classification to aid in early detection and treatment. One approach involves training a new base model using the DenseNet201 architecture on a custom datase…

  14. TOOL · CL_16142

    New framework enhances 3D ocean temperature reconstruction using AI

    Researchers have developed an adaptive framework using spatiotemporal clustering to reconstruct 3D ocean subsurface temperature from surface observations. This method integrates with deep learning models like DP-CNN, At…

  15. TOOL · CL_16148

    Researchers develop AI framework for fluid-structure interaction prediction

    Researchers have developed a new machine learning framework for predicting fluid-structure interactions (FSI) over long periods on deforming meshes. The system integrates a graph neural operator with a Vision Transforme…

  16. RESEARCH · CL_16300

    New BerLU activation function improves deep learning stability and efficiency

    Researchers have introduced a new activation function called the Bernstein Linear Unit (BerLU) that aims to improve the stability and efficiency of deep neural networks. By utilizing Bernstein polynomials, BerLU creates…

  17. RESEARCH · CL_14354

    ClustViT paper introduces token merging for efficient semantic segmentation

    Researchers have introduced ClustViT, a novel approach to enhance Vision Transformers for semantic segmentation tasks. This method employs a trainable Cluster module to merge similar tokens, guided by segmentation masks…

  18. RESEARCH · CL_14340

    AI model uses copula-enhanced Vision Transformer for myopia diagnosis

    Researchers have developed a novel approach using a copula-enhanced Vision Transformer to improve the diagnosis of high myopia from ultra-widefield fundus images. This method addresses the challenges of capturing inter-…

  19. RESEARCH · CL_10159

    Paper proposes unified framework for efficient model unlearning in vision and audio

    Researchers have introduced Graph-Propagated Projection Unlearning (GPPU), a novel method designed to selectively remove learned information from deep neural networks. This technique is applicable to both vision and aud…

  20. RESEARCH · CL_10128

    Vision Transformer enables privacy-preserving clothing classification for thermal comfort

    Researchers have developed a novel privacy-preserving method for classifying clothing types using Vision Transformers. This approach aims to enable secure occupant-centric control systems for optimizing thermal comfort…