vision transformer
PulseAugur coverage of vision transformer — every cluster mentioning vision transformer across labs, papers, and developer communities, ranked by signal.
7 天有情绪数据
-
New framework enhances 3D ocean temperature reconstruction using AI
Researchers have developed an adaptive framework using spatiotemporal clustering to reconstruct 3D ocean subsurface temperature from surface observations. This method integrates with deep learning models like DP-CNN, At…
-
Researchers adapt Vision Transformers for fMRI analysis using flat maps
Researchers have developed a new family of models called CortexMAE, which adapt Vision Transformers for analyzing functional MRI data by projecting 3D volumes into 2D flat maps. This approach, tested on over 2,000 hours…
-
AI models advance plant disease detection with new datasets and efficient distillation
Researchers have developed new methods for plant leaf disease classification to aid in early detection and treatment. One approach involves training a new base model using the DenseNet201 architecture on a custom datase…
-
Deep learning models show promise in predicting cryptocurrency regimes from chart data
Researchers have conducted a systematic study on using deep learning for cryptocurrency regime prediction based on visual chart representations. They compared various image encoding methods, chart components, and neural…
-
Compress Then Adapt? No, Do It Together via Task-aware Union of Subspaces
Researchers have introduced JACTUS, a novel framework that unifies parameter-efficient fine-tuning (PEFT) and low-rank compression for adapting large pretrained models. Unlike sequential methods, JACTUS jointly optimize…
-
New BerLU activation function improves deep learning stability and efficiency
Researchers have introduced a new activation function called the Bernstein Linear Unit (BerLU) that aims to improve the stability and efficiency of deep neural networks. By utilizing Bernstein polynomials, BerLU creates…
-
ClustViT paper introduces token merging for efficient semantic segmentation
Researchers have introduced ClustViT, a novel approach to enhance Vision Transformers for semantic segmentation tasks. This method employs a trainable Cluster module to merge similar tokens, guided by segmentation masks…
-
AI model uses copula-enhanced Vision Transformer for myopia diagnosis
Researchers have developed a novel approach using a copula-enhanced Vision Transformer to improve the diagnosis of high myopia from ultra-widefield fundus images. This method addresses the challenges of capturing inter-…
-
Researchers adapt self-supervised learning for plant image recognition
Researchers have developed a self-supervised learning approach for plant image recognition, addressing the limitations of traditional supervised methods that require extensive expert-labeled data. The study found that s…
-
Paper proposes unified framework for efficient model unlearning in vision and audio
Researchers have introduced Graph-Propagated Projection Unlearning (GPPU), a novel method designed to selectively remove learned information from deep neural networks. This technique is applicable to both vision and aud…
-
Vision Transformer enables privacy-preserving clothing classification for thermal comfort
Researchers have developed a novel privacy-preserving method for classifying clothing types using Vision Transformers. This approach aims to enable secure occupant-centric control systems for optimizing thermal comfort …
-
New methods achieve industry-grade head modeling and AI-generated image detection
Researchers have developed a new framework for reconstructing high-fidelity 3D head models from single images, preserving facial identity and achieving industry-grade topology through a coarse-to-fine optimization pipel…
-
New research explores Vision Transformers for robust weed detection from drone imagery
Researchers have developed a new method for detecting Rumex obtusifolius (a type of weed) using drone imagery, addressing the challenge of domain adaptation in machine learning. Standard Convolutional Neural Networks (C…
-
A Graph-Augmented knowledge Distillation based Dual-Stream Vision Transformer with Region-Aware Attention for Gastrointestinal Disease Classification with Explainable AI
Researchers have developed a novel dual-stream deep learning framework for classifying gastrointestinal diseases from medical imagery. This system utilizes a teacher-student knowledge distillation approach, combining a …
-
Researchers distill Vision Transformers for robust learning from distorted images
Researchers have developed a new knowledge distillation framework to improve the robustness of vision models against image distortions. The method uses an asymmetric approach where a teacher model processes clean images…
-
Machine learning models reveal geographic data improves insurance claim predictions
Researchers have developed a method to incorporate geographic information into motor insurance claim prediction models, even with limited location data. By utilizing environmental data from OpenStreetMap and CORINE Land…
-
AI models achieve high accuracy in brain tumor classification and segmentation
Researchers have developed two distinct deep learning frameworks for brain tumor analysis using MRI scans. One framework utilizes a Vision Transformer (ViT-B/16) for automated four-class tumor classification, achieving …
-
Hugging Face benchmarks visual state-space models for remote-sensing segmentation
A new benchmark study rigorously compares visual state-space models (SSMs) like VMamba and MambaVision against traditional Vision Transformers for remote-sensing segmentation. The research found that while visual SSMs o…