vision transformer
PulseAugur coverage of vision transformer — every cluster mentioning vision transformer across labs, papers, and developer communities, ranked by signal.
3 day(s) with sentiment data
-
Paper proposes unified framework for efficient model unlearning in vision and audio
Researchers have introduced Graph-Propagated Projection Unlearning (GPPU), a novel method designed to selectively remove learned information from deep neural networks. This technique is applicable to both vision and aud…
-
New methods achieve industry-grade head modeling and AI-generated image detection
Researchers have developed a new framework for reconstructing high-fidelity 3D head models from single images, preserving facial identity and achieving industry-grade topology through a coarse-to-fine optimization pipel…
-
New research explores Vision Transformers for robust weed detection from drone imagery
Researchers have developed a new method for detecting Rumex obtusifolius (a type of weed) using drone imagery, addressing the challenge of domain adaptation in machine learning. Standard Convolutional Neural Networks (C…
-
A Graph-Augmented knowledge Distillation based Dual-Stream Vision Transformer with Region-Aware Attention for Gastrointestinal Disease Classification with Explainable AI
Researchers have developed a novel dual-stream deep learning framework for classifying gastrointestinal diseases from medical imagery. This system utilizes a teacher-student knowledge distillation approach, combining a …
-
Researchers distill Vision Transformers for robust learning from distorted images
Researchers have developed a new knowledge distillation framework to improve the robustness of vision models against image distortions. The method uses an asymmetric approach where a teacher model processes clean images…
-
Machine learning models reveal geographic data improves insurance claim predictions
Researchers have developed a method to incorporate geographic information into motor insurance claim prediction models, even with limited location data. By utilizing environmental data from OpenStreetMap and CORINE Land…
-
AI models achieve high accuracy in brain tumor classification and segmentation
Researchers have developed two distinct deep learning frameworks for brain tumor analysis using MRI scans. One framework utilizes a Vision Transformer (ViT-B/16) for automated four-class tumor classification, achieving …
-
Hugging Face benchmarks visual state-space models for remote-sensing segmentation
A new benchmark study rigorously compares visual state-space models (SSMs) like VMamba and MambaVision against traditional Vision Transformers for remote-sensing segmentation. The research found that while visual SSMs o…