ENTITY vision transformer

vision transformer

PulseAugur coverage of vision transformer — every cluster mentioning vision transformer across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

81 over 90d

Releases · 30d

0 over 90d

Papers · 30d

80 over 90d

TIER MIX · 90D

frontier release 1
research 26
tool 54

TOPICS

RELATIONSHIPS

SENTIMENT · 30D

17 day(s) with sentiment data

RECENT · PAGE 1/5 · 81 TOTAL

TOOL · CL_111748 · Jun 26 · 04:00

New AI method uses topology for improved flood detection in satellite imagery

Researchers have developed a new method for flood detection in satellite imagery by integrating topological data analysis (TDA) with neural networks. This approach aims to improve the interpretability of AI models used …
TOOL · CL_111891 · Jun 26 · 02:12

REViT imbues Vision Transformers with rotation equivariance without position encoding

Researchers have developed REViT, a novel approach that imbues Vision Transformers (ViTs) with rotation and reflection equivariance without relying on complex position encodings. By utilizing a 'Lifting' layer and Group…
TOOL · CL_109977 · Jun 25 · 04:00

New method offers tighter generalization bounds for neural networks

Researchers have developed a novel method to derive non-vacuous generalization bounds for deep neural networks from an optimization perspective. This approach models the discrete-time recursion process using a continuou…
RESEARCH · CL_105200 · Jun 22 · 13:52

Superhuman AI agent dominates Generals.io using self-play RL

A new research paper details the creation of a superhuman AI agent for the real-time strategy game Generals.io. Trained for four days on high-end GPUs, the agent achieved the top rank among over 5,000 human players and …
RESEARCH · CL_105072 · Jun 22 · 07:11

New framework uses hierarchical RL for neural network compression

Researchers have developed HiReLC, a hierarchical reinforcement learning framework designed to jointly quantize and prune deep neural networks. This approach uses low-level agents for per-kernel configurations and high-…
TOOL · CL_113315 · Jun 20 · 15:15

AI framework uses synthetic mammograms for label-efficient BAC segmentation

Researchers have developed BAC-JEPA, a novel framework for segmenting breast arterial calcifications (BAC) on mammograms using synthetic data. This label-efficient approach leverages procedurally generated arterial calc…
TOOL · CL_100244 · Jun 19 · 04:00

FrequencyFormer pipeline boosts vision transformer efficiency for edge devices

Researchers have developed FrequencyFormer, a novel pipeline designed to make vision transformers (ViTs) more efficient for deployment on sensor-edge systems. This approach leverages the frequency domain to compress ima…
TOOL · CL_100232 · Jun 19 · 04:00

New LEAP curriculum boosts Vision Transformer distillation efficiency

Researchers from the University of Oxford have introduced LEAP, a novel training curriculum designed to improve the efficiency of knowledge distillation for Vision Transformers (ViTs). LEAP utilizes a progressive approa…
TOOL · CL_100230 · Jun 19 · 04:00

New XAI dataset and method enhance species distribution model interpretability

Researchers have introduced a novel approach to enhance the interpretability of complex deep learning models used for species distribution modeling (SDMs). This method employs concept-based Explainable AI (XAI) techniqu…
TOOL · CL_100148 · Jun 19 · 04:00

AI model tunes quantum dots for Majorana modes

Researchers have developed a novel AI-enhanced method for tuning quantum dot simulators to achieve Majorana modes. This approach utilizes a deep vision-transformer network trained on synthetic data, incorporating a phys…
RESEARCH · CL_99573 · Jun 18 · 14:12

AI system automates scoring of student science drawings with confidence awareness

Researchers have developed a confidence-aware automated assessment system for student-drawn scientific models, utilizing a Vision Transformer (ViT). This system aims to reduce the cost and increase the scalability of ev…
RESEARCH · CL_99811 · Jun 18 · 08:03

New GPVAE framework enhances endoscopic video restoration

Researchers have developed a Gaussian Process Prior Variational Autoencoder (GPVAE) framework to improve the restoration of endoscopic videos, which are often degraded by artifacts like reflections and missing frames. T…
RESEARCH · CL_97836 · Jun 17 · 12:51

New LSTM-ViT Architecture Improves Weather Forecast Error Prediction

Researchers have developed a novel hybrid LSTM-Vision Transformer (LSTM-ViT) architecture to improve the prediction of forecast errors in high-resolution numerical weather prediction (NWP) systems. This new framework in…
TOOL · CL_96115 · Jun 17 · 04:00

ANEForge enables direct Python programming of Apple Neural Engine

A new Python package called ANEForge allows developers to directly program the Apple Neural Engine (ANE) without relying on CoreML. This bypass enables more efficient use of the ANE, which is the dedicated neural accele…
TOOL · CL_98912 · Jun 17 · 00:00

Bag of Dims: Training-Free Transformer Interpretability Method Unveiled

Researchers have developed a novel method called "Bag of Dims" that allows for training-free mechanistic interpretability of transformer models. This approach treats individual dimensions within transformer hidden state…
TOOL · CL_93955 · Jun 16 · 04:00

Deep learning models for lung cancer diagnosis show high accuracy but differing reasoning

A new study published on arXiv explores the interpretability of deep learning models used for lung cancer diagnosis. While three distinct models (CNN, ResNet50, and ViT) demonstrated high predictive accuracy, with ResNe…
TOOL · CL_93873 · Jun 16 · 04:00

Vision Transformer Outperforms CNNs in Maritime Ship Detection Study

A new study published on arXiv evaluates the effectiveness of Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) for maritime security applications, specifically ship detection. The research utilized a …
TOOL · CL_93205 · Jun 16 · 04:00

New Vision Transformer Cuts Image Captioning Costs with Clustering

Researchers have developed a new vision transformer architecture that significantly reduces computational costs for image captioning. By replacing the standard self-attention mechanism with a Gaussian Mixture Model-base…
RESEARCH · CL_90993 · Jun 12 · 17:48

New HumP-KD framework efficiently distills fire classification models

Researchers have developed HumP-KD, a novel framework for efficient fire classification using knowledge distillation. This method distills knowledge from larger transformer models like Swin-Tiny and ViT-Base into a smal…
TOOL · CL_85018 · Jun 11 · 04:00

New Vision Transformer enhances spacecraft pose estimation

Researchers have developed a new Vision Transformer model, PAID-ViT, designed to improve the accuracy of 6D pose estimation for spacecraft. This model is particularly effective in challenging conditions like varying ill…

New AI method uses topology for improved flood detection in satellite imagery

REViT imbues Vision Transformers with rotation equivariance without position encoding

New method offers tighter generalization bounds for neural networks

Superhuman AI agent dominates Generals.io using self-play RL

New framework uses hierarchical RL for neural network compression

AI framework uses synthetic mammograms for label-efficient BAC segmentation

FrequencyFormer pipeline boosts vision transformer efficiency for edge devices

New LEAP curriculum boosts Vision Transformer distillation efficiency

New XAI dataset and method enhance species distribution model interpretability

AI model tunes quantum dots for Majorana modes

AI system automates scoring of student science drawings with confidence awareness

New GPVAE framework enhances endoscopic video restoration

New LSTM-ViT Architecture Improves Weather Forecast Error Prediction

ANEForge enables direct Python programming of Apple Neural Engine

Bag of Dims: Training-Free Transformer Interpretability Method Unveiled

Deep learning models for lung cancer diagnosis show high accuracy but differing reasoning

Vision Transformer Outperforms CNNs in Maritime Ship Detection Study

New Vision Transformer Cuts Image Captioning Costs with Clustering

New HumP-KD framework efficiently distills fire classification models

New Vision Transformer enhances spacecraft pose estimation