ADE20K
PulseAugur coverage of ADE20K — every cluster mentioning ADE20K across labs, papers, and developer communities, ranked by signal.
3 day(s) with sentiment data
-
Reload-Mamba enhances semantic segmentation with novel state-space modeling
Researchers have developed Reload-Mamba, a novel framework designed to enhance multi-class semantic segmentation using Mamba-based state space models. This approach tackles the issue of response dilution in sequential p…
-
New ToaSt framework boosts Vision Transformer efficiency
Researchers have developed a new framework called ToaSt designed to make Vision Transformers (ViTs) more computationally efficient. ToaSt decouples strategies for different parts of the ViT architecture, applying head-w…
-
AI accelerates image annotation with new segmentation techniques · 2 sources tracked
Researchers have developed new methods to accelerate image annotation for industrial applications. One study demonstrates that using unsupervised computer vision algorithms can reduce the time for semantic segmentation …
-
RATS! New Transformer Architecture Discovers Object Parts in Vision Models
Researchers have introduced RATS (Register Attention Transformers), a novel architecture for self-supervised visual models designed to discover compositional structure akin to human object part recognition. RATS utilize…
-
New certificate improves AI risk control and acceptance rates
Researchers have developed a new finite-sample certificate for adaptive selective conformal risk control, aiming to improve the safety and utility of selective predictors. This certificate simultaneously bounds selected…
-
New D3S2 method distills datasets for semantic segmentation
Researchers have developed D3S2, a novel framework for dataset distillation specifically designed for semantic segmentation tasks. This method addresses challenges like class imbalance and the need for precise pixel ali…
-
Vision Transformers improved with selective token interaction
Researchers have identified a phenomenon called "semantic diffusion" that degrades the performance of Vision Transformers (ViTs) in dense prediction tasks over time. This occurs when global semantic information spreads …
-
Vision models ditch activations for polynomial alternatives
Researchers have developed new activation-free backbone architectures for vision models, utilizing polynomial functions instead of traditional pointwise nonlinearities like ReLU or GELU. These novel modules, integrated …
-
New TsallisPGD attack method improves adversarial attacks on semantic segmentation models
Researchers have developed TsallisPGD, a novel adversarial attack method designed to more effectively target semantic segmentation models. This new approach utilizes Tsallis cross-entropy, a generalized form of standard…
-
AdaVFM framework uses LLMs to adapt vision models for edge devices
Researchers have developed AdaVFM, a novel framework designed to make large vision foundation models more efficient for edge devices. This system dynamically adjusts computational load based on the complexity of the sce…
-
FractalMamba++ scales vision models across resolutions using Hilbert curves
Researchers have introduced FractalMamba++, an enhanced vision backbone designed to improve the performance of Mamba-based models, particularly with high-resolution inputs. This new architecture leverages the geometric …
-
Canonical knowledge distillation proves effective for semantic segmentation
A new research paper demonstrates that standard knowledge distillation techniques are surprisingly effective for semantic segmentation tasks. The study found that when accounting for computational budget, canonical logi…
-
New DGM-Net model offers efficient semantic segmentation with geometric guidance
Researchers have developed DGM-Net, an efficient architecture for semantic segmentation that bypasses the need for large models and high computational budgets. The network utilizes a novel Directional Geometric Mamba (G…