Mamba
PulseAugur coverage of Mamba — every cluster mentioning Mamba across labs, papers, and developer communities, ranked by signal.
3 day(s) with sentiment data
-
Phasor Memory Networks tackle gradient instability in explicit memory models
Researchers have introduced Phasor Memory Networks (PMNet), a novel architecture designed to overcome the gradient instability issues that have historically plagued explicit memory models. By employing Unitary Phasor Dy…
-
STAR framework boosts few-shot action recognition with LLM-guided temporal learning
Researchers have developed a new framework called STAR (Semantic-Temporal Adaptive Representation Learning) to improve few-shot action recognition in videos. This approach addresses issues of semantic-temporal misalignm…
-
Mamba-based neural decoder offers scalable solution for error-correcting codes
Researchers have developed a new neural decoder called MMPD, which utilizes Mamba state-space blocks to efficiently process long error-correcting codes. This attention-free approach significantly reduces memory and comp…
-
New Mamba-based network improves EEG decoding for stroke patients
Researchers have developed CFSPMNet, a novel framework designed to improve the decoding of motor imagery electroencephalography (MI-EEG) signals for stroke patients. This new model addresses the challenge of cross-patie…
-
NVIDIA Star Elastic embeds multiple reasoning models in one checkpoint
NVIDIA researchers have introduced Star Elastic, a novel post-training method that embeds multiple reasoning models of varying parameter sizes within a single checkpoint. This approach allows for the extraction of small…
-
VIMCAN network fuses Mamba and attention for real-time 3D human pose estimation
Researchers have developed VIMCAN, a novel hybrid network for visual-inertial 3D human pose estimation. This architecture integrates Mamba's efficient sequence modeling with Cross-Attention's spatial reasoning capabilit…
-
New research links neural network OOD generalization to feature engineering
Researchers have identified that deep neural networks often fail to learn representations that generalize to out-of-distribution (OOD) data because they cannot decouple feature learning from data-generating process iden…
-
GEM model generates LiDAR world models for autonomous driving
Researchers have developed GEM, a generative LiDAR world model designed to simulate environmental dynamics for autonomous driving. The model utilizes a deformable Mamba architecture to overcome challenges with disordere…
-
Wisteria model unifies multi-scale feature learning for DNA language analysis
Researchers have introduced Wisteria, a novel framework designed to enhance DNA language models by integrating multi-scale feature learning. This model combines gated dilated convolutions and gated multilayer perceptron…
-
HaM-World model enhances AI planning with selective memory and Hamiltonian dynamics
Researchers have introduced HaM-World, a novel structured world model designed to improve the stability and accuracy of planning in reinforcement learning. This model decomposes latent states into canonical (q, p) and c…
-
SSMamba model enhances pathological image classification with hybrid self-supervised learning
Researchers have developed SSMamba, a novel self-supervised hybrid state space model designed for pathological image classification. This framework addresses limitations in current models, such as domain shift across ma…
-
New GDS-Mamba model enhances tree species classification with graph and sparse tokens
Researchers have developed a new model called GDS-Mamba to improve the classification of tree species using MODIS satellite time series data. This model addresses challenges like subtle species differences and the coupl…
-
Simpler fusion modules outperform complex transformers for pasture biomass regression
A new research paper introduces the principle of "fusion complexity inversion," demonstrating that simpler cross-view fusion modules can outperform more complex ones like attention transformers and SSMs for pasture biom…
-
MambaBack architecture enhances whole slide image analysis with hybrid AI approach
Researchers have introduced MambaBack, a novel hybrid architecture designed to improve whole slide image (WSI) analysis in computational pathology. This new model combines the strengths of Mamba and MambaOut to better c…
-
Researchers develop SAMIC, a lightweight Mamba-based model for efficient perceptual image compression
Researchers have developed SAMIC, a novel method for efficient perceptual image compression that utilizes Mamba, a state space model known for its long-range modeling capabilities and linear complexity. Unlike tradition…
-
StateSMix compressor uses Mamba SSMs and n-grams for online lossless compression
Researchers have developed StateSMix, a novel lossless compression algorithm that utilizes Mamba-style State Space Models (SSMs) combined with sparse n-gram context mixing. This system trains token-by-token on the data …
-
Mantis framework offers efficient Mamba-native tuning for 3D point cloud models
Researchers have introduced Mantis, a novel framework for parameter-efficient fine-tuning (PEFT) specifically designed for Mamba-based 3D point cloud foundation models. Existing PEFT methods struggle with Mamba's state-…
-
HiFi-Mamba model enhances MRI reconstruction with dual-stream architecture
Researchers have developed HiFi-Mamba, a novel dual-stream Mamba-based architecture designed to improve the fidelity of MRI image reconstruction. This new model addresses limitations in existing Mamba variants by enhanc…
-
SAMamba3D adapts Segment Anything for generalizable 3D pore-scale image segmentation
Researchers have developed SAMamba3D, a new framework designed to improve the generalizability of 3D image segmentation for multiphase pore-scale rock images. This method adapts the existing Segment Anything Model (SAM)…
-
StereoMamba architecture enhances real-time stereo disparity estimation for robotic surgery
Researchers have developed StereoMamba, a novel architecture for real-time stereo disparity estimation in robot-assisted surgery. This system utilizes a Feature Extraction Mamba module to capture long-range spatial depe…