State Space Model
PulseAugur coverage of State Space Model — every cluster mentioning State Space Model across labs, papers, and developer communities, ranked by signal.
3 day(s) with sentiment data
-
New SSM adapters outperform LoRA for long-context fine-tuning
Researchers have developed a new parameter-efficient fine-tuning (PEFT) method called Hankel Reduced order Model (HRM) adapters, which utilize state space models (SSMs) for long-context fine-tuning. Unlike traditional P…
-
New research explores diffusion and state space models beyond autoregressive AI
Two new arXiv papers explore advanced modeling techniques beyond traditional autoregressive language models. The first paper surveys Diffusion Models, Code World Models, and State Space Models for code intelligence, sug…
-
Infrawise tool prevents AI-written code from causing costly cloud errors
Infrawise has developed a tool that analyzes code and infrastructure to prevent costly AI-generated errors, such as inefficient database scans. The system first scans code repositories to identify database client calls …
-
StateSMix compressor uses Mamba SSMs and n-grams for online lossless compression
Researchers have developed StateSMix, a novel lossless compression algorithm that utilizes Mamba-style State Space Models (SSMs) combined with sparse n-gram context mixing. This system trains token-by-token on the data …
-
ViM-Q enables efficient Vision Mamba model inference on FPGAs
Researchers have developed ViM-Q, a novel algorithm-hardware co-design specifically for accelerating Vision Mamba (ViM) model inference on FPGAs. This approach tackles challenges in quantizing dynamic activation outlier…
-
NAKUL-Med model enhances medical signal analysis with dynamic kernels and spectral context
Researchers have developed NAKUL-Med, a novel spectral-graph state space model designed to enhance the analysis of multi-channel medical signals. This model addresses limitations in existing state space models by incorp…
-
SSMProbe framework reveals importance of token order in visual representations
Researchers have developed SSMProbe, a new framework for analyzing visual representations in AI models. This method utilizes State Space Models (SSMs) to account for the critical role of token order, challenging the tra…
-
Caracal architecture uses Fourier transforms for efficient long-sequence modeling
Researchers have introduced Caracal, a new architecture designed to improve the scalability of large language models for processing long sequences. Caracal replaces the computationally expensive attention mechanism with…
-
L2RU introduces stable state-space models for machine learning and control
Researchers have introduced L2RU, a new class of structured state-space models (SSMs) designed to ensure input-output stability and robustness. This architecture integrates deep learning expressiveness with dynamical sy…
-
New Mamba model variant enhances memory retention and bilinear computation
Researchers have introduced Bilinear Input Modulation (BIM) to enhance Selective State Space Models (SSMs), specifically Mamba, by incorporating state-input products. This augmentation allows for improved memory retenti…
-
Mamba model offers Transformer-level performance with faster inference and longer context
Mamba, a new State Space Model (SSM), presents an alternative to the dominant Transformer architecture in AI. It aims to match Transformer performance and scaling laws while efficiently handling extremely long sequences…