实体 State space models: Univariate representation of a multivariate model, partial interpolation and periodic convergence

State space models: Univariate representation of a multivariate model, partial interpolation and periodic convergence

PulseAugur coverage of State space models: Univariate representation of a multivariate model, partial interpolation and periodic convergence — every cluster mentioning State space models: Univariate representation of a multivariate model, partial interpolation and periodic convergence across labs, papers, and developer communities, ranked by signal.

Show in brief

总计 · 30天

90 天内 14

发布 · 30天

90 天内 0

论文 · 30天

90 天内 14

层级分布 · 90 天

关系

instance of State Space Models 90%
affiliated with linear attention 70%
instance of linear attention 70%

情绪 · 30 天

6 天有情绪数据

最近 · 第 1/1 页 · 共 14 条

TOOL · CL_44923 · May 22 · 04:00

新的内存分页技术提高了混合式大语言模型推理效率

研究人员开发了一种名为非对称虚拟内存分页（AVMP）的新内存管理技术，以提高混合式语言模型的效率。这些模型结合了Transformer层和状态空间模型（SSM），导致存在当前系统处理不佳的独特内存缓存类型。AVMP将这些缓存类型分离到不同的池中，并在需要时允许它们之间的容量迁移，从而减少内存不足事件并显著提高请求吞吐量。
RESEARCH · CL_42474 · May 20 · 15:36

Deformba method enhances State Space Models for vision tasks

Researchers have introduced Deformba, a novel context-adaptive method designed to enhance the application of State Space Models (SSMs) to vision tasks. Deformba addresses limitations in existing vision SSMs by dynamical…
TOOL · CL_36599 · May 15 · 15:18

Looped SSMs improve time series classification with depth-recurrence

Researchers have introduced Looped SSMs, a novel approach to State Space Models for time series classification. This method enhances performance by applying depth-recurrence, where model blocks are reused across layers,…
TOOL · CL_30805 · May 13 · 17:56

Quantum memory approach enhances long-sequence token modeling

Researchers have developed QLAM, a novel hybrid quantum-classical memory mechanism designed to enhance long-sequence token modeling. QLAM represents the hidden state as a quantum state, leveraging superposition to encod…
RESEARCH · CL_34499 · May 11 · 20:03

新的注意力方法应对大语言模型长上下文挑战

研究人员正在开发新的注意力机制来处理大型语言模型中日益增长的长上下文。一种方法，Runtime-Certified Bounded-Error Quantized Attention，使用分层 KV 缓存来压缩内存，同时保证回退到精确注意力，确保语言建模和检索等任务的质量。另一种方法，DashAttention，采用可微分稀疏分层注意力来适应性地选择相关 token，以与全注意力相当的准确性实现高稀疏度，并提供优于现有分层方法的性能。…
TOOL · CL_25583 · May 8 · 13:59

循环模型因误差动力学而在状态跟踪方面失败

研究人员引入了一种关于循环神经网络架构中状态跟踪的新视角，强调误差控制动力学而非理论表达能力。他们证明了仿射循环网络（包括状态空间模型和线性注意力）由于无法在状态分离子空间上纠正误差，因此在鲁棒状态跟踪方面存在困难。这种限制导致了由累积误差决定的有限视界解决方案，并且随着可区分性比率跨越临界阈值，跟踪精度会可预测地下降。
RESEARCH · CL_20526 · May 6 · 16:01

New paper proves AI models face 'Impossibility Triangle' trade-off

Researchers have identified a fundamental trade-off in long-context models, proving that no single architecture can simultaneously achieve efficiency, compactness, and recall. The study formalizes this "Impossibility Tr…
TOOL · CL_18843 · May 6 · 04:00

New method aligns State Space Model inductive bias for better data efficiency

Researchers have developed a new framework to align the inductive bias of State Space Models (SSMs) for improved data efficiency. This method, called Task-Dependent Initialization (TDI), matches the model's initial bias…
TOOL · CL_15589 · May 5 · 04:00

SSMProbe framework reveals importance of token order in visual representations

Researchers have developed SSMProbe, a new framework for analyzing visual representations in AI models. This method utilizes State Space Models (SSMs) to account for the critical role of token order, challenging the tra…
RESEARCH · CL_14356 · May 4 · 04:00

New AI models tackle image and video restoration with advanced techniques

Researchers have developed several new methods for image and video restoration tasks. One approach, Continuous Expert Assembly (CEA), uses a dynamic parameterization framework to adapt to diverse local degradation patte…
RESEARCH · CL_09762 · Apr 29 · 09:17

PKS4 scanners offer efficient video understanding with 10x lower training compute

Researchers have introduced PKS$^4$, a novel approach to efficient video understanding that addresses the computational challenges of long video sequences. This method integrates a plug-and-play module with linear-compl…
RESEARCH · CL_05127 · Apr 27 · 04:00

StateX framework boosts RNN recall by expanding model states post-training

Researchers have developed StateX, a post-training framework designed to improve the recall capabilities of recurrent neural networks (RNNs). This method efficiently expands the states of pre-trained RNNs, such as linea…
RESEARCH · CL_01131 · Apr 22 · 00:00

Apple researchers unveil parallel RNN training and enhanced SSMs at ICLR 2026

Apple researchers are presenting new work at ICLR 2026, focusing on advancements in recurrent neural networks (RNNs) and state space models (SSMs). Their paper "ParaRNN" introduces a parallelized training framework that…
RESEARCH · CL_04837 · Mar 28 · 01:24

Mamba model offers Transformer-level performance with faster inference and longer context

Mamba, a new State Space Model (SSM), presents an alternative to the dominant Transformer architecture in AI. It aims to match Transformer performance and scaling laws while efficiently handling extremely long sequences…

新的内存分页技术提高了混合式大语言模型推理效率

Deformba method enhances State Space Models for vision tasks

Looped SSMs improve time series classification with depth-recurrence

Quantum memory approach enhances long-sequence token modeling

新的注意力方法应对大语言模型长上下文挑战

循环模型因误差动力学而在状态跟踪方面失败

New paper proves AI models face 'Impossibility Triangle' trade-off

New method aligns State Space Model inductive bias for better data efficiency

SSMProbe framework reveals importance of token order in visual representations

New AI models tackle image and video restoration with advanced techniques

PKS4 scanners offer efficient video understanding with 10x lower training compute

StateX framework boosts RNN recall by expanding model states post-training

Apple researchers unveil parallel RNN training and enhanced SSMs at ICLR 2026

Mamba model offers Transformer-level performance with faster inference and longer context