PulseAugur
实时 10:21:15
实体 State space models: Univariate representation of a multivariate model, partial interpolation and periodic convergence

State space models: Univariate representation of a multivariate model, partial interpolation and periodic convergence

PulseAugur coverage of State space models: Univariate representation of a multivariate model, partial interpolation and periodic convergence — every cluster mentioning State space models: Univariate representation of a multivariate model, partial interpolation and periodic convergence across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
14
90 天内 14
发布 · 30天
0
90 天内 0
论文 · 30天
14
90 天内 14
层级分布 · 90 天
关系
情绪 · 30 天

6 天有情绪数据

最近 · 第 1/1 页 · 共 14 条
  1. TOOL · CL_44923 ·

    新的内存分页技术提高了混合式大语言模型推理效率

    研究人员开发了一种名为非对称虚拟内存分页(AVMP)的新内存管理技术,以提高混合式语言模型的效率。这些模型结合了Transformer层和状态空间模型(SSM),导致存在当前系统处理不佳的独特内存缓存类型。AVMP将这些缓存类型分离到不同的池中,并在需要时允许它们之间的容量迁移,从而减少内存不足事件并显著提高请求吞吐量。

  2. RESEARCH · CL_42474 ·

    Deformba method enhances State Space Models for vision tasks

    Researchers have introduced Deformba, a novel context-adaptive method designed to enhance the application of State Space Models (SSMs) to vision tasks. Deformba addresses limitations in existing vision SSMs by dynamical…

  3. TOOL · CL_36599 ·

    Looped SSMs improve time series classification with depth-recurrence

    Researchers have introduced Looped SSMs, a novel approach to State Space Models for time series classification. This method enhances performance by applying depth-recurrence, where model blocks are reused across layers,…

  4. TOOL · CL_30805 ·

    Quantum memory approach enhances long-sequence token modeling

    Researchers have developed QLAM, a novel hybrid quantum-classical memory mechanism designed to enhance long-sequence token modeling. QLAM represents the hidden state as a quantum state, leveraging superposition to encod…

  5. RESEARCH · CL_34499 ·

    新的注意力方法应对大语言模型长上下文挑战

    研究人员正在开发新的注意力机制来处理大型语言模型中日益增长的长上下文。一种方法,Runtime-Certified Bounded-Error Quantized Attention,使用分层 KV 缓存来压缩内存,同时保证回退到精确注意力,确保语言建模和检索等任务的质量。另一种方法,DashAttention,采用可微分稀疏分层注意力来适应性地选择相关 token,以与全注意力相当的准确性实现高稀疏度,并提供优于现有分层方法的性能。…

  6. TOOL · CL_25583 ·

    循环模型因误差动力学而在状态跟踪方面失败

    研究人员引入了一种关于循环神经网络架构中状态跟踪的新视角,强调误差控制动力学而非理论表达能力。他们证明了仿射循环网络(包括状态空间模型和线性注意力)由于无法在状态分离子空间上纠正误差,因此在鲁棒状态跟踪方面存在困难。这种限制导致了由累积误差决定的有限视界解决方案,并且随着可区分性比率跨越临界阈值,跟踪精度会可预测地下降。

  7. RESEARCH · CL_20526 ·

    New paper proves AI models face 'Impossibility Triangle' trade-off

    Researchers have identified a fundamental trade-off in long-context models, proving that no single architecture can simultaneously achieve efficiency, compactness, and recall. The study formalizes this "Impossibility Tr…

  8. TOOL · CL_18843 ·

    New method aligns State Space Model inductive bias for better data efficiency

    Researchers have developed a new framework to align the inductive bias of State Space Models (SSMs) for improved data efficiency. This method, called Task-Dependent Initialization (TDI), matches the model's initial bias…

  9. TOOL · CL_15589 ·

    SSMProbe framework reveals importance of token order in visual representations

    Researchers have developed SSMProbe, a new framework for analyzing visual representations in AI models. This method utilizes State Space Models (SSMs) to account for the critical role of token order, challenging the tra…

  10. RESEARCH · CL_14356 ·

    New AI models tackle image and video restoration with advanced techniques

    Researchers have developed several new methods for image and video restoration tasks. One approach, Continuous Expert Assembly (CEA), uses a dynamic parameterization framework to adapt to diverse local degradation patte…

  11. RESEARCH · CL_09762 ·

    PKS4 scanners offer efficient video understanding with 10x lower training compute

    Researchers have introduced PKS$^4$, a novel approach to efficient video understanding that addresses the computational challenges of long video sequences. This method integrates a plug-and-play module with linear-compl…

  12. RESEARCH · CL_05127 ·

    StateX framework boosts RNN recall by expanding model states post-training

    Researchers have developed StateX, a post-training framework designed to improve the recall capabilities of recurrent neural networks (RNNs). This method efficiently expands the states of pre-trained RNNs, such as linea…

  13. RESEARCH · CL_01131 ·

    Apple researchers unveil parallel RNN training and enhanced SSMs at ICLR 2026

    Apple researchers are presenting new work at ICLR 2026, focusing on advancements in recurrent neural networks (RNNs) and state space models (SSMs). Their paper "ParaRNN" introduces a parallelized training framework that…

  14. RESEARCH · CL_04837 ·

    Mamba model offers Transformer-level performance with faster inference and longer context

    Mamba, a new State Space Model (SSM), presents an alternative to the dominant Transformer architecture in AI. It aims to match Transformer performance and scaling laws while efficiently handling extremely long sequences…