PulseAugur
EN
LIVE 11:44:24

ITNet architecture unifies convolution, attention, and recurrence

Researchers have introduced ITNet, a novel neural network architecture that unifies convolution, attention, and recurrence into a single learnable integral transform. This architecture uses a learnable kernel, implemented as an MLP, to model pairwise interactions, allowing it to adapt its behavior from data. ITNet can recover the functionalities of various existing architectures, including LSTMs, GRUs, S4, Mamba, and self-attention, by adjusting its parameters. The model has demonstrated competitive or superior performance across multiple benchmarks such as ImageNet-1K, GLUE, ModelNet40, VQA v2, and NLVR2. AI

IMPACT Unifies disparate neural network architectures, potentially simplifying model design and improving performance across various tasks.

RANK_REASON The item is an academic paper introducing a new neural network architecture. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

ITNet architecture unifies convolution, attention, and recurrence

COVERAGE [1]

  1. arXiv cs.AI TIER_1 English(EN) · Ashim Dhor, Rasel Mondal, Pin Yu Chen ·

    ITNet: A Learnable Integral Transform That Subsumes Convolution, Attention, and Recurrence

    arXiv:2606.19538v1 Announce Type: new Abstract: Convolutional networks, recurrent networks, and transformers each encode different inductive biases -- locality, sequential memory, and content-dependent pairwise interaction -- and have remained mathematically distinct since their …