New AI models enhance image and video super-resolution with diffusion and efficient architectures

By PulseAugur Editorial · [14 sources] · 2026-04-22 07:39

Researchers are developing new methods for image and video super-resolution using advanced AI techniques. Several papers explore diffusion models for joint spatiotemporal super-resolution, enabling adaptation across different spatial and temporal scales. Other work focuses on efficient single-image super-resolution through quantization and teacher-guided training, as well as multi-frame super-resolution for specialized image sensors. Additionally, generative priors and ensemble methods are being leveraged to enhance detail recovery and bridge the gap between restoration and generation in real-world super-resolution tasks. AI

IMPACT Advances in AI-driven super-resolution techniques promise enhanced detail and efficiency for image and video processing applications.

RANK_REASON Multiple arXiv papers detailing novel research methodologies and frameworks for image and video super-resolution.

Read on arXiv cs.CV →

AI-generated summary · Google Gemini · from 14 sources. How we write summaries →

New AI models enhance image and video super-resolution with diffusion and efficient architectures

COVERAGE [14]

arXiv cs.LG TIER_1 English(EN) · Tom Beucler · 2026-04-23 17:49

A Scale-Adaptive Framework for Joint Spatiotemporal Super-Resolution with Diffusion Models

Deep-learning video super-resolution has progressed rapidly, but climate applications typically super-resolve (increase resolution) either space or time, and joint spatiotemporal models are often designed for a single pair of super-resolution (SR) factors (upscaling spatial and t…
Hugging Face Daily Papers TIER_1 English(EN) · 2026-04-22 07:39

Efficient INT8 Single-Image Super-Resolution via Deployment-Aware Quantization and Teacher-Guided Training

Efficient single-image super-resolution (SISR) requires balancing reconstruction fidelity, model compactness, and robustness under low-bit deployment, which is especially challenging for x3 SR. We present a deployment-oriented quantized SISR framework based on an extract-refine-u…
arXiv cs.CV TIER_1 English(EN) · Jinpei Guo, Yifei Ji, Shengwei Wang, Zheng Chen, Yufei Wang, Sizhuo Ma, Yong Guo, Baiang Li, Jusheng Zhang, Yulun Zhang, Jian Wang · 2026-04-30 04:00

Towards Redundancy Reduction in Diffusion Models for Efficient Video Super-Resolution

arXiv:2509.23980v2 Announce Type: replace Abstract: Diffusion models have recently shown promising results for video super-resolution (VSR). However, directly adapting generative diffusion models to VSR can result in redundancy, since low-quality videos already preserve substanti…
arXiv cs.CV TIER_1 English(EN) · Maitreya Patel, Jingtao Li, Weiming Zhuang, Yezhou Yang, Lingjuan Lv · 2026-04-29 04:00

VibeToken: Scaling 1D Image Tokenizers and Autoregressive Models for Dynamic Resolution Generations

arXiv:2604.24885v1 Announce Type: new Abstract: We introduce an efficient, resolution-agnostic autoregressive (AR) image synthesis approach that generalizes to arbitrary resolutions and aspect ratios, narrowing the gap to diffusion models at scale. At its core is VibeToken, a nov…
arXiv cs.CV TIER_1 English(EN) · Fabio D'Oronzio, Federico Putamorsi, Leonardo Zini, Marcella Cornia, Lorenzo Baraldi · 2026-04-29 04:00

GramSR: Visual Feature Conditioning for Diffusion-Based Super-Resolution

arXiv:2604.25457v1 Announce Type: new Abstract: Despite recent advances, single-image super-resolution (SR) remains challenging, especially in real-world scenarios with complex degradations. Diffusion-based SR methods, particularly those built on Stable Diffusion, leverage strong…
arXiv cs.CV TIER_1 English(EN) · Lorenzo Baraldi · 2026-04-28 10:05

GramSR: Visual Feature Conditioning for Diffusion-Based Super-Resolution

Despite recent advances, single-image super-resolution (SR) remains challenging, especially in real-world scenarios with complex degradations. Diffusion-based SR methods, particularly those built on Stable Diffusion, leverage strong generative priors but commonly rely on text con…
arXiv cs.CV TIER_1 English(EN) · Shyang-En Weng, Yi-Cheng Liao, Yu-Syuan Xu, Wei-Chen Chiu, Ching-Chun Huang · 2026-04-28 04:00

Bridging Restoration and Generation Manifolds in One-Step Diffusion for Real-World Super-Resolution

arXiv:2604.24136v1 Announce Type: new Abstract: Pretrained diffusion models have revolutionized real-world image super-resolution (Real-ISR) but suffer from computational bottlenecks due to iterative sampling. Recent single-step distillation accelerates inference but faces a star…
arXiv cs.CV TIER_1 English(EN) · Sangwook Baek, Vin Van Duong, Karam Park, Pilkyu Park · 2026-04-28 04:00

LatentBurst: A Fast and Efficient Multi Frame Super-Resolution for Hexadeca-Bayer Pattern CIS images

arXiv:2604.23268v1 Announce Type: new Abstract: This paper introduces a novel multi frame super-resolution network (MFSR) for burst hexadeca Bayer pattern Contact Image Sensor (CIS) images, which includes demosaicing, denoising, multi-frame fusion, and super-resolution. Designing…
arXiv cs.CV TIER_1 English(EN) · Dong Huo, Tristan Aumentado-Armstrong, Samrudhdhi B. Rangrej, Maitreya Suin, Angela Ning Ye, Zhiming Hu, Amanpreet Walia, Amirhossein Kazerouni, Konstantinos G. Derpanis, Iqbal Mohomed, Alex Levinshtein · 2026-04-28 04:00

BurstGP: Enhancing Raw Burst Image Super Resolution with Generative Priors

arXiv:2604.23508v1 Announce Type: new Abstract: Burst image super resolution (BISR) aims to construct a single high-resolution (HR) image by aggregating information from multiple low-resolution (LR) frames, relying on temporal redundancy and spatial coherence across the burst. Wh…
arXiv cs.CV TIER_1 English(EN) · Gengjia Chang, Xining Ge, Weijun Yuan, Zhan Li, Qiurong Song, Luen Zhu, Shuhong Liu · 2026-04-28 04:00

Training-Free Model Ensemble for Single-Image Super-Resolution via Strong-Branch Compensation

arXiv:2604.11564v2 Announce Type: replace Abstract: Single-image super-resolution has progressed from deep convolutional baselines to stronger Transformer and state-space architectures, yet the corresponding performance gains typically come with higher training cost, longer engin…
arXiv cs.CV TIER_1 English(EN) · Lingjuan Lv · 2026-04-27 18:08

VibeToken: Scaling 1D Image Tokenizers and Autoregressive Models for Dynamic Resolution Generations

We introduce an efficient, resolution-agnostic autoregressive (AR) image synthesis approach that generalizes to arbitrary resolutions and aspect ratios, narrowing the gap to diffusion models at scale. At its core is VibeToken, a novel resolution-agnostic 1D Transformer-based imag…
arXiv cs.CV TIER_1 English(EN) · Ching-Chun Huang · 2026-04-27 07:43

Bridging Restoration and Generation Manifolds in One-Step Diffusion for Real-World Super-Resolution

Pretrained diffusion models have revolutionized real-world image super-resolution (Real-ISR) but suffer from computational bottlenecks due to iterative sampling. Recent single-step distillation accelerates inference but faces a stark perception-distortion trade-off due to rigid t…
arXiv cs.CV TIER_1 English(EN) · Rashid Zia · 2026-04-23 16:05

Multiscale Super Resolution without Image Priors

We address the ambiguities in the super-resolution problem under translation. We demonstrate that combinations of low-resolution images at different scales can be used to make the super-resolution problem well posed. Such differences in scale can be achieved using sensors with di…
arXiv cs.CV TIER_1 English(EN) · Jie Zhou · 2026-04-23 09:09

VARestorer: One-Step VAR Distillation for Real-World Image Super-Resolution

Recent advancements in visual autoregressive models (VAR) have demonstrated their effectiveness in image generation, highlighting their potential for real-world image super-resolution (Real-ISR). However, adapting VAR for ISR presents critical challenges. The next-scale predictio…

COVERAGE [14]

RELATED ENTITIES

RELATED TOPICS