PRISM: Prior Rectification and Uncertainty-Aware Structure Modeling for Diffusion-Based Text Image Super-Resolution

PRISM enhances text image super-resolution with novel rectification and refinement techniques

By the PulseAugur editorial team · [1 source] · 2026-05-13 05:31

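Researchers have developed PRISM, a diffusion-based framework for text image super-resolution that improves readability under severe degradation. The system uses Flow-Matching Prior Rectification (FMPR) to derive more accurate global text guidance from unreliable low-quality inputs. In addition, a Structure-guided Uncertainty-aware Residual Encoder (SURE) refines local stroke boundaries by selectively incorporating reliable cues and suppressing ambiguous ones. PRISM achieves state-of-the-art performance with fast inference.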
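The idea of incorporating reliable cues while suppressing ambiguous ones can be illustrated with a toy gating function. This is only a minimal sketch under assumptions, not the PRISM authors' implementation: the function name `gate_residual`, the flat-list feature layout, and the linear `1 - uncertainty` weighting are all hypothetical choices made for illustration.

```python
def gate_residual(base, residual, uncertainty):
    """Add a residual to base features, down-weighting uncertain positions.

    Hypothetical illustration only: each position's residual is scaled by
    (1 - uncertainty), so confident positions (uncertainty near 0) are
    refined and ambiguous positions (uncertainty near 1) stay unchanged.
    """
    if not (len(base) == len(residual) == len(uncertainty)):
        raise ValueError("all inputs must have the same length")

    fused = []
    for b, r, u in zip(base, residual, uncertainty):
        u = min(max(u, 0.0), 1.0)  # clamp uncertainty into [0, 1]
        fused.append(b + (1.0 - u) * r)
    return fused


# Position 0 is fully trusted, position 1 is fully ambiguous.
fused = gate_residual([1.0, 1.0], [0.5, 0.5], [0.0, 1.0])
```

In this toy setup the first position receives the full residual and the second is left untouched. A learned model would predict the uncertainty values rather than take them as input.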

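Impact: Introduces a new method for improving text readability in super-resolved images, which may benefit OCR and document-analysis applications.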

Ranking rationale: This cluster contains an academic paper detailing a new method for text image super-resolution.

AI-generated summary · Google Gemini · from 1 source.

Sources [1]

arXiv cs.CV · Xiaokang Yang · 2026-05-13 05:31

PRISM: Prior Rectification and Uncertainty-Aware Structure Modeling for Diffusion-Based Text Image Super-Resolution

Text image super-resolution (Text-SR) requires more than visually plausible detail synthesis: slight errors in stroke topology may alter character identity and break readability. Existing methods improve text fidelity with stronger recognition-based or generative priors, yet they…
