English(EN) RTPrune: Reading-Twice Inspired Token Pruning for Efficient DeepSeek-OCR Inference

RTPrune 通过新颖的令牌剪枝技术将 DeepSeek-OCR 推理速度提升 1.23 倍

作者 PulseAugur 编辑部 · [2 个来源] · 2026-05-01 04:30

研究人员开发了 RTPrune，一种新颖的两阶段令牌剪枝方法，旨在提高 DeepSeek-OCR 推理的效率。该方法模仿了模型两次阅读的过程，首先优先处理高范数令牌以获取显著信息，然后使用最优传输理论合并剩余令牌。RTPrune 还包含针对 OCR 任务定制的动态剪枝比例，实现了准确性和效率之间的卓越平衡。 AI

影响提高了 OCR 任务的推理速度和效率，可能降低处理长文档的计算成本。

排序理由这是一篇研究论文，详细介绍了一种优化现有 OCR 模型推理效率的新方法。

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CV TIER_1 English(EN) · Ben Wan, Yan Feng, Zihan Tang, Weizhe Huang, Yuting Zeng, Jia Wang, Tongxuan Liu · 2026-05-04 04:00

RTPrune: Reading-Twice Inspired Token Pruning for Efficient DeepSeek-OCR Inference

arXiv:2605.00392v1 Announce Type: new Abstract: DeepSeek-OCR leverages visual-text compression to reduce long-text processing costs and accelerate inference, yet visual tokens remain prone to redundant textual and structural information. Moreover, current token pruning methods fo…
arXiv cs.CV TIER_1 English(EN) · Tongxuan Liu · 2026-05-01 04:30

RTPrune: Reading-Twice Inspired Token Pruning for Efficient DeepSeek-OCR Inference

DeepSeek-OCR leverages visual-text compression to reduce long-text processing costs and accelerate inference, yet visual tokens remain prone to redundant textual and structural information. Moreover, current token pruning methods for conventional vision-language models (VLMs) fai…

报道来源 [2]

RTPrune: Reading-Twice Inspired Token Pruning for Efficient DeepSeek-OCR Inference

RTPrune: Reading-Twice Inspired Token Pruning for Efficient DeepSeek-OCR Inference

相关实体

相关话题