PulseAugur
实时 14:06:20
English(EN) Advancing WordArt-Oriented Scene Text Recognition: Datasets and Methods

新的WATERec模型利用大型合成数据集推进艺术文本识别

研究人员开发了一种新方法WATERec,以改进艺术文本(称为WordArt)的识别。由于其复杂的字体和布局,WordArt比标准的场景文本识别更具挑战性。为解决此问题,他们创建了一个大型合成数据集WATER-S和一个新颖的模型架构,该架构使用视觉编码器处理任意形状的输入,并使用自回归解码器。该方法在WordArt-Bench上达到了90.40%的准确率,优于现有的通用和OCR专用视觉语言模型。 AI

影响 这项研究可能带来更强大的OCR系统,能够处理多样化和风格化的文本,从而改进文档分析和图像理解等应用。

排序理由 该集群描述了一篇详细介绍特定计算机视觉任务的新颖方法和数据集的学术论文。

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 4 个来源。 我们如何撰写摘要 →

新的WATERec模型利用大型合成数据集推进艺术文本识别

报道来源 [4]

  1. Hugging Face Daily Papers TIER_1 English(EN) ·

    Advancing WordArt-Oriented Scene Text Recognition: Datasets and Methods

    WordArt (artistic text) features highly customized fonts, textures, and layouts, making WordArt-oriented scene TExt Recognition (WATER) substantially more challenging than general Scene Text Recognition (STR). Existing STR datasets and methods, typically built around regular scen…

  2. Hugging Face Daily Papers TIER_1 English(EN) ·

    Advancing WordArt-Oriented Scene Text Recognition: Datasets and Methods

    A large-scale synthetic dataset and specialized model architecture are introduced to address the challenges of artistic text recognition by improving data diversity and model flexibility for irregular text layouts.

  3. arXiv cs.CV TIER_1 English(EN) · Xingsong Ye, Yongkun Du, Jiaxin Zhang, Haojie Zhang, Chong Sun, Chen Li, Jing Lyu, Zhineng Chen ·

    Advancing WordArt-Oriented Scene Text Recognition: Datasets and Methods

    arXiv:2606.24484v1 Announce Type: new Abstract: WordArt (artistic text) features highly customized fonts, textures, and layouts, making WordArt-oriented scene TExt Recognition (WATER) substantially more challenging than general Scene Text Recognition (STR). Existing STR datasets …

  4. arXiv cs.CV TIER_1 English(EN) · Zhineng Chen ·

    Advancing WordArt-Oriented Scene Text Recognition: Datasets and Methods

    WordArt (artistic text) features highly customized fonts, textures, and layouts, making WordArt-oriented scene TExt Recognition (WATER) substantially more challenging than general Scene Text Recognition (STR). Existing STR datasets and methods, typically built around regular scen…