PulseAugur
实时 09:52:09
English(EN) VietFashion: Benchmarking Sketch-Text Composed Image Retrieval for Cultural Outfits

新的VietFashion基准针对文化服饰图像检索

研究人员推出了VietFashion,这是一个专为草图-文本组合图像检索设计的新基准,特别关注像越南传统奥黛(áo dài)这样的文化服饰。该基准利用手绘草图和文本描述的组合,以检索具有文化意义的服装,解决了标准AI模型在捕捉细微差别方面的局限性。该数据集包含超过21,000张图像,旨在通过纳入细粒度的文化语义和多目标检索设置来解决设计意图的模糊性,从而挑战当前的检索方法。 AI

影响 该基准可以推动时尚等专业领域的细粒度视觉检索,可能提高AI对文化细微差别的理解。

排序理由 该集群描述了一个针对特定AI任务的新学术基准和数据集,发布在arXiv上。

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

报道来源 [2]

  1. arXiv cs.CV TIER_1 English(EN) · Hoang-Nguyen Cao, Le-Hoang Bui, Dinh-Khoi Vo, Minh-Triet Tran, Trung-Nghia Le ·

    VietFashion: Benchmarking Sketch-Text Composed Image Retrieval for Cultural Outfits

    arXiv:2606.13427v1 Announce Type: new Abstract: Cultural garments pose a unique challenge for visual retrieval systems, as their identity often depends on subtle structural and symbolic details that are poorly captured by standard AI models. We introduce VietFashion, a new benchm…

  2. arXiv cs.CV TIER_1 English(EN) · Trung-Nghia Le ·

    VietFashion: Benchmarking Sketch-Text Composed Image Retrieval for Cultural Outfits

    Cultural garments pose a unique challenge for visual retrieval systems, as their identity often depends on subtle structural and symbolic details that are poorly captured by standard AI models. We introduce VietFashion, a new benchmark for sketch-text composed image retrieval cen…