PulseAugur
EN
LIVE 10:04:36

New framework rectifies noisy cross-modal data using graph reasoning

Researchers have developed a new framework called Intra-modal Neighbor-aware Noise Rectification (IN2R) to improve the accuracy of cross-modal retrieval by addressing noise in large web-harvested datasets. Unlike previous methods that filter or replace noisy labels, IN2R synthesizes a reliable supervision target by leveraging the geometric stability of intra-modal data. The framework uses a Graph Refiner and a Cross-Model Memory to reason over neighbors and create a continuous, soft prototype that reflects local semantic consensus, thereby rectifying inter-modal misalignment. Experiments on benchmark datasets like Flickr30K and MS-COCO show that IN2R significantly outperforms existing state-of-the-art methods. AI

IMPACT Improves data quality for cross-modal AI tasks, potentially enhancing generalization in retrieval models.

RANK_REASON Academic paper detailing a new method for improving cross-modal retrieval. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CV →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv cs.CV TIER_1 English(EN) · Yang Liu, Wentao Feng, Shu-Dong Huang, Yalan Ye, Jiancheng Lv ·

    Intra-Modal Neighbors Never Lie: Rectifying Inter-Modal Noisy Correspondence via Graph-Based Intra-Modal Reasoning

    arXiv:2606.04061v1 Announce Type: new Abstract: Large-scale web-harvested datasets have fueled the progress of cross-modal retrieval but inevitably suffer from noisy correspondence, which severely degrades model generalization. Existing methods primarily address this by filtering…