PulseAugur
实时 12:28:59

Self-supervised vision models impact semantic image retrieval performance

A new paper analyzes how self-supervised learning (SSL) methods for vision impact semantic image retrieval systems. The research found that the geometric properties of the learned representations, specifically their isotropy and purity, significantly affect the performance of approximate nearest neighbor (ANN) indexing. Highly anisotropic and skewed representations can degrade search performance, even if they show high accuracy in other tasks. AI

影响 Highlights how latent space geometry in SSL vision models affects ANN indexing for image retrieval.

排序理由 Academic paper analyzing a specific aspect of self-supervised learning for image retrieval.

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

Self-supervised vision models impact semantic image retrieval performance

报道来源 [2]

  1. arXiv cs.CV TIER_1 English(EN) · Esteban Rodr\'iguez-Betancourt, Edgar Casasola-Murillo ·

    Geometric Analysis of Self-Supervised Vision Representations for Semantic Image Retrieval

    arXiv:2604.24469v1 Announce Type: cross Abstract: Content-based image retrieval (CBIR) systems enable users to search images based on visual content instead of relying on metadata. The text domain has benefited from vector search of representations created with unsupervised metho…

  2. arXiv cs.CV TIER_1 English(EN) · Edgar Casasola-Murillo ·

    Geometric Analysis of Self-Supervised Vision Representations for Semantic Image Retrieval

    Content-based image retrieval (CBIR) systems enable users to search images based on visual content instead of relying on metadata. The text domain has benefited from vector search of representations created with unsupervised methods such as BERT. However, modern self-supervised l…