Cosine similarity of label representations offers no insight into softmax classifier probabilities

By PulseAugur Editorial · [1 sources] · 2026-06-02 04:00

Researchers have demonstrated that the cosine similarity of label representations, often used to gauge model similarity, provides no information about the probabilities assigned by a softmax classifier. The study proves that for any given unembeddings, it's possible to construct a model that assigns identical probabilities to all inputs while altering the cosine similarity to either 1 or -1. While cosine similarity can define label combinations for sigmoid classifiers, it requires all pairwise cosine similarities between unembedding differences for softmax classifiers to predict rankings, indicating that interpreting cosine similarity without context of the classifier is misleading. AI

IMPACT Clarifies limitations of a common metric for evaluating model representations, potentially guiding future research in model interpretability.

RANK_REASON The cluster contains an academic paper detailing novel research findings. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.LG →

Beatrix Miranda Ginn Nielsen

paper
other

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

arXiv cs.LG TIER_1 English(EN) · Beatrix M. G. Nielsen, Andreas Grivas · 2026-06-02 04:00

What Cosine Similarity of Label Representations Can and Cannot Tell us

arXiv:2603.29488v2 Announce Type: replace Abstract: Cosine similarity is often used to measure the similarity of vector representations of neural network models. However, the cosine similarity of representations is not guaranteed to tell us anything about model probabilities. In …

COVERAGE [1]

What Cosine Similarity of Label Representations Can and Cannot Tell us

RELATED TOPICS