PulseAugur
EN
LIVE 06:51:34

New research suggests attention values better capture LLM sentence semantics

A new research paper proposes that attention values, rather than hidden states, are more effective for capturing sentence semantics in Large Language Models (LLMs). The paper introduces Value Aggregation (VA), a method that pools token values across layers and indices, outperforming existing LLM-based embeddings in a training-free setting. A refined technique, Aligned Weighted VA (AlignedWVA), further enhances performance by interpreting layer attention outputs as aligned weighted value vectors, achieving state-of-the-art results. AI

IMPACT Proposes a new method for generating more semantically rich sentence embeddings from LLMs, potentially improving downstream NLP applications.

RANK_REASON Academic paper detailing a novel method for LLM embeddings. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv cs.CL TIER_1 English(EN) · Yeqin Zhang, Yunfei Wang, Jiaxuan Chen, Ke Qin, Yizheng Zhao, Cam-Tu Nguyen ·

    LLM-based Embeddings: Attention Values Encode Sentence Semantics Better Than Hidden States

    arXiv:2602.01572v2 Announce Type: replace Abstract: Sentence representations are foundational to many Natural Language Processing (NLP) applications. While recent methods leverage Large Language Models (LLMs) to derive sentence representations, most rely on final-layer hidden sta…