New technique improves SPLADE retrieval models with larger encoders

By PulseAugur Editorial · [2 sources] · 2026-06-17 08:27

Researchers have identified a performance degradation issue when using larger, more powerful pretrained encoders with SPLADE, a neural sparse retrieval model. This problem, termed a "scale mismatch" in the MLM head, can lead to unstable training or even collapse. The researchers propose a simple, zero-cost adjustment that rescales the MLM head's projection before training, which significantly improves stability and retrieval effectiveness for models like ModernBERT and Ettin. This correction allows these larger backbones to match or surpass the performance of the classic BERT-SPLADE baseline. AI

IMPACT This research offers a method to enhance the performance of neural sparse retrieval systems by better calibrating larger pretrained models, potentially leading to more effective information retrieval.

RANK_REASON The cluster contains a research paper detailing a novel technique for improving neural sparse retrieval models.

Read on arXiv cs.IR (Information Retrieval) →

paper
infra

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

New technique improves SPLADE retrieval models with larger encoders

COVERAGE [2]

arXiv cs.AI TIER_1 English(EN) · Youngjoon Jang, Seongtae Hong, Jonah Turner, Heuiseok Lim · 2026-06-18 04:00

Rescaling MLM-Head for Neural Sparse Retrieval

arXiv:2606.18811v1 Announce Type: cross Abstract: Learned sparse retrieval (LSR) models such as SPLADE have traditionally used BERT-style masked language models as backbone encoders. A natural expectation is that replacing BERT with stronger pretrained encoders should improve ret…
arXiv cs.IR (Information Retrieval) TIER_1 English(EN) · Heuiseok Lim · 2026-06-17 08:27

Rescaling MLM-Head for Neural Sparse Retrieval

Learned sparse retrieval (LSR) models such as SPLADE have traditionally used BERT-style masked language models as backbone encoders. A natural expectation is that replacing BERT with stronger pretrained encoders should improve retrieval effectiveness. However, we find that under …

COVERAGE [2]

Rescaling MLM-Head for Neural Sparse Retrieval

Rescaling MLM-Head for Neural Sparse Retrieval

RELATED ENTITIES

RELATED TOPICS