ENTITY ModernBERT

ModernBERT

PulseAugur coverage of ModernBERT — every cluster mentioning ModernBERT across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

15 over 90d

Releases · 30d

0 over 90d

Papers · 30d

12 over 90d

TIER MIX · 90D

TOPICS

SENTIMENT · 30D

6 day(s) with sentiment data

RECENT · PAGE 1/1 · 15 TOTAL

RESEARCH · CL_109527 · Jun 24 · 13:00

Encoder classifiers offer cost-effective LLM safety evaluation, study finds

A new research paper explores the effectiveness of encoder classifiers, specifically from the ModernBERT family, as a cost-efficient alternative to LLM-based judges for evaluating the safety of large language model outp…
TOOL · CL_105175 · Jun 21 · 23:44

moBERTo: New Portuguese Language Model Enhances NLP Tasks

Researchers have introduced moBERTo, a new Portuguese language model derived from ModernBERT through continued pretraining. This model was trained on 60 billion tokens, incorporating data from FineWeb2 and filtered STEM…
RESEARCH · CL_97776 · Jun 17 · 08:27

New technique improves SPLADE retrieval models with larger encoders

Researchers have identified a performance degradation issue when using larger, more powerful pretrained encoders with SPLADE, a neural sparse retrieval model. This problem, termed a "scale mismatch" in the MLM head, can…
RESEARCH · CL_98103 · Jun 17 · 00:00

New LOCUS corpus unlocks U.S. local ordinances for AI research · 2 sources tracked

Researchers have developed LOCUS, a comprehensive corpus of U.S. local ordinances, aiming to make this critical layer of American law accessible for large-scale research and AI applications. The corpus includes codes fr…
TOOL · CL_93606 · Jun 16 · 04:00

HyDRA framework dynamically routes LLM queries, cutting costs and improving efficiency

Researchers have developed HyDRA, a novel framework for dynamically routing queries to heterogeneous pools of large language models. Unlike previous methods that make binary strong-vs-weak decisions or require retrainin…
RESEARCH · CL_65596 · Jun 1 · 08:42

New clinical NLP models boost German and Norwegian medical text analysis

Researchers have developed new domain-specific language models for clinical NLP in German and Norwegian. The German ChristBERT models, based on RoBERTa, were trained on a 13.5GB corpus and outperform existing models on …
TOOL · CL_59786 · May 29 · 12:24

New Tool Offers Context-Aware Japanese Furigana

A new tool called ezfurigana.com has been developed to provide context-aware Japanese furigana. This tool utilizes the Sudachi tokenizer and the ModernBERT model to accurately display phonetic readings for Japanese text.
TOOL · CL_49804 · May 25 · 12:58

Character-trained AI models fail to maintain personas in agentic tasks

Researchers found that models fine-tuned for specific personas in a chat format struggle to maintain those personas when used in agentic settings. When these character-trained models were prompted to generate emails as …
RESEARCH · CL_50832 · May 22 · 23:49

Synthetic LLM data boosts patent classification, but volume is key

A new research paper investigates the effectiveness of synthetic data generated by large language models for low-resource multi-label patent classification. The study found that while synthetic data can significantly bo…
RESEARCH · CL_41749 · May 20 · 06:14

New methods tackle AI hallucinations in research and medical Q&A

Two new research papers address the critical issue of AI hallucinations in different domains. One paper introduces ACL-Verbatim, an extractive question-answering system designed to provide hallucination-free answers fro…
RESEARCH · CL_65906 · May 20 · 00:00

New methods tackle LLM hallucinations with graph-based and extractive approaches

Researchers are developing new methods to combat hallucinations in large language models, particularly in complex question-answering tasks. One approach involves using graph-based retrieval-augmented generation (RAG) sy…
RESEARCH · CL_32075 · May 14 · 18:55

Hugging Face releases open multilingual embedding models with 32K context

Hugging Face has released Granite Embedding Multilingual R2, a suite of open-source multilingual embedding models. The release includes a 97M-parameter compact model that leads in retrieval quality among open models und…
TOOL · CL_29410 · May 12 · 17:16

AI predicts human rater disagreement in LLM-generated difficulty scores

Researchers have developed a new method to predict when AI-generated difficulty ratings for educational materials might disagree with human assessments. This approach uses a separate embedding space, like ModernBERT, to…
RESEARCH · CL_11214 · Apr 29 · 17:37

IBM Granite releases two multilingual embedding models built on ModernBERT

IBM's Granite division has released two new multilingual embedding models, one with 97 million parameters and another with 311 million. These models are based on ModernBERT architecture and support over 200 languages, w…
RESEARCH · CL_06460 · Apr 28 · 04:00

AI models struggle with emotion nuance, researchers explore new evaluation and generation methods

Researchers are exploring the nuances of emotion in AI, with several papers focusing on Large Language Models (LLMs) and speech processing. One study investigates how well small language models preserve emotions during …

Encoder classifiers offer cost-effective LLM safety evaluation, study finds

moBERTo: New Portuguese Language Model Enhances NLP Tasks

New technique improves SPLADE retrieval models with larger encoders

New LOCUS corpus unlocks U.S. local ordinances for AI research · 2 sources tracked

HyDRA framework dynamically routes LLM queries, cutting costs and improving efficiency

New clinical NLP models boost German and Norwegian medical text analysis

New Tool Offers Context-Aware Japanese Furigana

Character-trained AI models fail to maintain personas in agentic tasks

Synthetic LLM data boosts patent classification, but volume is key

New methods tackle AI hallucinations in research and medical Q&A

New methods tackle LLM hallucinations with graph-based and extractive approaches

Hugging Face releases open multilingual embedding models with 32K context

AI predicts human rater disagreement in LLM-generated difficulty scores

IBM Granite releases two multilingual embedding models built on ModernBERT

AI models struggle with emotion nuance, researchers explore new evaluation and generation methods