ENTITY AlpacaEval

AlpacaEval

PulseAugur coverage of AlpacaEval — every cluster mentioning AlpacaEval across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

5 over 90d

Releases · 30d

0 over 90d

Papers · 30d

5 over 90d

TIER MIX · 90D

TOPICS

SENTIMENT · 30D

1 day(s) with sentiment data

RECENT · PAGE 1/1 · 5 TOTAL

RESEARCH · CL_93583 · Jun 15 · 10:30

New DoubtProbe defense significantly reduces LLM jailbreaks

Researchers have developed DoubtProbe, a novel defense mechanism designed to counter jailbreaking attempts on large language models (LLMs) in black-box scenarios. This dual-branch framework combines structural verificat…
RESEARCH · CL_62284 · May 29 · 10:49

EvoDefense uses LLMs to co-evolve defenses against black-box attacks

Researchers have developed EvoDefense, a novel approach to protect large language models (LLMs) from attacks in black-box scenarios. This system uses a guard LLM and an experience memory to continuously refine defense s…
RESEARCH · CL_10517 · Apr 30 · 10:24

IBM's new 8B Granite 4.1 model outperforms older 32B MoE version

IBM has released Granite 4.1, a family of open-source language models designed for enterprise use, featuring three sizes (3B, 8B, and 30B parameters). Notably, the 8B dense model demonstrates performance matching or exc…
RESEARCH · CL_06752 · Apr 28 · 04:00

Researchers develop new methods to debias and improve reward models for LLMs

Researchers have developed new methods to improve the reliability and interpretability of reward models (RMs) used in aligning large language models (LLMs). One approach introduces a causally motivated intervention tech…
RESEARCH · CL_44017 · Apr 17 · 00:00

New DPO methods enhance LLM alignment with adaptive techniques

Researchers have developed several advancements to Direct Preference Optimization (DPO), a method for aligning large language models (LLMs) with human preferences. AdaDPO introduces self-adaptive coefficients to balance…

New DoubtProbe defense significantly reduces LLM jailbreaks

EvoDefense uses LLMs to co-evolve defenses against black-box attacks

IBM's new 8B Granite 4.1 model outperforms older 32B MoE version

Researchers develop new methods to debias and improve reward models for LLMs

New DPO methods enhance LLM alignment with adaptive techniques