HaluEval
PulseAugur coverage of HaluEval — every cluster mentioning HaluEval across labs, papers, and developer communities, ranked by signal.
1 day(s) with sentiment data
-
Research finds truthfulness is inherited across LLM model families
A new research paper explores the preservation of contextual truthfulness across model lineages, finding that truth scores are strongly maintained from foundational large language models (LLMs) to their downstream varia…
-
New research reveals limits of spectral diagnostics in understanding LLM hallucinations
Researchers have developed a new diagnostic framework to understand how large language models hallucinate by analyzing their self-attention mechanisms. The proposed method, which focuses on the "transport" properties of…
-
New framework uses multiple LLMs to reduce hallucination and bias
Researchers have developed a new framework called Council Mode designed to mitigate hallucinations and biases in Large Language Models. This approach involves querying multiple diverse LLMs simultaneously and then synth…