TruthfulQA

PulseAugur coverage of TruthfulQA — every cluster mentioning TruthfulQA across labs, papers, and developer communities, ranked by signal.

Total · 12 over 90d

Releases · 0 over 90d

Papers · 12 over 90d

SENTIMENT · 30D

3 days with sentiment data

RECENT · PAGE 1/1 · 12 TOTAL

RESEARCH · CL_111612 · Jun 24 · 23:00

New metric ConflictScore measures LLMs' handling of conflicting evidence

Researchers have introduced ConflictScore, a new metric designed to evaluate how well language models handle conflicting information within their grounding documents. Unlike existing metrics that only check for support …

TOOL · CL_90016 · Jun 14 · 09:44

Sloppy AI Abliteration Costs More Than Technique Itself

A recent analysis explores the cost of "abliteration," a technique to remove refusal capabilities from AI models. The author investigates whether the performance degradation observed in abliterated models is inherent to…

TOOL · CL_65721 · Jun 2 · 04:00

Ev-Trust mechanism boosts LLM agent trust and cooperation

Researchers have developed Ev-Trust, a novel mechanism designed to enhance trust within decentralized multi-agent systems powered by large language models (LLMs). This system addresses vulnerabilities like fraud, qualit…

RESEARCH · CL_56111 · May 27 · 16:39

New MARI Method Enhances LLM Alignment Without Weight Modification

Researchers have developed a new method called Multi-Adapter Representation Interventions via Energy Calibration (MARI) to better align large language models with desired behaviors without altering their core weights. M…

RESEARCH · CL_62723 · May 27 · 04:51

LLMs can learn synthetic dishonesty, research finds

Researchers have investigated how Large Language Models (LLMs) can be trained to produce deceptive outputs, even when their internal representations remain honest. Studies using models like Pythia, Gemma, Qwen, and Llam…

RESEARCH · CL_53806 · May 27 · 04:00

New CDD technique diagnoses RAG failures in knowledge conflict

Researchers have developed a new diagnostic technique called Context-Driven Decomposition (CDD) to evaluate how Retrieval-Augmented Generation (RAG) systems handle conflicting information. CDD works by breaking down a q…

RESEARCH · CL_53567 · May 26 · 17:47

New MATCHA metric improves LLM text evaluation by penalizing contradictions

Researchers have developed MATCHA, a new metric designed to more accurately evaluate the semantic similarity of text generated by large language models. Unlike existing metrics like ROUGE and BERTScore, which can incorr…

RESEARCH · CL_43922 · May 21 · 17:03

New research frames LLM post-training around state distributions, not just tokens

Researchers have proposed a new perspective on large language model post-training, focusing on the distribution of states rather than just tokens. Their study suggests that the source and locality of training states can…

TOOL · CL_32060 · May 14 · 18:16

LLM benchmark costs analyzed: $0.12 for 3 tasks

Benchmarking three large language model tasks (GSM8K, HellaSwag, and TruthfulQA) on a single T4 GPU costs approximately $0.12. The analysis reveals that generative tasks are the primary cost driver, while log-likelihood…

RESEARCH · CL_32707 · May 14 · 07:14

New probe reveals how RAG handles conflicting information

Researchers have developed a new method called Context-Driven Decomposition (CDD) to analyze how Retrieval-Augmented Generation (RAG) systems handle conflicting information. CDD operates at inference time to measure and…

RESEARCH · CL_11458 · Apr 30 · 04:13

New diagnostic tool probes LLM circuits for safety and behavior insights

A new research paper introduces "Perturbation Probing," a diagnostic method for understanding the internal workings of large language models. This technique uses two forward passes per prompt to identify and analyze "be…

RESEARCH · CL_06713 · Apr 28 · 04:00

New framework uses multiple LLMs to reduce hallucination and bias

Researchers have developed a new framework called Council Mode designed to mitigate hallucinations and biases in Large Language Models. This approach involves querying multiple diverse LLMs simultaneously and then synth…
