LLMs show potential for asylum decision credibility assessment

By PulseAugur Editorial · [1 sources] · 2026-05-13 12:07

Researchers have explored the use of large language models (LLMs) for annotating credibility assessments in Danish asylum decisions, a novel legal NLP task. They introduced the RAB-Cred dataset, featuring expert annotations and metadata, to evaluate 21 open-weight models and various prompt combinations in zero-shot and few-shot settings. The study found that while LLMs show potential for cost-effective labeling, their annotations are imperfect and inconsistent, necessitating careful consideration beyond single model predictions. AI

IMPACT Demonstrates LLM utility in specialized legal domains, but highlights the need for careful validation of their outputs.

RANK_REASON Academic paper detailing a novel dataset and LLM evaluation for a specific NLP task. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

arXiv cs.AI TIER_1 English(EN) · Thomas B. Moeslund · 2026-05-13 12:07

LLMs as annotators of credibility assessment in Danish asylum decisions: evaluating classification performance and errors beyond aggregated metrics

Off-the-shelf large language models (LLMs) are increasingly used to automate text annotation, yet their effectiveness remains underexplored for underrepresented languages and specialized domains where the class definition requires subtle expert understanding. We investigate LLM-b…

COVERAGE [1]

LLMs as annotators of credibility assessment in Danish asylum decisions: evaluating classification performance and errors beyond aggregated metrics

RELATED ENTITIES

RELATED TOPICS