PulseAugur
EN
LIVE 08:46:22

LLMs tested for Turkish scam detection using new audio-transcript dataset

Researchers have explored the effectiveness of large language models (LLMs) in detecting phone call scams in Turkish, a low-resource language. They introduced a new dataset of 100 aligned audio-transcript pairs of scam and benign conversations. The study evaluated seven LLMs, including Gemini 2.5 variants, GPT-4o, and Qwen models, using raw audio, automatic transcripts, and human-corrected transcripts. Results indicated that transcript-based inputs were more effective than direct audio processing, with human-corrected and uncorrected transcripts performing similarly. AI

IMPACT Highlights the need for more inclusive AI safety research and multi-modal systems for fraud prevention in low-resource languages.

RANK_REASON The cluster contains an academic paper detailing research on LLM capabilities for a specific task.

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

LLMs tested for Turkish scam detection using new audio-transcript dataset

COVERAGE [2]

  1. arXiv cs.AI TIER_1 English(EN) · Arda Eren, Micheal Cheung, Youqian Zhang, Grace Ngai, Eugene Yujun Fu ·

    Poster: Exploring the Limits of Audio-Based Detection of Turkish Phone Call Scams

    arXiv:2606.24523v1 Announce Type: cross Abstract: Scam phone calls exploit vulnerable communities worldwide, yet research on detection has focused almost exclusively on English and other high-resource languages. In low-resource settings such as Turkish, detection is especially di…

  2. arXiv cs.AI TIER_1 English(EN) · Eugene Yujun Fu ·

    Poster: Exploring the Limits of Audio-Based Detection of Turkish Phone Call Scams

    Scam phone calls exploit vulnerable communities worldwide, yet research on detection has focused almost exclusively on English and other high-resource languages. In low-resource settings such as Turkish, detection is especially difficult, as annotated data is scarce and technolog…