PulseAugur
EN
LIVE 10:55:46

Survey maps 120 sign language datasets, calls for standardization

A new survey paper published on arXiv details the current landscape of sign language datasets, highlighting significant challenges in their development and utilization. The paper identifies fragmented datasets, inconsistent annotations, and limited linguistic coverage as major hurdles in advancing sign language recognition and translation technologies. To address these issues, the authors have compiled a comprehensive index of 120 resources across 35 sign languages and introduced a standardized 24-field Sign-Language Datasheet, along with a public GitHub repository to promote reproducible evaluation and inclusive technology development. AI

IMPACT This survey aims to standardize documentation and evaluation for sign language technologies, potentially accelerating the development of more inclusive and robust AI systems for the deaf and hard-of-hearing communities.

RANK_REASON The item is a survey paper published on arXiv detailing resources and benchmarks in a specific research area. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Survey maps 120 sign language datasets, calls for standardization

COVERAGE [1]

  1. arXiv cs.AI TIER_1 English(EN) · Yiming Ni, Zhi-Qi Cheng, Jiayu Li, Wei Cheng ·

    Sign-Language Datasets at Scale: A Comprehensive Survey on Resources, Benchmarks, and Annotation Standards

    arXiv:2606.19352v1 Announce Type: cross Abstract: Sign languages are expressive visual languages used by Deaf and Hard-of-Hearing (DHH) communities. Despite substantial progress in sign-language recognition, translation, and production, advances remain constrained by fragmented d…