Chinese-SkillSpan dataset released for job skill extraction

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 1 source

Researchers have introduced Chinese-SkillSpan, the first dataset specifically designed for extracting job-related skills from Chinese job advertisements. Aligned with the ESCO occupational skill standard, the dataset was built with a pipeline that combines LLM-powered annotation with expert review. It contains over 20,000 annotated instances collected from major recruitment platforms between 2014 and 2025. Chinese-SkillSpan aims to fill a gap in Chinese resources for Job Skill Named Entity Recognition (JobSkillNER) and to facilitate research in intelligent recruitment.

IMPACT Provides a new benchmark for Chinese job skill extraction, potentially improving talent matching and recruitment platforms.

RANK_REASON Release of a new dataset for a specific NLP task (JobSkillNER) and its associated annotation methodology.

COVERAGE [1]

arXiv cs.CL TIER_1 · Guojing Li, Zichuan Fu, Junyi Li, Wenxia Zhou, Xinyang Wu, Jinning Yang, Jingtong Gao, Feng Huang, Xiangyu Zhao · 2026-04-28 04:00

Chinese-SkillSpan: A Span-Level Dataset for ESCO-Aligned Competency Extraction from Chinese Job Ads

arXiv:2604.23009v1. Abstract: Job Skill Named Entity Recognition (JobSkillNER) aims to automatically extract key skill information from large-scale job posting data, which is important for improving talent-market matching efficiency and supporting personalized e…
