PulseAugur
EN
LIVE 11:30:23

New dataset AutoSpecNER targets vehicle specification extraction

Researchers have introduced AutoSpecNER, a new dataset designed for fine-grained named entity recognition in vehicle advertisements. The dataset comprises 659 advertisements with over 10,000 entities annotated across 15 categories, including model and battery capacity, achieving a 91.5% inter-annotator agreement score. Benchmarking various methods, the DeBERTa model demonstrated the highest performance with a 90% micro-F1 score, significantly outperforming rule-based systems and other large language models. AI

IMPACT This dataset could improve the accuracy of information extraction from automotive listings, benefiting platforms and consumers.

RANK_REASON The cluster describes a new academic dataset and benchmark results for a specific NLP task.

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

New dataset AutoSpecNER targets vehicle specification extraction

COVERAGE [2]

  1. arXiv cs.CL TIER_1 English(EN) · Jordan Lee, Filippos Ventirozos, Abdirahman Abdullahm, Ioanna Nteka, Peter Appleby, Matthew Shardlow ·

    AutoSpecNER: A Fine-Grained Named Entity Recognition Dataset for Vehicle Specification Extraction

    arXiv:2606.24387v1 Announce Type: new Abstract: Vehicle advertisements contain rich specification information, but automotive NER resources remain limited. We introduce AutoSpecNER, an expert-annotated dataset for fine-grained entity recognition in vehicle listings. The dataset i…

  2. arXiv cs.CL TIER_1 English(EN) · Matthew Shardlow ·

    AutoSpecNER: A Fine-Grained Named Entity Recognition Dataset for Vehicle Specification Extraction

    Vehicle advertisements contain rich specification information, but automotive NER resources remain limited. We introduce AutoSpecNER, an expert-annotated dataset for fine-grained entity recognition in vehicle listings. The dataset includes 659 advertisements from a popular car-se…