New dataset and fine-tuned Qwen2.5 model boost classical Chinese poetry analysis

By PulseAugur Editorial · [1 sources] · 2026-06-10 17:54

Researchers have developed a new dataset, CCPoetry-49K, containing over 49,000 instruction-response pairs specifically for classical Chinese poetry analysis. They then fine-tuned the Qwen2.5-14B model using Low-Rank Adaptation (LoRA) to create PoetryQwen, a domain-specialized LLM. This new model achieved a score of 0.757 on the CCL25-Eval Task 5 benchmark, outperforming the baseline Qwen2.5-14B-Instruct by 9.7% and demonstrating significant improvements in precise translation and emotional understanding of classical poetry. AI

IMPACT Enhances LLM capabilities for nuanced tasks like classical poetry translation and emotional inference, potentially paving the way for more specialized AI applications.

RANK_REASON The cluster describes a new dataset and a fine-tuned model presented in a system report on arXiv, detailing performance improvements on a specific benchmark. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

arXiv cs.AI TIER_1 English(EN) · Haotao Xie · 2026-06-10 17:54

System Report for CCL25-Eval Task 5: New Dataset and LoRA-Fine-Tuned Qwen2.5

Recently, large language models (LLMs) have achieved promising progress in the fields of classical Chinese translation and the generation of classical poetry. However, domain-specific research on precise translation and affective-semantic understanding of classical poetry remains…

COVERAGE [1]

System Report for CCL25-Eval Task 5: New Dataset and LoRA-Fine-Tuned Qwen2.5

RELATED ENTITIES

RELATED TOPICS