Qwen3-VL model refined for semiconductor defect detection

By PulseAugur Editorial · [1 sources] · 2026-06-09 04:00

Researchers have developed a two-stage vision-language model to improve the accuracy of detecting defects in semiconductor lithography images. The first stage uses a fine-tuned Qwen3-VL model to identify defect counts, categories, and locations. A second stage then refines these initial predictions by learning from the first stage's errors, thereby enhancing overall defect inference. AI

IMPACT Introduces a novel two-stage refinement approach for vision-language models, potentially improving accuracy in specialized industrial applications like defect detection.

RANK_REASON Academic paper detailing a new model architecture and training methodology. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

arXiv cs.AI TIER_1 English(EN) · Pangyun Jeong, Jiyeong Kong, Yuehua Hu, Dohee Jeong, Kyung-Tae Kang · 2026-06-09 04:00

Failure-Aware Refinement of Vision-Language Model for Lithography Defect Detection

arXiv:2606.08908v1 Announce Type: cross Abstract: Semiconductor lithography inspection requires reliable detection of small pattern defects such as bridge, burr, pinch, and contamination. In this study, we propose a two-stage vision-language framework that combines initial defect…

COVERAGE [1]

Failure-Aware Refinement of Vision-Language Model for Lithography Defect Detection

RELATED ENTITIES

RELATED TOPICS