A new study published on arXiv investigates the impact of Vision Transformer (ViT) architectures on demographic bias in face presentation attack detection (PAD) systems. The research compares ViTs against convolutional neural networks (CNNs) using the CASIA-SURF Cross-Ethnicity Face Anti-Spoofing (CeFA) dataset. Results indicate that ViT models, specifically the DeiT-S architecture, achieve higher accuracy and significantly reduce performance disparities across different ethnic groups compared to CNN baselines. AI
IMPACT Vision Transformer architectures may offer a path toward more equitable and robust face anti-spoofing systems, reducing bias against darker skin tones.
RANK_REASON Academic paper detailing a comparative study of AI model architectures. [lever_c_demoted from research: ic=1 ai=1.0]
- CASIA-SURF Cross-Ethnicity Face Anti-Spoofing
- convolutional neural network
- DeiT-S
- Jema David Ndibwile
- ResNet18
- Vision Transformers
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →