Researchers have introduced the Transformer Geometry Observatory (TGO), a framework designed to explore the representational geometry of Vision Transformers (ViTs). The initial installment, TGO-I, specifically examines the spectral geometry of ViT representations. Experiments using a ViT-Small/16 model trained on ImageNet-100 revealed that as training progresses, dimensional utilization increases, while anisotropy decreases. Contrary to expectations, information is redistributed across representational dimensions rather than concentrating into a few dominant directions, with the CLS token representation showing the highest effective dimensionality. AI
IMPACT Provides new insights into the internal workings of Vision Transformers, potentially guiding future model development and optimization.
RANK_REASON The cluster contains an academic paper detailing a new framework and experimental analysis for understanding AI models.
Read on Hugging Face Daily Papers →
- arXiv
- CLS token
- ImageNet-100
- TGO-I
- Transformer Geometry Observatory
- Vision Transformers
- ViTs
- ViT-Small/16
AI-generated summary · Google Gemini · from 3 sources. How we write summaries →