Researchers have developed CapStARE, a novel capsule-based architecture for gaze estimation. This system utilizes a frozen ConvNeXt backbone for efficient feature extraction and capsule formation with attention-based routing for structured facial reasoning. It employs dual GRU decoders for lightweight sequential modeling, achieving real-time inference speeds and strong performance on benchmark datasets like ETH-XGaze and MPIIFaceGaze. AI
IMPACT This new architecture offers a practical and robust framework for real-time gaze estimation, potentially improving human-computer interaction and robotics applications.
RANK_REASON The cluster contains a new academic paper detailing a novel architecture for gaze estimation. [lever_c_demoted from research: ic=1 ai=1.0]
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →