FOVI: A biologically-inspired foveated interface for deep vision models
Researchers have developed FOVI, a novel interface for deep vision models inspired by human foveated vision. This system efficiently processes visual information by reformatting variable-resolution sensor data into a uniform manifold, mimicking the human retina and visual cortex. FOVI utilizes k-nearest-neighbor convolutions and can be applied to end-to-end architectures or adapted to existing models like DINOv3 using techniques such as LoRA, offering significant reductions in pixel count and computational cost while maintaining competitive performance. AI
IMPACT Enables more efficient processing of high-resolution visual data for AI systems, potentially reducing computational costs.