A new Python package called ANEForge allows developers to directly program the Apple Neural Engine (ANE) without relying on CoreML. This bypass enables more efficient use of the ANE, which is the dedicated neural accelerator in Apple devices. ANEForge compiles tensor graphs into ANE programs, supporting operations like fused attention, various weight formats, and even training steps directly on the engine. This allows for significantly faster execution of models, such as a ResNet-18 forward pass completing in 0.33ms. AI
RANK_REASON Research paper detailing a new software tool for hardware acceleration. [lever_c_demoted from research: ic=1 ai=1.0]
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →