Tempus framework offers scalable, resource-efficient GEMM for edge AI

作者 PulseAugur 编辑部 · [2 个来源] · 2026-05-01 09:28

Researchers have developed Tempus, a new framework designed to optimize General Matrix Multiplication (GEMM) for edge AI deployments on AMD Versal SoCs. Unlike existing spatial scaling methods that fail on resource-constrained devices, Tempus uses a fixed compute block and temporal scaling through iterative execution and data tiling. This approach achieves significant performance gains, delivering 607 GOPS at 10.677W while demonstrating superior resource and power frugality compared to prior state-of-the-art methods. AI

影响 Enables more efficient LLM inference on resource-constrained edge devices by optimizing core matrix multiplication operations.

排序理由 Academic paper detailing a new framework for optimizing AI inference on edge hardware.

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.LG TIER_1 English(EN) · M. Grailoo, J. N\'u\~nez-Y\'a\~nez · 2026-05-04 04:00

Tempus: A Temporally Scalable Resource-Invariant GEMM Streaming Framework for Versal AI Edge

arXiv:2605.00536v1 Announce Type: cross Abstract: Scaling laws for Large Language Models (LLMs) establish that model quality improves with computational scale, yet edge deployment imposes strict constraints on compute, memory, and power. Since General Matrix Multiplication (GEMM)…
arXiv cs.LG TIER_1 English(EN) · J. Núñez-Yáñez · 2026-05-01 09:28

Tempus: A Temporally Scalable Resource-Invariant GEMM Streaming Framework for Versal AI Edge

Scaling laws for Large Language Models (LLMs) establish that model quality improves with computational scale, yet edge deployment imposes strict constraints on compute, memory, and power. Since General Matrix Multiplication (GEMM) accounts for up to 90\% of inference time, effici…

报道来源 [2]

Tempus: A Temporally Scalable Resource-Invariant GEMM Streaming Framework for Versal AI Edge

Tempus: A Temporally Scalable Resource-Invariant GEMM Streaming Framework for Versal AI Edge

相关实体

相关话题