English(EN) FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource Isolation

FlexServe 提升了移动设备上 LLM 的推理速度和安全性

作者 PulseAugur 编辑部 · [2 个来源] · 2026-07-03 04:00

研究人员开发了 FlexServe，一个旨在提高移动设备上大型语言模型 (LLM) 推理速度和安全性的新型系统。通过利用 ARM TrustZone 技术，FlexServe 为内存 (Flex-Mem) 和神经网络处理单元 (Flex-NPU) 引入了灵活的资源隔离，允许在受保护和非受保护模式之间高效切换。与现有方法相比，这种方法显著降低了 TrustZone 通常带来的开销，在首次令牌生成时间 (TTFT) 和多模型工作流的端到端性能方面实现了大幅提速。 AI

影响该系统可以在用户设备上直接实现更强大、更私密的 LLM 应用，减少对云基础设施的依赖。

排序理由该集群描述了一篇关于移动设备 LLM 服务系统的新研究论文。

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.LG TIER_1 English(EN) · Yinpeng Wu, Yitong Chen, Lixiang Wang, Jinyu Gu, Zhichao Hua, Yubin Xia · 2026-07-03 04:00

FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource Isolation

arXiv:2603.09046v3 Announce Type: replace-cross Abstract: Device-side Large Language Models (LLMs) have witnessed explosive growth, offering higher privacy and availability compared to cloud-side LLMs. During LLM inference, both model weights and user data are valuable, and attac…
arXiv cs.LG TIER_1 English(EN) · Yinpeng Wu, Yitong Chen, Lixiang Wang, Jinyu Gu, Zhichao Hua, Yubin Xia · 2026-07-03 04:00

FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource Isolation

arXiv:2606.23370v2 Announce Type: replace-cross Abstract: Device-side Large Language Models (LLMs) have grown explosively, offering stronger privacy and higher availability than their cloud-side counterparts. During LLM inference, both the model weights and the user data are valu…

报道来源 [2]

FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource Isolation

FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource Isolation

相关实体

相关话题