Is it possible to run a 14.5M tinyBERT LLM on esp32? Yes, you can but you need to get one of the best esp32 boards, namely ESP32-S3 N16R8 that has 16M Flash mem

TinyBERT LLM Successfully Runs on an ESP32 Microcontroller

By the PulseAugur editorial team · [1 source] · 2026-07-02 12:10

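A tinyBERT LLM with 14.5 million parameters can run on an ESP32 microcontroller, specifically the ESP32-S3 N16R8 model, which has 16 MB of flash and 8 MB of PSRAM. The process involves converting the model's parameter matrices to ONNX format and then quantizing them from 32-bit floating point to 4-bit integers. This reduction is needed because Transformer models typically perform matrix multiplication on 32-bit floats, and at 4 bytes per parameter the weights alone would take about 58 MB, well beyond the board's 16 MB of flash.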
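The storage argument can be checked with a few lines of arithmetic. The sketch below is illustrative only: it counts raw weight storage at several bit widths and compares it with the board's flash and PSRAM sizes. It ignores quantization scales, activations, and the firmware image, and the helper name `weight_bytes` is hypothetical rather than taken from the original post.

```python
# Back-of-the-envelope weight storage for a 14.5M-parameter model.
# Assumption: only raw weights are counted (no scales, activations, or firmware).

PARAMS = 14_500_000           # parameter count quoted in the post
FLASH_BYTES = 16 * 1024 ** 2  # 16 MB flash on the ESP32-S3 N16R8
PSRAM_BYTES = 8 * 1024 ** 2   # 8 MB PSRAM on the same board


def weight_bytes(n_params: int, bits_per_param: int) -> int:
    """Bytes needed to store n_params weights at the given bit width, rounded up."""
    return (n_params * bits_per_param + 7) // 8


if __name__ == "__main__":
    for bits in (32, 8, 4):
        size = weight_bytes(PARAMS, bits)
        print(
            f"{bits:>2}-bit weights: {size / 1e6:6.2f} MB, "
            f"fits in flash: {size <= FLASH_BYTES}, "
            f"fits in PSRAM: {size <= PSRAM_BYTES}"
        )
```

Under these assumptions, 32-bit weights need 58,000,000 bytes, while 4-bit weights need 7,250,000 bytes, which is below both the flash and the PSRAM capacity.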

Impact: This makes it possible to deploy small LLMs on resource-constrained edge devices, potentially expanding AI capabilities in embedded systems.

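Ranking rationale: The project describes a technical implementation for running an LLM on specific hardware, which places it in the tools or infrastructure category rather than a core AI release or research.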

AI-generated summary · Google Gemini · from 1 source.

Sources [1]

Mastodon · fosstodon.org · TIER_1 · English (EN) · 2026-07-02 12:10

Is it possible to run a 14.5M-parameter tinyBERT LLM on an ESP32? Yes, but you need one of the best ESP32 boards, namely the ESP32-S3 N16R8, which has 16 MB of flash memory and 8 MB of PSRAM. The first step is to convert tinyBERT's parameter matrices into ONNX format. Then quantize it to 4 …
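The post is cut off before it says how the quantization is done, so the following is only a generic sketch of one common approach: symmetric per-tensor quantization to signed 4-bit integers, with two values packed into each byte. The function names `quantize_int4` and `dequantize_int4` are hypothetical, and the scheme is an assumption, not the author's confirmed method.

```python
from typing import List, Tuple


def quantize_int4(weights: List[float]) -> Tuple[bytes, float]:
    """Quantize floats to signed 4-bit ints in [-8, 7] and pack two per byte.

    Assumed scheme: symmetric, per-tensor, scale = max|w| / 7.
    """
    max_abs = max((abs(w) for w in weights), default=0.0)
    scale = max_abs / 7.0 if max_abs > 0 else 1.0
    q = [max(-8, min(7, round(w / scale))) for w in weights]
    if len(q) % 2:
        q.append(0)  # pad so the values pair up evenly
    # Low nibble holds the even-indexed value, high nibble the odd-indexed one.
    packed = bytes(
        ((hi & 0x0F) << 4) | (lo & 0x0F) for lo, hi in zip(q[0::2], q[1::2])
    )
    return packed, scale


def dequantize_int4(packed: bytes, scale: float, count: int) -> List[float]:
    """Unpack nibbles, sign-extend them, and rescale back to floats."""
    out: List[float] = []
    for byte in packed:
        for nibble in (byte & 0x0F, byte >> 4):
            signed = nibble - 16 if nibble >= 8 else nibble
            out.append(signed * scale)
    return out[:count]


if __name__ == "__main__":
    w = [0.7, -0.35, 0.0, 0.1, -0.7]
    packed, scale = quantize_int4(w)
    print(len(packed), "bytes for", len(w), "weights")
    print(dequantize_int4(packed, scale, len(w)))
```

With this scheme each weight is reconstructed to within half a quantization step (`scale / 2`). A real deployment would more likely use per-channel or per-block scales to limit accuracy loss, but that choice is not stated in the source.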

