Jetson Orin Nano benchmarks 8 tiny LLMs across power modes

By PulseAugur Editorial · [1 sources] · 2026-06-02 12:56

A benchmark of eight small language models (135M to ~1B parameters) was conducted on a Jetson Orin Nano Super 8GB device. The tests explored four power modes (7W, 15W, 25W, MAXN) using the llama.cpp CUDA backend. The findings indicate that the 25W power mode offers the best balance of performance and efficiency for all tested models, outperforming both the 15W and MAXN modes in terms of tokens generated per joule. AI

IMPACT Identifies optimal power efficiency for running small LLMs on edge devices, guiding hardware and software configurations.

RANK_REASON Benchmark of multiple small LLMs on specific hardware. [lever_c_demoted from research: ic=1 ai=0.7]

Read on r/LocalLLaMA →

infra
paper

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Jetson Orin Nano benchmarks 8 tiny LLMs across power modes

COVERAGE [1]

r/LocalLLaMA TIER_1 Deutsch(DE) · /u/East-Muffin-6472 · 2026-06-02 12:56

Tiny LLM Benchmark: Jetson Orin Nano Super 8GB - Four Power Modes × Eight Models

<table> <tr><td> <a href="https://www.reddit.com/r/LocalLLaMA/comments/1tuq5j1/tiny_llm_benchmark_jetson_orin_nano_super_8gb/"> <img alt="Tiny LLM Benchmark: Jetson Orin Nano Super 8GB - Four Power Modes × Eight Models" src="https://preview.redd.it/xy1e7dxe8v4h1.png?width=140&amp…

COVERAGE [1]

Tiny LLM Benchmark: Jetson Orin Nano Super 8GB - Four Power Modes × Eight Models

RELATED ENTITIES

RELATED TOPICS