PulseAugur
EN
LIVE 14:44:02

Jetson Orin Nano benchmarks 8 tiny LLMs across power modes

A benchmark of eight small language models (135M to ~1B parameters) was conducted on a Jetson Orin Nano Super 8GB device. The tests explored four power modes (7W, 15W, 25W, MAXN) using the llama.cpp CUDA backend. The findings indicate that the 25W power mode offers the best balance of performance and efficiency for all tested models, outperforming both the 15W and MAXN modes in terms of tokens generated per joule. AI

IMPACT Identifies optimal power efficiency for running small LLMs on edge devices, guiding hardware and software configurations.

RANK_REASON Benchmark of multiple small LLMs on specific hardware. [lever_c_demoted from research: ic=1 ai=0.7]

Read on r/LocalLLaMA →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Jetson Orin Nano benchmarks 8 tiny LLMs across power modes

COVERAGE [1]

  1. r/LocalLLaMA TIER_1 Deutsch(DE) · /u/East-Muffin-6472 ·

    Tiny LLM Benchmark: Jetson Orin Nano Super 8GB - Four Power Modes × Eight Models

    <table> <tr><td> <a href="https://www.reddit.com/r/LocalLLaMA/comments/1tuq5j1/tiny_llm_benchmark_jetson_orin_nano_super_8gb/"> <img alt="Tiny LLM Benchmark: Jetson Orin Nano Super 8GB - Four Power Modes × Eight Models" src="https://preview.redd.it/xy1e7dxe8v4h1.png?width=140&amp…