Brief

last 24h

[2/2] 221 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · Mastodon — mastodon.social Italiano(IT) · 15h

🧠 A 1 trillion parameter LLM is back in business thanks to old Optane memories: innovation also comes from intelligent hardware reuse. # AI # Te

A large language model with one trillion parameters has been successfully re-enabled using Intel Optane memory. This innovative approach leverages older hardware to run complex AI models, demonstrating the potential for intelligent reuse of existing technology. The project highlights how advancements in AI can be supported by creative solutions in hardware utilization. AI

IMPACT Demonstrates novel hardware utilization for running large AI models, potentially lowering costs and increasing accessibility.
- LLM
- Intel Optane
TOOL · Tom's Hardware English(EN) · 2d · [3 sources]

768GB of cheap Intel Optane DIMM memory sticks used to run 1-trillion-parameter LLM on a system with a single GPU — local Kimi K2.5 install achieved roughly 4 tokens per second

A Redditor has successfully run a 1-trillion-parameter LLM, specifically Kimi K2.5, locally on a single GPU workstation by utilizing 768GB of second-hand Intel Optane Persistent Memory modules as RAM. This setup achieved approximately 4 tokens per second, a performance deemed impressive given the hardware's budget constraints. The use of discontinued Optane DIMMs highlights a potential market gap for affordable, high-capacity memory solutions for large language model inference, especially as DRAM prices fluctuate. AI

IMPACT Demonstrates a cost-effective method for running large LLMs locally, potentially influencing future hardware configurations for AI inference.

Brief

🧠 A 1 trillion parameter LLM is back in business thanks to old Optane memories: innovation also comes from intelligent hardware reuse. # AI # Te

768GB of cheap Intel Optane DIMM memory sticks used to run 1-trillion-parameter LLM on a system with a single GPU — local Kimi K2.5 install achieved roughly 4 tokens per second