New method allows ComfyUI and local LLMs to share a single GPU

By PulseAugur Editorial · [1 source] · 2026-06-20 14:49

A Reddit user has shared a method that lets ComfyUI and a local large language model (LLM) share a single GPU without running into out-of-memory (OOM) errors. It relies on a node that unloads ComfyUI's models and clears the cache when the LLM needs the GPU, and vice versa. The two applications then take turns with VRAM instead of competing for it and crashing.
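To make the unload-and-clear step concrete, here is a minimal sketch of what such a node could look like. It is not the Reddit author's code: the class name `FreeVRAMForLLM`, the helper `free_vram`, and the pass-through `text` input are hypothetical, and the sketch assumes ComfyUI's `comfy.model_management` module provides `unload_all_models` and `soft_empty_cache`. It covers only the ComfyUI side; how the LLM gives the GPU back depends on the LLM runtime and is not shown.

```python
import gc
from typing import Callable, List, Optional


def free_vram(unload_models: Optional[Callable[[], None]] = None,
              empty_cache: Optional[Callable[[], None]] = None) -> List[str]:
    """Release GPU memory held by this process and report which steps ran."""
    steps = []
    if unload_models is not None:
        unload_models()          # drop loaded models from VRAM
        steps.append("unload_models")
    gc.collect()                 # release Python references to freed tensors
    steps.append("gc_collect")
    if empty_cache is not None:
        empty_cache()            # return cached allocator blocks to the driver
        steps.append("empty_cache")
    return steps


def _comfy_hooks():
    """Look up ComfyUI's memory helpers; return (None, None) outside ComfyUI."""
    try:
        import comfy.model_management as mm  # only importable inside ComfyUI
    except ImportError:
        return None, None
    return (getattr(mm, "unload_all_models", None),
            getattr(mm, "soft_empty_cache", None))


class FreeVRAMForLLM:
    """Hypothetical pass-through node: frees VRAM, then forwards its input."""

    @classmethod
    def INPUT_TYPES(cls):
        return {"required": {"text": ("STRING", {"forceInput": True})}}

    RETURN_TYPES = ("STRING",)
    FUNCTION = "run"
    CATEGORY = "utils"

    def run(self, text):
        unload, empty = _comfy_hooks()
        free_vram(unload, empty)
        return (text,)


# ComfyUI discovers custom nodes through this mapping.
NODE_CLASS_MAPPINGS = {"FreeVRAMForLLM": FreeVRAMForLLM}
```

Placed just before the step that calls the LLM, a node like this would hand the GPU over at a defined point in the workflow rather than leaving both programs to allocate VRAM at the same time.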

IMPACT Lets a single GPU serve both image generation and a local LLM by having them take turns with VRAM instead of running out of memory.

RANK_REASON This is a user-developed tool to improve resource management for existing AI applications.

Read on r/StableDiffusion →

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

r/StableDiffusion TIER_2 English(EN) · /u/Bramha_dev · 2026-06-20 14:49

Got comfyui and a local llm to share one gpu without OOMing every time

https://www.reddit.com/r/StableDiffusion/comments/1uaygx9/got_comfyui_and_a_local_llm_to_share_one_gpu/
