Any benefit to a multi-machine setup?
A user on the r/LocalLLaMA subreddit is inquiring about the potential benefits of a multi-machine setup for running local large language models. They have three machines with varying GPU and RAM configurations, including a high-end NVIDIA 5090, a 5070ti, and an Apple M3 Max MacBook Pro. The user is specifically asking if there have been any recent advancements that would allow these machines to work together for larger context windows or faster inference speeds. AI