English(EN) Finally pioneering beyond the local 256k context window frontier!

本地LLM上下文窗口突破341k token

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-27 12:05

r/LocalLLaMA subreddit上的一位用户已成功将本地大型语言模型的上下文窗口限制推至256k token以上。该用户手动将自动压缩设置为341.5k token，目前正通过优化内存驱逐来进一步提高上限。这项进展归功于Apple、DeepSeek和oMLX的贡献。 AI

影响展示了本地运行的LLM拥有显著更大上下文窗口的潜力。

排序理由用户驱动的研究正在突破现有模型能力的界限。[lever_c_demoted from research: ic=1 ai=1.0]

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

r/LocalLLaMA TIER_1 English(EN) · /u/challis88ocarina · 2026-05-27 12:05

Finally pioneering beyond the local 256k context window frontier!

<table> <tr><td> <a href="https://www.reddit.com/r/LocalLLaMA/comments/1tp3k64/finally_pioneering_beyond_the_local_256k_context/"> <img alt="Finally pioneering beyond the local 256k context window frontier!" src="https://preview.redd.it/if09zsde6o3h1.png?width=640&crop=smart&…