PulseAugur
实时 13:54:29
实体 llama-cpp-turboquant

llama-cpp-turboquant

PulseAugur coverage of llama-cpp-turboquant — every cluster mentioning llama-cpp-turboquant across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
1
90 天内 1
发布 · 30天
0
90 天内 0
论文 · 30天
0
90 天内 0
层级分布 · 90 天
最近 · 第 1/1 页 · 共 1 条
  1. RESEARCH · CL_03569 ·

    Quantized Qwen3.6-27B model achieves 100k context on 16GB VRAM

    A user on Reddit's r/LocalLLaMA has detailed a method for running the Qwen3.6-27B model on a system with 16GB of VRAM, achieving a context length of 100,000 tokens. The process involves creating a custom GGUF quantizati…