English(EN) Does anyone have enough compute to make a distillation dataset out of GLM5.2?

社区寻求计算能力以创建 GLM5.2 蒸馏数据集来训练更小的模型

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-18 07:31

一位 r/LocalLLaMA 子版块的用户正在寻求拥有大量计算资源的人士的帮助，以便从 GLM5.2 创建一个大型蒸馏数据集。目标是生成 70 万到 100 万个示例的数据集，以支持 Qwen3.5 等更小模型的正确训练并提高其性能。此举被视为对 AI 社区的宝贵贡献。 AI

影响通过利用更大的模型来支持更小、更易于访问的模型的训练。

排序理由用户请求计算资源以从现有模型创建数据集。

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

r/LocalLLaMA TIER_1 English(EN) · /u/Hot_Example_4456 · 2026-06-18 07:31

Does anyone have enough compute to make a distillation dataset out of GLM5.2?

<div class="md"><p>Same as title. Some lucky ppl among us have massive amounts of compute and can run even GLM 5.2. Can someone plss make a BIG distillation dataset (eg 700k-1M examples) so that we can train smaller models like Qwen3.5 properly on it and have bette…