
Fireworks AI enables RL fine-tuning for NVIDIA Nemotron 3 models

By the PulseAugur editorial team · [1 source] · 2026-06-25 23:54

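Fireworks AI has launched reinforcement learning (RL) fine-tuning for NVIDIA's Nemotron 3 models, starting with Nemotron 3 Super via LoRA and trained with GRPO. Training and serving happen on the same platform, and pricing is by GPU-hour rather than per token, which keeps the cost of long multi-turn rollouts in check.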

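Impact: The integration simplifies fine-tuning for specific NVIDIA models and may lower the barrier for developers who want to customize and deploy them.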

Ranking rationale: This is a new feature release for an existing model, not a frontier model release.


AI-generated summary · Google Gemini · based on 1 source.


Sources [1]

X — Fireworks (inference infra) TIER_1 English(EN) · FireworksAI_HQ · 2026-06-25 23:54


RL fine-tuning is now live for @nvidiaai Nemotron 3 on Fireworks, starting with Nemotron 3 Super (LoRA). Train with GRPO and serve the model in one place. We price by GPU-hour, not per token, so long multi-turn rollouts don't blow up your bill. Training shapes →
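For readers unfamiliar with GRPO, the core idea can be sketched in a few lines: several completions are sampled for the same prompt, and each one is scored relative to the rest of its group instead of against a learned value model. The snippet below is a minimal illustration under that assumption. It is not Fireworks' API or implementation; the function name `group_relative_advantages`, the `eps` constant, and the reward values are hypothetical, and real implementations differ in details such as which standard deviation they use.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Score each completion relative to the other completions in its group.

    rewards: one scalar reward per completion sampled from the same prompt.
    Returns (reward - group mean) / (group std + eps) for each completion.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; an assumption, some variants differ
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical rewards for four completions sampled from one prompt.
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)

for reward, advantage in zip(rewards, advantages):
    print(f"reward={reward:.1f} advantage={advantage:+.3f}")
```

Completions that beat the group average get a positive advantage and are reinforced; those below it get a negative one. If every completion in a group earns the same reward, all advantages are zero and that group contributes no learning signal.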
