实体
DeepSeek-V3
DeepSeek-V3
PulseAugur coverage of DeepSeek-V3 — every cluster mentioning DeepSeek-V3 across labs, papers, and developer communities, ranked by signal.
总计 · 30天
22
90 天内 22
发布 · 30天
0
90 天内 0
论文 · 30天
9
90 天内 9
层级分布 · 90 天
关系
情绪 · 30 天
8 天有情绪数据
最近 · 第 2/2 页 · 共 22 条
-
DeepSeek v3 leads open-weight models, Baseten enables mission-critical inference
DeepSeek v3, a new 671B parameter Mixture-of-Experts model, has been released and is currently the top-performing open-weights model available. Serving such large models presents significant challenges, but inference st…
-
[GRPO Explained] DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Researchers are developing new benchmarks and evaluation methods for large language models (LLMs) in mathematical reasoning and educational assessment. New datasets like ESTBook and Math-PT aim to go beyond simple accur…