PulseAugur
实时 19:51:31
实体 DeepSeek-R1

DeepSeek-R1

PulseAugur coverage of DeepSeek-R1 — every cluster mentioning DeepSeek-R1 across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
40
90 天内 40
发布 · 30天
0
90 天内 0
论文 · 30天
19
90 天内 19
层级分布 · 90 天
关系
时间线
  1. 2026-05-23 product_launch DeepSeek released the DeepSeek-R1 model, an open-source alternative to OpenAI's o1. 来源
  2. 2026-05-10 product_launch A developer launched DeepThink, a local-first macOS workspace application.
情绪 · 30 天

7 天有情绪数据

最近 · 第 1/2 页 · 共 40 条
  1. SIGNIFICANT · CL_49808 ·

    Meta releases Llama 4 with Mixture of Experts architecture

    Meta has released Llama 4 in April 2025, featuring a new Mixture of Experts (MoE) architecture. Two variants, Scout and Maverick, are available, with Scout serving as a balanced default and Maverick offering broader kno…

  2. TOOL · CL_46178 ·

    Alibaba's Qwen models offer versatile local AI with long context

    Alibaba Cloud's Qwen models are highlighted as versatile open-source options in mid-2026, offering a range of sizes from 0.5B to 72B parameters. Qwen 3.6 and 2.5 boast impressive features like a 262K context window, str…

  3. SIGNIFICANT · CL_46134 ·

    DeepSeek releases open-source R1 model rivaling OpenAI's o1

    DeepSeek has released DeepSeek-R1, an open-source model designed to rival OpenAI's o1 in reasoning capabilities. Available under the MIT license, this model can be run locally on a single GPU, offering enhanced privacy …

  4. TOOL · CL_45245 ·

    New 8B LLM Zaya1-8B signals major design shift

    A new 8-billion parameter local LLM, Zaya1-8B, is being hailed as a significant design shift in the field. Its architecture appears to represent a major departure from previous small reasoning models, potentially markin…

  5. TOOL · CL_44609 ·

    Guide: Run GPT-4 class LLMs locally on your own hardware for free

    This guide details how to run advanced large language models locally on personal hardware in 2026, bypassing expensive API costs. It emphasizes that VRAM is the primary hardware bottleneck, not raw compute power, and su…

  6. TOOL · CL_44357 ·

    Anyscale launches skill to automate LLM post-training runs

    Anyscale has introduced a new Anyscale Agent Skill designed to simplify and automate the process of generating LLM post-training runs. This skill assists users in selecting the most appropriate post-training method, suc…

  7. TOOL · CL_43602 ·

    AWS platform automates AI model evaluation for media summaries

    A media company developed a serverless platform on AWS to automate the evaluation of AI-generated podcast summaries. The system sends articles to multiple foundation models simultaneously via AWS Bedrock, then uses a se…

  8. TOOL · CL_44826 ·

    New T3S method boosts LLM distillation efficiency

    Researchers have developed a new method called Training-Trajectory-Aware Token Selection (T3S) to improve the efficiency of distilling knowledge from large language models. This technique addresses a common issue where …

  9. TOOL · CL_42828 ·

    Local LLM Setup Guides Detail llama.cpp Installation and Optimization

    This series of guides provides comprehensive instructions for setting up and running large language models (LLMs) locally on Linux systems. It details hardware and software prerequisites, recommends using llama.cpp for …

  10. TOOL · CL_40853 ·

    LLM clinical accuracy varies significantly by prompting language, study finds

    A new study published on arXiv reveals that the language used to prompt large language models significantly impacts their diagnostic reasoning and accuracy in clinical settings. Researchers found that four out of five e…

  11. TOOL · CL_37611 ·

    LLM benchmark shows routing strategy outperforms single model selection

    A recent benchmark tested 15 LLMs on 38 real-world coding tasks, revealing that a routing strategy combining different models is more effective than selecting a single top-tier model. The study found that cheaper models…

  12. TOOL · CL_34707 ·

    AI HAT+ 2 enables local LLMs on Raspberry Pi 5 with Hailo-10H

    The new AI HAT+ 2, featuring the Hailo-10H chip, offers 40 TOPS of processing power for the Raspberry Pi 5. This allows for local execution of large language models like Llama 3.2 and DeepSeek R1. The HAT+ 2 includes 8G…

  13. TOOL · CL_28283 ·

    AI reasoning studies flawed by focus on final answer, not computation

    A new research paper identifies a significant flaw in chain-of-thought (CoT) corruption studies, which are used to evaluate the faithfulness of AI reasoning. The study found that these evaluations often mistakenly ident…

  14. TOOL · CL_27991 ·

    New dataset trains AI in radiology clinical reasoning

    Researchers have introduced RadThinking, a new dataset designed to train AI systems in longitudinal clinical reasoning for radiology. The dataset includes visual question-answering pairs across three difficulty levels, …

  15. SIGNIFICANT · CL_23645 ·

    DeepSeek releases open-source coding model matching GPT-4o

    DeepSeek has released V3-0324, an open-source coding model that matches or surpasses leading models like GPT-4o and Claude 3.5 Sonnet in coding performance. This Mixture-of-Experts model, with 671 billion total paramete…

  16. TOOL · CL_23203 ·

    Ollama VRAM Guide: 8GB for 7B models, 16GB for 13B, 24GB+ for 34B

    This guide details Ollama's VRAM requirements for running various large language models in 2026. It explains that Ollama automatically quantizes models to fit available VRAM, but insufficient memory leads to slow CPU of…

  17. TOOL · CL_20915 ·

    Zyphra's ZAYA1-8B model matches top AI benchmarks with under 1B parameters

    Zyphra has released ZAYA1-8B, an open-source model that achieves performance comparable to DeepSeek-R1 on math benchmarks. The model also demonstrates competitive reasoning capabilities against Claude Sonnet 4.5 and app…

  18. COMMENTARY · CL_20705 ·

    AI models: Choose benchmarks over hype for true performance

    A recent analysis highlights that tech companies often select AI models based on hype rather than performance on relevant benchmarks. The article emphasizes that benchmarks like SWE-bench for coding, Terminal-Bench for …

  19. RESEARCH · CL_16203 ·

    Researchers distill DeepSeek-R1 reasoning into compact models for code clone detection

    Researchers have developed a knowledge distillation framework to improve the reliability and practicality of compact open-source models for cross-language code clone detection. This method transfers reasoning capabiliti…

  20. RESEARCH · CL_11885 ·

    LLMs generate privacy-safe synthetic clinical reports for data augmentation

    Researchers have developed a new evaluation framework to assess the quality of synthetic clinical data generated by Large Language Models (LLMs). The framework measures semantic fidelity, lexical diversity, and privacy …