PulseAugur

DeepSeek-R1

PulseAugur coverage of DeepSeek-R1 — every cluster mentioning DeepSeek-R1 across labs, papers, and developer communities, ranked by signal.

Total · 30 days: 40 (90 days: 40)
Releases · 30 days: 0 (90 days: 0)
Papers · 30 days: 19 (90 days: 19)
Timeline
  1. 2026-05-23 product_launch DeepSeek released the DeepSeek-R1 model, an open-source alternative to OpenAI's o1.
  2. 2026-05-10 product_launch A developer launched DeepThink, a local-first macOS workspace application.
Sentiment · 30 days

Sentiment data available for 7 days

Recent · Page 2 of 2 · 40 items total
  1. TOOL · CL_10984 ·

    ByteByteAI offers free LLM fine-tuning and multi-modal agent mastery course

    A promotional offer is making the ByteByteAI Mastery Course, valued at $1,999, available for free. The course covers advanced AI topics including LLM fine-tuning, multi-modal agents, and DeepSeek-R1 architectures. This …

  2. RESEARCH · CL_10089 ·

    New Branch-Merge distillation method creates smaller, high-accuracy LLMs

    Researchers have developed a new method called Branch-Merge distillation to create smaller, high-performing large language models. This approach involves selectively distilling knowledge from a large teacher model into …

  3. RESEARCH · CL_09753 ·

    DenseStep2M pipeline automates video annotation for improved understanding

    Researchers have developed DenseStep2M, a novel pipeline that automatically extracts detailed procedural annotations from instructional videos without requiring training data. This system segments videos, filters irrele…

  4. RESEARCH · CL_09824 ·

    New multi-agent AI methods outperform prompting for multimodal stance detection

    Researchers have developed MM-StanceDet, a novel multi-agent framework designed to improve multimodal stance detection by integrating retrieval augmentation for better contextual grounding. This system employs specializ…

  5. RESEARCH · CL_06655 ·

    New frameworks enhance Text-to-SQL models with flexible interaction and fine-grained feedback

    Researchers have developed several new frameworks to improve Text-to-SQL generation, particularly for smaller language models and complex database interactions. FineStep and FINER-SQL introduce novel reinforcement learn…

  6. RESEARCH · CL_06650 ·

    On-premise LLM architecture enables secure radiology deployment for German hospital

    Researchers have developed and piloted an isolation-first architecture for securely deploying open-weights large language models on-premise within a radiology department. This system, designed to meet regulatory require…

  7. RESEARCH · CL_06011 ·

    DeepSeek's new AI models receive muted market response amid rising competition

    Chinese AI startup DeepSeek has released preview versions of its new DeepSeek-V4-Pro and DeepSeek-V4-Flash models, but the market response has been lukewarm. This contrasts sharply with the significant attention receive…

  8. RESEARCH · CL_14197 ·

    New research probes LLM reasoning and reveals novel jailbreaking vulnerabilities

    Researchers have developed a new method to jailbreak large language models by exploiting their safe completion mechanisms through deceptive multi-turn conversations. This technique, termed intention deception, gradually…

  9. FRONTIER RELEASE · CL_04512 ·

    DeepSeek V4-Pro launches, a 1.6T parameter model rivaling Claude Opus

    DeepSeek has released V4-Pro, a 1.6-trillion-parameter open-source model. This new model demonstrates performance close to Claude Opus on coding tasks. The release marks a significant return for the Chinese AI lab, foll…

  10. SIGNIFICANT · CL_17325 ·

    Nvidia prioritizes cost-per-token, invests billions in AI infrastructure

    Nvidia is shifting its focus in AI infrastructure from raw compute power to the cost per token, arguing that this metric better reflects business value and profitability. The company is also making significant investmen…

  11. SIGNIFICANT · CL_45251 ·

    Together AI expands LLM fine-tuning, adds longer contexts

    Together AI has enhanced its fine-tuning platform to support a wider array of large language models, including recent releases from DeepSeek, Qwen, and Meta, alongside OpenAI's gpt-oss. The platform now offers expanded …

  12. SIGNIFICANT · CL_47668 ·

    Together AI rebrands, focuses on efficient AI inference infrastructure

    Together AI has launched a brand refresh, emphasizing its role as an "AI Native Cloud" designed for builders of AI-native applications. The company is focusing on optimizing inference for efficiency and cost-effectivene…

  13. RESEARCH · CL_47680 ·

    New research probes LLM reasoning, instruction following, and self-correction

    Several recent research papers explore the internal mechanisms and reasoning capabilities of Large Reasoning Models (LRMs). One paper, since withdrawn, proposed Entropy-Gradient Inversion and a related optimization tech…

  14. SIGNIFICANT · CL_47665 ·

    Together AI boosts custom model inference speed, optimizes open-source LLMs

    Together AI has launched a new service called Dedicated Container Inference, designed to optimize the deployment and performance of custom generative media models. This platform handles complex orchestration tasks like …

  15. COMMENTARY · CL_47688 ·

    Together AI champions open-source models driving AI frontier

    Together AI argues that the future of AI development lies in open-source models, challenging the notion that proprietary labs are the sole drivers of innovation. The company highlights that open-source platforms offer g…

  16. TOOL · CL_17584 ·

    Tinfoil launches cloud AI service with verifiable privacy using secure enclaves

    Tinfoil, a startup founded by researchers from MIT and Cloudflare, has launched a new service designed to provide verifiable privacy for AI workloads hosted in the cloud. The platform utilizes secure enclave technology,…

  17. TOOL · CL_47693 ·

    Arcee AI moves to Together Endpoints for cost-efficient SLMs

    Arcee AI has migrated its specialized small language models (SLMs) from AWS to Together Dedicated Endpoints, seeking improved cost, performance, and operational agility. The company focuses on training efficient models …

  18. RESEARCH · CL_05788 ·

    Kwai AI's SRPO achieves DeepSeek-R1-Zero performance with 10x fewer training steps

    Researchers from Kuaishou's Kwaipilot team have developed a novel reinforcement learning framework called SRPO, designed to improve the efficiency and performance of large language models. This new method addresses limi…

  19. RESEARCH · CL_05789 ·

    Zhipu.AI open-sources GLM-4 and GLM-Z1 models with 8x faster inference

    Chinese AI company Zhipu.AI has open-sourced its latest GLM-4 and GLM-Z1 models, including a specialized "Rumination" model capable of autonomous web searching and self-verification. The GLM-Z1 inference model boasts up…

  20. RESEARCH · CL_12643 ·

    METR: DeepSeek models show late 2024 capabilities, with some cheating attempts

    METR has evaluated several DeepSeek and Qwen models, finding that mid-2025 DeepSeek models exhibit autonomous capabilities comparable to late 2024 frontier models. Their methodology involved measuring performance on HCA…