PulseAugur

CAISI

PulseAugur coverage of CAISI — every cluster mentioning CAISI across labs, papers, and developer communities, ranked by signal.

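Total · 30 days
6
Within 90 days: 6
Releases · 30 days
0
Within 90 days: 0
Papers · 30 days
1
Within 90 days: 1
Tier distribution · 90 days
Relations
Sentiment · 30 days

3 days with sentiment data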

Recent · Page 1 of 1 · 6 items
  1. RESEARCH · CL_34816 ·

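    Open-source models lag frontier closed-source models; benchmarks disputed

    Several leading AI labs have released new open-source models, including DeepSeek V4, Gemma 4, Kimi K2.6, and MiMo 2.5. A CAISI evaluation indicates that these open-source models lag frontier closed-source models and that the gap is widening. However, the evaluation methodology and the limitations of the benchmarks have also drawn dispute: some argue that standardized tests fail to fully capture real-world capability, especially on complex tasks such as coding.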

  2. TOOL · CL_28417 ·

    NIST: DeepSeek V4 Pro matches GPT-5 performance, leads Chinese models

    The U.S. National Institute of Standards and Technology (NIST) has evaluated DeepSeek V4 Pro, a new AI model from Chinese company DeepSeek. The evaluation found that DeepSeek V4 Pro performs comparably to OpenAI's GPT-5…

  3. COMMENTARY · CL_26547 ·

    AI regulation should preserve future options, researchers say

    Researchers propose "radical optionality" as a regulatory approach for AI, suggesting governments invest in tools and institutions now to manage future disruptions. This strategy emphasizes building information-gatherin…

  4. RESEARCH · CL_16707 ·

    NIST partners with Google DeepMind, Microsoft, and xAI on frontier AI security testing

    The National Institute of Standards and Technology's Center for AI Standards and Innovation (CAISI) has formalized new agreements with Google DeepMind, Microsoft, and xAI. These collaborations aim to enhance the securit…

  5. SIGNIFICANT · CL_00119 ·

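    NIST launches AI Agent Standards Initiative to promote secure, interoperable innovation

    The U.S. National Institute of Standards and Technology (NIST) has launched the AI Agent Standards Initiative to promote the secure and interoperable adoption of autonomous AI agents. Led by NIST's Center for AI Standards and Innovation (CAISI), the initiative aims to foster industry-led standards and open-source protocols. Focus areas include advancing research on AI agent security and identity to build public trust and secure U.S. leadership in the global AI landscape.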

  6. COMMENTARY · CL_02317 ·

    OpenAI urges California to lead harmonized AI regulation with federal standards

    OpenAI has urged California Governor Gavin Newsom to harmonize state AI regulations with national and global standards to foster innovation and safety. The company advocates for a unified approach, suggesting that compl…