GPT-4.1

PulseAugur coverage of GPT-4.1: every cluster mentioning GPT-4.1 across labs, papers, and developer communities, ranked by signal.

Total · 30 days

31

31 in 90 days

Releases · 30 days

0

0 in 90 days

Papers · 30 days

18

18 in 90 days

Tier distribution · 90 days

frontier release 1
significant 3
research 12
tool 14
commentary 1

Relations

Sentiment · 30 days

Sentiment data available for 7 days

Recent · Page 2 of 2 · 31 items total

TOOL · CL_15790 · May 5 · 04:00

BareBones benchmark reveals Vision-Language Models suffer texture bias cliff

Researchers have introduced BareBones, a new benchmark designed to test the geometric comprehension abilities of Vision-Language Models (VLMs). The benchmark uses pixel-level silhouettes to evaluate if VLMs can understa…

RESEARCH · CL_06484 · Apr 28 · 04:00

New framework uses reconstruction to validate AI document processing outputs

Researchers have introduced RaV-IDP, a novel framework for intelligent document processing that incorporates reconstruction as a validation step. This approach aims to ensure extracted information accurately reflects th…

RESEARCH · CL_04970 · Apr 23 · 18:42

LLMs struggle to detect culturally specific health misinformation on YouTube

Two new research papers explore the limitations of Large Language Models (LLMs) in detecting culturally specific health misinformation, particularly concerning the promotion of cow urine as a remedy on YouTube in India.…

SIGNIFICANT · CL_02283 · Oct 2 · 10:00

OpenAI bolsters AI safety with external testing as GPT-5 powers Wrtn's user growth

OpenAI is enhancing its safety protocols for advanced AI models by incorporating external testing and assessments. This involves collaborating with independent experts to evaluate capabilities, risks, and mitigation str…

TOOL · CL_02305 · Sep 9 · 10:00

SafetyKit leverages GPT-5 and GPT-4.1 for enhanced AI risk detection and fraud prevention

OpenAI has launched SafetyKit, a platform that utilizes its most advanced models, including GPT-5 and GPT-4.1, to build multimodal AI agents for detecting fraud and prohibited activities. These agents can process text, …

SIGNIFICANT · CL_02336 · Jul 1 · 10:00

Genspark's Super Agent hits $36M ARR in 45 days with OpenAI's GPT-4.1

Genspark has launched Super Agent, a no-code AI assistant capable of automating real-world tasks such as making phone calls and generating presentations. The platform leverages OpenAI's GPT-4.1 and Realtime API, utilizi…

SIGNIFICANT · CL_02167 · May 21 · 08:00

From model to agent: Equipping the Responses API with a computer environment

OpenAI has enhanced its Responses API by integrating a computer environment, enabling models to act as agents capable of executing complex workflows. This new capability allows models to interact with command-line tools…

TOOL · CL_47693 · May 5 · 00:00

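Arcee AI migrates to Together endpoints for cost-efficient SLMs

Arcee AI has migrated its specialized small language models (SLMs) from AWS to Together dedicated endpoints, seeking improvements in cost, performance, and operational agility. The company focuses on training efficient models with fewer than 72 billion parameters for specific tasks such as coding and general text generation. Arcee AI has also developed Arcee Conductor, an inference routing system that directs each query to the most suitable model, including third-party options such as GPT-4.1 and Claude 3.7 Sonnet, to optimize cost and performance.

RESEARCH · CL_00033 · Oct 17 · 02:00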

[GRPO Explained] DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Researchers are developing new benchmarks and evaluation methods for large language models (LLMs) in mathematical reasoning and educational assessment. New datasets like ESTBook and Math-PT aim to go beyond simple accur…

FRONTIER RELEASE · CL_02309 · Aug 22 · 07:00

Introducing gpt-realtime and Realtime API updates

OpenAI has released GPT-4.1, a new series of models for its API that offer significant improvements in coding, instruction following, and long context comprehension, outperforming previous models like GPT-4o. The compan…

SIGNIFICANT · CL_39485 · Aug 24 · 07:00

OpenAI partners with Apple, Google DeepMind researches agent scaling

OpenAI has announced a partnership with Apple to integrate ChatGPT into iOS, iPadOS, and macOS, enhancing Siri and system-wide writing tools with GPT-4o capabilities. Google DeepMind has published research on scaling AI…