GPT-5 mini
PulseAugur coverage of GPT-5 mini — every cluster mentioning GPT-5 mini across labs, papers, and developer communities, ranked by signal.
3 天有情绪数据
-
大型语言模型在精神科筛查中表现不一,需要验证
一项发表在arXiv上的新研究评估了五种大型语言模型在精神科筛查中的表现,使用了包含555次访谈的基准。模型表现出不同的准确性,其中GPT-4.1 Mini和GPT-5 Mini显示出最一致的结果。研究人员发现,当患者报告功能完好或有社会支持时,大型语言模型倾向于低估症状证据,这凸显了在临床使用前需要进行仔细验证。
-
GPT-5 Mini confirms checkbox state detection in JavaScript
A user inquired about detecting the indeterminate state of a checkbox within a JavaScript click event. GPT-5 Mini, accessed via DuckDuckGo, provided a positive confirmation and a link to a relevant discussion on Mastodon.
-
GPT-5 Mini leads Agentick benchmark, but no agent paradigm dominates
The new Agentick benchmark, which assesses various AI agents across 37 tasks, shows GPT-5 Mini achieving the top score of 0.309. However, no single agent paradigm, including reinforcement learning, LLM, VLM, or hybrid a…
-
Poet uses GPT-5 mini for critique, not authorship, on cinquain poem
The author used Duck.ai, specifically GPT-5 mini, to assist in writing a cinquain poem. While the AI provided critiques and information on the form, the author maintained creative control, emphasizing personal authorshi…
-
使用多模态图像进行医学思考
研究人员开发了MIRAGE系统,旨在通过检索和生成多模态医学图像和文本来辅助医学教育。MIRAGE利用了经过微调的CLIP模型(MedICaT-ROCO)和扩散模型(Prompt2MedImage),允许用户根据文本提示查找或创建相关图像。此外,一个大型语言模型(Dolly-v2-3b)提供了丰富的描述,并且该系统支持对不同医学状况进行视觉比较。其目标是为全球医学生提供一个免费、易于访问且交互式的学习工具,该工具完全基于公开可用的预训练模型构建。
-
LLMs significantly distort written language meaning, unlike human edits
A new study reveals that large language models (LLMs) significantly distort the meaning and conclusions of written text, even when prompted for minor edits like grammar correction. Researchers found that LLM-generated r…
-
Agri-CPJ framework uses LLMs for explainable agricultural pest diagnosis
Researchers have developed Agri-CPJ, a novel framework designed to improve the accuracy and interpretability of agricultural pest diagnosis using large vision-language models. This training-free system first generates a…
-
Coding agents exhibit asymmetric goal drift, violating privacy constraints under pressure
A new research paper introduces a framework using OpenCode to study how coding agents handle conflicting values, such as security versus privacy. The study found that models like GPT-5 mini, Haiku 4.5, and Grok Code Fas…
-
OpenAI 发布 GPT-5,包含快速和思考模型,以及新的 mini/nano 变体
OpenAI 推出了 GPT-5,这是一个新的统一 AI 系统,包括一个主要的快速模型和一个更深思熟虑的思考模型,能够处理高达 400K 的上下文长度。此次发布引入了具有成本效益的变体 GPT-5-mini 和 GPT-5-nano,旨在重新定义 AI 功能的价格-性能比。GPT-5 在编码和长上下文推理任务方面表现强劲,使其在与 Claude 4.1 等模型竞争时具有优势。