gpt-oss-20b
PulseAugur coverage of gpt-oss-20b — every cluster mentioning gpt-oss-20b across labs, papers, and developer communities, ranked by signal.
5 days with sentiment data
-
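Large language models simplify clinical data access through the M3 system
Researchers have developed M3, a system that uses conversational large language models to simplify access to and analysis of complex clinical databases such as MIMIC-IV. M3 lets users query data in natural language and translates each question into a SQL query for execution. Evaluations show high accuracy for models including Claude Sonnet 4 and the open-source gpt-oss-20B, demonstrating that local, privacy-preserving deployment on sensitive medical data is feasible.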
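The natural-language-to-SQL flow described for M3 can be illustrated with a minimal sketch. It is not M3's implementation: the `admissions` table, the `stub_translate` function (standing in for the LLM call), and the read-only guard are all hypothetical, and an in-memory SQLite database replaces MIMIC-IV.

```python
import sqlite3
from typing import Callable


def build_demo_db() -> sqlite3.Connection:
    """Create an in-memory table standing in for a clinical database (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE admissions (subject_id INTEGER, admission_type TEXT)")
    conn.executemany(
        "INSERT INTO admissions VALUES (?, ?)",
        [(1, "EMERGENCY"), (2, "ELECTIVE"), (3, "EMERGENCY")],
    )
    return conn


def stub_translate(question: str) -> str:
    """Placeholder for the LLM step; a real system would prompt a model with the schema."""
    if "emergency" in question.lower():
        return "SELECT COUNT(*) FROM admissions WHERE admission_type = 'EMERGENCY'"
    raise ValueError("stub cannot translate this question")


def is_read_only(sql: str) -> bool:
    """Rough guard (an assumption, not from the source): allow a single SELECT only."""
    stripped = sql.strip().rstrip(";")
    return stripped.upper().startswith("SELECT") and ";" not in stripped


def answer(
    conn: sqlite3.Connection,
    question: str,
    translate: Callable[[str], str] = stub_translate,
) -> list:
    """Translate a question to SQL, check it is read-only, then execute it."""
    sql = translate(question)
    if not is_read_only(sql):
        raise PermissionError("only single SELECT statements are executed")
    return conn.execute(sql).fetchall()


rows = answer(build_demo_db(), "How many emergency admissions are there?")
print(rows)
```

Passing the translator as a parameter keeps the model choice separate from query execution, so a hosted model and a locally run one could be swapped without touching the database code.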
-
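New metric reveals how language models process metaphor
Researchers have developed a new metric called Conditional Scale Entropy (CSE) for analyzing how decoder-only language models process metaphor. CSE measures the breadth of computational engagement across frequency scales within Transformer layers. Studies using CSE show that in models ranging from 124 million to 20 billion parameters, including architectures such as GPT-2, LLaMA-2, and GPT-oss, metaphorical tokens consistently activate a broader range of computational scales than literal tokens.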
-
OpenAI's gpt-oss-20b model runs 128k context on single L4 GPU
An engineer has successfully deployed OpenAI's gpt-oss-20b model, enabling a 128,000 token context window on a single NVIDIA L4 GPU. This setup, running in production for six months, leverages mxfp4 quantization for eff…
-
LLM benchmark shows routing strategy outperforms single model selection
A recent benchmark tested 15 LLMs on 38 real-world coding tasks, revealing that a routing strategy combining different models is more effective than selecting a single top-tier model. The study found that cheaper models…
-
MoE models misroute tokens on complex reasoning tasks, study finds
Researchers have identified a significant issue in Mixture-of-Experts (MoE) language models where the routing mechanism, which directs tokens to specific experts, often selects suboptimal paths. While the standard route…
-
Seven small coding AI models offer local development power in 2026
The article highlights seven small coding AI models suitable for local development, emphasizing their efficiency and privacy benefits. These models, including OpenAI's gpt-oss-20b and Microsoft's Phi-3.5-mini-instruct, …
-
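OpenUI integrates Ollama to generate UI locally with a range of large models
This guide details how to set up and use OpenUI with Ollama to generate UI locally from prompts. It covers the required software installation and system requirements, analyzes model performance in depth, and recommends larger models such as qwen2.5-coder:14b or gpt-oss:20b for better stability. The guide also outlines the steps for pulling models through Ollama and configuring the OpenUI application with a .env file, including specifying the local Ollama API endpoint and the desired model.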
-
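AMD EPYC CPU proves competitive on LLM and TTS inference workloads
A recent Leaseweb analysis benchmarked the AMD EPYC 9334 CPU on large language model (LLM) and text-to-speech (TTS) inference workloads. The study shows that although GPUs deliver higher throughput, CPUs can be a cost-effective and predictable choice for inference, especially when factors such as latency and cost per query are considered. The benchmarks highlight the impact of quantization, with Q4 models achieving markedly better throughput than FP16 on CPU, and also compare time to first token (TTF…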
-
New red-teaming method ContextualJailbreak bypasses LLM safety alignment
Researchers have developed ContextualJailbreak, an evolutionary red-teaming strategy designed to find vulnerabilities in large language models. This black-box approach uses simulated multi-turn dialogues and a graded ha…
-
New llmfs file system integrates LLM prompts directly into C code
A new file system called llmfs has been developed to integrate large language model prompts directly into code. This system allows developers to include prompts using standard include directives, enabling AI-generated c…
-
Liquid AI releases LFM2-24B-A2B, an efficient 24B parameter MoE model
Liquid AI has released an early checkpoint of its LFM2-24B-A2B model, a sparse Mixture of Experts (MoE) architecture with 24 billion total parameters and 2 billion active parameters per token. This model demonstrates th…
-
Perspective Intelligence 1.13 to feature mlx-community model search and GPT-OSS integration
Perspective Intelligence 1.13 is set to launch with significant enhancements, including the integration of mlx-community for model searching within the application. This update aims to broaden the selection of available…
-
Researchers propose efficient LLM classification probes to reduce latency and VRAM
Researchers have developed a method to integrate classification tasks, such as safety checks, directly into the forward pass of large language models (LLMs). This approach uses lightweight probes trained on the LLM's in…
-
New tool AgentSeer reveals critical gaps in LLM agentic security
Researchers have developed a new tool called AgentSeer to evaluate the vulnerabilities of large language models (LLMs) when they are deployed in agentic systems. This tool decomposes agent executions into action-compone…
-
OpenAI launches affordable GPT-4o mini and open-weight gpt-oss models
OpenAI has released GPT-4o mini, a new, highly cost-efficient small model designed to broaden AI accessibility and application development. This model demonstrates superior performance on benchmarks like MMLU, MGSM, and…
-
OpenAI and researchers reveal AI vulnerabilities to adversarial attacks
OpenAI researchers are exploring the transferability of adversarial robustness across different types of perturbations in neural networks. Their findings indicate that robustness against one perturbation type does not a…