GPT-OSS 120B
PulseAugur coverage of GPT-OSS 120B — every cluster mentioning GPT-OSS 120B across labs, papers, and developer communities, ranked by signal.
- instance of large-language models 90%
- instance of GPT-4o mini 90%
- instance of LLMs 90%
- instance of GPT OSS 20B 90%
- competes with GPT OSS 20B 70%
- instance of LLM 70%
- competes with Qwen3 70%
- used by Together AI 70%
- competes with Qwen3.5-122B 60%
- competes with Claude Sonnet 4.6 50%
- affiliated with Together AI 50%
- authored by arXiv 50%
11 days with sentiment data
-
LLMs and humans diverge in problem-solving strategies, research finds · 7 sources tracked
New research indicates that while both humans and large language models (LLMs) adjust their problem-solving time based on difficulty, their internal mechanisms differ significantly. Humans tend to disengage from problem…
-
DFlash accelerates AI inference with parallel token block drafting · 2 sources tracked
Researchers from the University of California, San Diego, have developed DFlash, a novel speculative decoding technique that significantly accelerates AI inference. Unlike traditional methods that generate tokens one by…
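For context, the conventional draft-and-verify loop that speculative decoding builds on can be sketched as follows. This is a generic greedy toy, not DFlash itself: the summary above does not describe DFlash's block drafting, and the names `speculative_decode`, `target`, and `draft` are hypothetical. Real systems verify a whole drafted block in one batched forward pass; the sequential calls here only illustrate the acceptance rule.

```python
from typing import Callable, List

# A "model" here is any function mapping a token context to its greedy next token.
NextToken = Callable[[List[int]], int]

def speculative_decode(target: NextToken, draft: NextToken,
                       prompt: List[int], max_new: int, block: int = 4) -> List[int]:
    """Greedy draft-and-verify decoding.

    The result always equals plain greedy decoding with `target`;
    the draft model only affects how many tokens are accepted per round.
    """
    out = list(prompt)
    goal = len(prompt) + max_new
    while len(out) < goal:
        # 1. Draft a block of candidate tokens with the cheap model.
        ctx = list(out)
        proposal = []
        for _ in range(min(block, goal - len(out))):
            tok = draft(ctx)
            proposal.append(tok)
            ctx.append(tok)
        # 2. Verify: accept drafted tokens for as long as the target agrees.
        for tok in proposal:
            expected = target(out)
            out.append(expected)
            if expected != tok:
                break  # first mismatch: keep the target's token, drop the rest
    return out
```

The speed-up in practice comes from step 2 being cheaper than generating each token separately with the large model, which this toy does not attempt to model.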
-
Wonda pipeline enhances SLM program verification with curated data
Researchers have developed a data curation pipeline called Wonda to improve the training of Small Language Models (SLMs) for program verification. This pipeline normalizes raw verifier output and uses LLMs to rewrite an…
-
Users seek best local LLMs for structured text-to-JSON conversion
A user on Reddit's r/LocalLLaMA subreddit is seeking recommendations for local large language models capable of converting unstructured text into structured JSON output. They have found that while larger models like GPT…
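A common safeguard when asking a local model for JSON is to parse and check the reply before using it. The sketch below shows one way to do that; the schema in `REQUIRED_FIELDS`, the function name, and the sample reply are hypothetical and not taken from the thread.

```python
import json

# Hypothetical schema: field name -> accepted Python type(s).
REQUIRED_FIELDS = {"name": str, "date": str, "amount": (int, float)}

def parse_model_json(reply: str) -> dict:
    """Extract a JSON object from a model reply and check required fields."""
    # Models often wrap JSON in prose or code fences, so keep only the
    # text between the first "{" and the last "}".
    start, end = reply.find("{"), reply.rfind("}")
    if start == -1 or end <= start:
        raise ValueError("no JSON object found in reply")
    data = json.loads(reply[start:end + 1])  # raises a ValueError subclass on bad JSON
    for field, expected in REQUIRED_FIELDS.items():
        if field not in data:
            raise ValueError(f"missing field: {field}")
        if not isinstance(data[field], expected):
            raise ValueError(f"wrong type for field: {field}")
    return data
```

A failed check can then trigger a retry with the error message appended to the prompt, which is often enough for a smaller model to correct itself.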
-
AI agent recommendations sought for Python web development
A user on the r/LocalLLaMA subreddit is seeking recommendations for an AI agent setup to assist with Python web development in PyCharm. They have a powerful hardware setup with 128GB of RAM capable of running large mode…
-
OpenAI releases GPT-Image-2 and GPT-5.5 Instant upgrades, plus new cybersecurity tools
OpenAI has released GPT-Image-2, making it available on Together AI for developers to integrate into their applications. This model supports up to 16 reference images per call and offers native 1K, 2K, and 4K outputs, w…
-
New method enhances LLM agent clarification seeking by decomposing uncertainty
Researchers have developed a novel method for LLM agents to improve their clarification-seeking capabilities by decomposing uncertainty. This approach separates action confidence from request uncertainty, allowing agent…
-
LLM community calls for urgent release of 80-160B parameter models
Users on the r/LocalLLaMA subreddit are expressing a strong need for new large language models (LLMs) in the 80-160 billion parameter range. Current models are either too small for users with high-capacity but slower un…
-
"Hey, Reachy" voice assistant prioritizes speed over intelligence
The developer of the Echo companion platform has rebuilt it as "Hey, Reachy," a voice-first AI assistant focused on low latency and natural interaction. The new system prioritizes speed over complex AI models, using gpt…
-
AI community questions lack of new 100B-120B parameter language models
A discussion on the r/LocalLLaMA subreddit highlights a perceived lack of new large language models in the 100B-120B parameter range. While models like GPT-OSS-120B, GLM-4.5-Air, Nemotron-3-Super, Qwen3.5-122B, and Mist…
-
AI assistant misinterprets code constant as Dutch postcode
Large language models can make peculiar errors, as demonstrated by an AI assistant reviewing code. The model incorrectly identified a numerical constant, `1024UL`, as a potential Dutch postcode due to its visual similar…
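The confusion is easy to reproduce: Dutch postcodes consist of four digits followed by two uppercase letters, and `1024UL` (the integer 1024 with C's unsigned long suffix) happens to fit that shape. A minimal sketch, using a simplified pattern that is an assumption for illustration and not the check the assistant actually performed:

```python
import re

# Simplified Dutch postcode shape: four digits (no leading zero),
# an optional space, then two uppercase letters. Real validation
# has additional rules; this pattern is illustrative only.
DUTCH_POSTCODE = re.compile(r"[1-9][0-9]{3} ?[A-Z]{2}")

def looks_like_dutch_postcode(token: str) -> bool:
    """Return True if the whole token matches the simplified shape."""
    return DUTCH_POSTCODE.fullmatch(token) is not None
```

Under this pattern, `looks_like_dutch_postcode("1024UL")` returns `True`, while the bare literal `"1024"` does not match.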
-
Gemma-3 270M fine-tuned to control robot with natural language commands
A developer has fine-tuned Google's Gemma-3 270M language model to control a simulated robot. The model was trained to translate natural language commands into JSON instructions for movement and object manipulation with…
-
AI engineer details on-premise LLM hardware calculation challenges
An AI engineer details the challenges of accurately calculating hardware requirements for on-premise LLM deployments. Initial estimates using a popular calculator for a GPT-OSS-120B model on two RTX Pro 6000 Blackwell G…
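Back-of-envelope estimates of this kind usually add the memory for the weights to the memory for the key-value cache. The sketch below shows that arithmetic under stated assumptions: every parameter value is a placeholder rather than a GPT-OSS-120B specification, the function name is hypothetical, and the formula ignores activations, framework overhead, and attention variants that shrink the cache, which is one reason simple calculators can be off.

```python
def estimate_memory_bytes(params: float, bytes_per_param: float,
                          layers: int, kv_heads: int, head_dim: int,
                          context: int, batch: int = 1,
                          kv_bytes: float = 2.0) -> float:
    """Rough serving-memory estimate: weights plus a dense KV cache."""
    weights = params * bytes_per_param
    # Keys and values (factor 2) are stored per layer, per KV head,
    # per token in the context, for every sequence in the batch.
    kv_cache = 2 * layers * kv_heads * head_dim * context * batch * kv_bytes
    return weights + kv_cache
```

Dividing the result by `1024**3` gives GiB, which can then be compared with the combined VRAM of the target GPUs, leaving headroom for everything the formula omits.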
-
New GeoNatureAgent benchmark tests LLM agents on environmental geospatial tasks
A new benchmark, GeoNatureAgent, has been released to evaluate the performance of AI agents in environmental geospatial analysis using real-world APIs. The benchmark includes 93 tasks across various categories, such as …
-
llama.cpp performance boosted 80% by optimizing thread count
A user on Reddit's r/LocalLLaMA subreddit has discovered a significant performance improvement in the llama.cpp inference engine by adjusting the `--threads` argument. Initially, it was believed that limiting threads to…
-
New framework OTora tests LLM agents for reasoning-level denial-of-service attacks
Researchers have developed OTora, a novel framework designed to test the resilience of large language model (LLM) agents against a specific type of attack known as Reasoning-Level Denial-of-Service (R-DoS). This attack …
-
Process mining reveals LLM red teaming defense differences
Researchers have developed a new method using process mining to analyze how Large Language Models (LLMs) respond to red teaming attacks. This approach moves beyond simple success/fail metrics to examine the sequential i…
-
LLMs improve public opinion data imputation via in-context learning
Researchers have developed a new method for imputing missing public opinion data using large language models (LLMs) through in-context learning (ICL). This approach was tested on survey data and showed consistent error …
-
LLMs evaluated for formal math proofs in Lean 4
A new research paper evaluates the performance of various Large Language Models (LLMs) in generating formal mathematical proofs using the Lean 4 theorem prover. The study employed pass@k and refine@k metrics on subsets …
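For readers unfamiliar with pass@k, it is commonly computed with the unbiased estimator 1 - C(n-c, k) / C(n, k), where n attempts are sampled per problem and c of them succeed. The sketch below assumes that standard form; the summary does not say how the paper computes the metric, the function name is illustrative, and refine@k is not covered.

```python
from math import comb

def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k from n samples of which c are correct."""
    if not 0 <= c <= n or not 1 <= k <= n:
        raise ValueError("need 0 <= c <= n and 1 <= k <= n")
    # comb(n - c, k) / comb(n, k) is the probability that a random
    # size-k subset of the n samples contains no correct one.
    return 1.0 - comb(n - c, k) / comb(n, k)
```

The per-problem values are then averaged over the benchmark to give the reported score.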
-
LLMs improve heart medical Q&A with new GRPO reward framework
Researchers have developed a new method to improve the accuracy of Large Language Models (LLMs) in answering heart-related medical questions. Their approach utilizes Group Relative Policy Optimization (GRPO) with a nove…
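The core of GRPO is that each sampled answer is scored relative to the other answers drawn for the same prompt, so no separate value model is needed. A minimal sketch of that normalization step follows; it uses the population standard deviation (implementations differ on this), the function name is hypothetical, and the paper's reward framework itself is not reproduced because the summary does not describe it.

```python
from statistics import mean, pstdev

def group_relative_advantages(rewards: list[float], eps: float = 1e-8) -> list[float]:
    """Normalize rewards within one group of answers to the same prompt."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    # eps avoids division by zero when every answer gets the same reward.
    return [(r - mu) / (sigma + eps) for r in rewards]
```

Answers that beat the group average receive a positive advantage and are reinforced; a group with identical rewards yields all-zero advantages and contributes no learning signal.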