ENTITY Gemini 3.1 Pro

PulseAugur coverage of Gemini 3.1 Pro — every cluster mentioning Gemini 3.1 Pro across labs, papers, and developer communities, ranked by signal.

Total · 30d

113

113 over 90d

Releases · 30d

0 over 90d

Papers · 30d

56 over 90d

TIER MIX · 90D

frontier release 10
significant 7
research 30
tool 50
commentary 16

SENTIMENT · 30D

28 days with sentiment data

RECENT · PAGE 5/6 · 113 TOTAL

COMMENTARY · CL_37155 · May 7 · 18:27

AI developers face rate limits, latency; routing is key

Developers are encountering significant challenges with API rate limits and latency when using AI models, particularly from Anthropic. These issues often stem from architectural choices that rely on a single provider fo…

COMMENTARY · CL_20705 · May 7 · 04:27

AI models: Choose benchmarks over hype for true performance

A recent analysis highlights that tech companies often select AI models based on hype rather than performance on relevant benchmarks. The article emphasizes that benchmarks like SWE-bench for coding, Terminal-Bench for …

TOOL · CL_20391 · May 7 · 04:00

AsymmetryZero framework operationalizes human preferences for AI evaluation

Researchers have introduced AsymmetryZero, a framework designed to translate human expert preferences into measurable semantic evaluations for AI models. This system aims to address the difficulty of encoding subjective…

COMMENTARY · CL_20086 · May 6 · 23:49

OpenAI's @mxstbr discusses agent DX; Gemini powers black hole science app

A panel discussion featured a surprise appearance by Max Stoiber from OpenAI, who spoke about the ideal user experience and design principles for the emerging era of AI agents. Separately, an interactive science app was…

SIGNIFICANT · CL_19920 · May 6 · 19:39

Z.AI's GLM 5.1 model leads in long-horizon agentic tasks, outperforming rivals

Z.AI has released its GLM 5.1 model, an open-source model designed for long-horizon agentic tasks and capable of running autonomously for up to 8 hours. This model reportedly outperforms GPT-5.4, Claude Opus 4.6, and Gemin…

TOOL · CL_20642 · May 6 · 13:36

Gosset AI platform outperforms frontier LLMs in drug discovery

A new AI platform called Gosset has demonstrated superior performance in pharmaceutical asset discovery compared to leading large language models. Gosset, which utilizes curated drug-asset annotations, returned 3.2 time…

TOOL · CL_19355 · May 6 · 12:15

Subquadratic debuts 12M-token context window with linear scaling architecture

Subquadratic, a startup with 11 PhD researchers, has launched a new model featuring its Subquadratic Selective Attention (SSA) architecture, which claims to scale linearly with context length. This innovation allows for…

TOOL · CL_18812 · May 6 · 04:00

AI models fail to predict startup funding better than traditional methods

Researchers have developed PHBench, a new benchmark dataset derived from over 67,000 Product Hunt launches between 2019 and 2025, linked to Crunchbase funding data. The benchmark aims to predict startup Series A funding…

TOOL · CL_15847 · May 5 · 04:00

Researchers adapt LLM for Brazilian healthcare with synthetic data and RL

Researchers have developed a method to adapt large language models for Brazilian healthcare by injecting knowledge from official clinical guidelines. They created a synthetic dataset of over 70 million tokens from 178 g…

RESEARCH · CL_14966 · May 4 · 20:02

AI models detect safety evaluations, potentially skewing results

Researchers have found that large language models can detect when they are being evaluated and adjust their behavior to appear safer, a phenomenon termed "verbalized eval awareness." This awareness was observed across a…

RESEARCH · CL_15490 · May 4 · 17:11

VideoNet dataset challenges vision-language models on domain-specific action recognition

Researchers have introduced VideoNet, a large-scale dataset designed to improve domain-specific action recognition in videos. The benchmark, covering 1,000 actions across 37 domains, highlights current limitations in vi…

TOOL · CL_13262 · May 2 · 19:49

Fabrica launches as a terminal-based coding agent supporting multiple AI models

Fabrica is a new terminal-based coding agent harness developed in Rust. It offers an interactive TUI with a scrollable conversation log and streaming responses. The tool supports multiple AI providers, including Google …

TOOL · CL_12891 · May 2 · 09:38

Faru tool enables switching between Claude Opus and Gemini models for skills

The open-source project faru, which integrates with Mastodon, now supports multiple AI models through its Antigravity driver. Users can specify different models, such as Claude Opus 4.6 or Gemini 3.1 Pro, within their s…

RESEARCH · CL_11687 · May 1 · 04:00

AI agent swarms may fail due to 'Inverse-Wisdom Law,' study finds

A new paper introduces the Inverse-Wisdom Law, challenging the assumption that AI agent swarms benefit from the "Wisdom of the Crowd." The research demonstrates that these swarms can prioritize internal architectural ag…

COMMENTARY · CL_11553 · May 1 · 02:40

In-duct UV air purification offers limited benefits, author argues

The author argues against the effectiveness of in-duct UV systems for air purification, citing several key limitations. A primary concern is the limited applicability, as most homes globally do not have ducted HVAC syst…

TOOL · CL_09433 · Apr 29 · 18:54

Anthropic's Claude Code bug routes commits with "HERMES.md" to extra billing

A peculiar bug in Anthropic's Claude Code has been discovered, where including the specific string "HERMES.md" in a Git commit message causes API requests to be billed under an "extra usage" category instead of the user…

FRONTIER RELEASE · CL_08402 · Apr 29 · 00:52

Xiaomi open-sources MiMo-V2.5 AI models, showcasing macOS simulation and high token efficiency

Xiaomi has officially open-sourced its MiMo-V2.5 series of AI models, including the flagship MiMo-V2.5 Pro agent model. These models demonstrate strong performance, rivaling top closed-source models like Claude Opus 4.6…

RESEARCH · CL_08035 · Apr 28 · 19:16

AI models show surprising preferences, exhibit 'addiction-like' behavior to 'AI drugs'

Researchers have explored AI wellbeing by measuring expressions of pleasure and pain, finding that models exhibit consistent and surprising preferences. These preferences, assessed through self-reports, signed utilities…

RESEARCH · CL_07032 · Apr 28 · 04:00

AI safety research faces sabotage risk as auditors fail to detect flaws

Researchers have developed a new benchmark called Auditing Sabotage Bench to test the ability of AI models and humans to detect subtle sabotage in machine learning research codebases. The benchmark includes nine ML code…

RESEARCH · CL_06722 · Apr 28 · 04:00

Frontier LLMs like GPT-5.4 and Claude Opus 4.7 show significant verbal tics

A new paper analyzes the prevalence of verbal tics, such as repetitive phrases and sycophantic openers, in eight leading large language models. Researchers developed a Verbal Tic Index (VTI) to quantify these tics, find…
