PulseAugur

Gemini 1.5

PulseAugur coverage of Gemini 1.5: every cluster mentioning Gemini 1.5 across labs, papers, and developer communities, ranked by signal.

Total · 30 days
6
Last 90 days: 6
Releases · 30 days
0
Last 90 days: 0
Papers · 30 days
4
Last 90 days: 4
Tier distribution · 90 days
Timeline
  1. 2026-05-20 product_launch Google launched its Gemini 1.5 series of AI models.
Sentiment · 30 days

2 days with sentiment data

Recent · Page 1 of 1 · 6 items
  1. TOOL · CL_44506 ·

    Specialized 3B-parameter AI model outperforms frontier APIs on OCR tasks

    A specialized 3-billion-parameter AI model has outperformed leading commercial frontier APIs in structured OCR tasks, demonstrating that domain-specific fine-tuning can surpass sheer model scale. This specialized model …

  2. SIGNIFICANT · CL_40264 ·

    Google launches Gemini 1.5, addresses AI dialogue leaks

    Google has unveiled its Gemini 1.5 series of models, signaling a significant advancement in its AI capabilities. The company is also addressing user concerns regarding potential 'dialogue leaks' associated with its AI t…

  3. TOOL · CL_18789 ·

    New MSI metric reveals nuanced bias in LLMs, with distillation reintroducing bias

    Researchers have developed a new metric, the Moral Sensitivity Index (MSI), to evaluate contextual bias in large language models. This index quantifies the probability of biased output across a seven-tier stress test, m…

  4. RESEARCH · CL_18669 ·

    UnAC method enhances LMMs for complex multimodal reasoning with adaptive prompting

    Researchers have introduced UnAC, a novel multimodal prompting method designed to enhance the reasoning capabilities of Large Multimodal Models (LMMs) on complex visual tasks. This method employs adaptive visual prompti…

  5. RESEARCH · CL_13057 ·

    GPT-5.5 and Opus 4.7 show systematic reasoning failures on ARC-AGI-3 benchmark

    A new benchmark, ARC-AGI-3, has revealed significant reasoning errors in advanced AI models like GPT-5.5 and Opus 4.7. These models achieved a mere 0.8% success rate on the benchmark, highlighting persistent gaps in abs…

  6. TOOL · CL_17686 ·

    LLMs fail 'pass the butter' robot test, scoring far below human performance

    A new evaluation called Butter-Bench has revealed that current state-of-the-art large language models struggle significantly with controlling robots for practical tasks. In tests designed to assess their ability to perf…