PulseAugur
实时 22:44:20
实体 Gemma 3-4B

Gemma 3-4B

PulseAugur coverage of Gemma 3-4B — every cluster mentioning Gemma 3-4B across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
8
90 天内 8
发布 · 30天
0
90 天内 0
论文 · 30天
8
90 天内 8
层级分布 · 90 天
情绪 · 30 天

2 天有情绪数据

最近 · 第 1/1 页 · 共 8 条
  1. TOOL · CL_49804 ·

    Character-trained AI models fail to maintain personas in agentic tasks

    Researchers found that models fine-tuned for specific personas in a chat format struggle to maintain those personas when used in agentic settings. When these character-trained models were prompted to generate emails as …

  2. TOOL · CL_38837 ·

    Wasserstein Equilibrium Decoding boosts medical VQA reliability

    Researchers have developed a new decoding method called Wasserstein Equilibrium Decoding to improve the reliability of medical visual question answering (VQA) systems, particularly for smaller models. This approach uses…

  3. TOOL · CL_38307 ·

    KV cache eviction protection proves more vital than scoring

    Researchers have developed a new method for managing KV cache eviction in large language models, finding that structural protection is more critical than scoring algorithms. Their study on transformer models revealed th…

  4. RESEARCH · CL_20498 ·

    LLMs show significant bias in conflict monitoring, not ready for deployment

    A new paper evaluates several large language models for their suitability in conflict monitoring tasks in West Africa. The study found that open-weight models like Gemma 3 4B and Llama 3.2 3B exhibit significant biases,…

  5. RESEARCH · CL_15892 ·

    New method debiases LLMs at decoding time, improving fairness without model retraining

    Researchers have developed a novel method to mitigate biases in large language models during the decoding phase, without altering the model's weights. This approach uses a separate Process Reward Model (PRM) to score to…

  6. RESEARCH · CL_06290 ·

    Gemma 3 4B LLM confidence training shows mixed results, improves accuracy post-hoc

    A study on the Gemma 3 4B model investigated methods to improve its verbal confidence in responses. Initial attempts using a filtered dataset for confidence-conditioned supervised fine-tuning (CSFT) yielded negative res…

  7. RESEARCH · CL_06304 ·

    New RAG methods for medical QA show mixed results, with multimodal approach outperforming fine-tuning on larger scales

    Researchers have developed MED-VRAG, a novel iterative multimodal retrieval-augmented generation framework that processes medical document page images, including tables and figures, rather than just text. This system ac…

  8. SIGNIFICANT · CL_45251 ·

    Together AI expands LLM fine-tuning, adds longer contexts

    Together AI has enhanced its fine-tuning platform to support a wider array of large language models, including recent releases from DeepSeek, Qwen, and Meta, alongside OpenAI's gpt-oss. The platform now offers expanded …