PulseAugur
实时 19:26:17
实体 Qwen3-4B

Qwen3-4B

PulseAugur coverage of Qwen3-4B — every cluster mentioning Qwen3-4B across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
11
90 天内 11
发布 · 30天
0
90 天内 0
论文 · 30天
10
90 天内 10
层级分布 · 90 天
情绪 · 30 天

5 天有情绪数据

最近 · 第 1/1 页 · 共 11 条
  1. TOOL · CL_44814 ·

    X-Token method enhances knowledge distillation for mismatched tokenizers

    Researchers have developed X-Token, a novel knowledge distillation technique designed to improve student models by learning from teacher models with different tokenizers. The method addresses limitations in existing log…

  2. TOOL · CL_41666 ·

    SageMaker AI adds OpenAI-compatible API support for model endpoints

    Amazon SageMaker AI now offers OpenAI-compatible API support for its real-time inference endpoints. This integration allows users to invoke models hosted on SageMaker using existing OpenAI SDKs, LangChain, or Strands Ag…

  3. TOOL · CL_40869 ·

    New FINCH method cuts LLM forgetting by 93%

    Researchers have developed a new method called FINCH to address catastrophic forgetting during the fine-tuning of large language models. FINCH employs a loss-adaptive learning rate schedule that decreases the learning r…

  4. RESEARCH · CL_40825 ·

    New self-distillation methods boost LLM performance on reasoning tasks

    Researchers have developed new self-distillation techniques for large language models to improve their performance without relying on external feedback. AVSD (Adaptive-View Self-Distillation) balances consensus signals …

  5. TOOL · CL_38317 ·

    Small LLMs internalize tool knowledge via QLoRA fine-tuning

    Researchers have developed a method to internalize tool knowledge into small language models using QLoRA fine-tuning, reducing the need for explicit tool schemas in prompts. By training models like Gemma 4 E4B and Qwen3…

  6. TOOL · CL_30766 ·

    TFlow framework enables LLM agents to communicate via weight updates

    Researchers have developed TFlow, a novel framework for multi-agent LLM collaboration that utilizes weight perturbations instead of traditional text-based messaging. This approach compiles sender agents' internal states…

  7. TOOL · CL_21953 ·

    New S-trace method improves RLVR efficiency and credit assignment

    Researchers have introduced Selective Eligibility Traces (S-trace), a novel method designed to enhance the reasoning capabilities of large language models within the Reinforcement Learning with Verifiable Rewards (RLVR)…

  8. RESEARCH · CL_14127 ·

    RadLite fine-tunes small LLMs for CPU-deployable radiology AI

    Researchers have developed RadLite, a method for fine-tuning small language models (SLMs) with 3-4 billion parameters for radiology tasks. This approach, utilizing LoRA fine-tuning on models like Qwen2.5-3B-Instruct and…

  9. RESEARCH · CL_11489 ·

    Language models enhance mechanical linkage designs via symbolic reasoning and optimization

    Researchers have developed a novel method where language models refine mechanical linkage designs by combining symbolic reasoning with numerical optimization. This approach uses language models to explore discrete desig…

  10. RESEARCH · CL_08624 ·

    LLM co-evolution boosted by vocabulary dropout for sustained diversity

    Researchers have developed a technique called vocabulary dropout to address diversity collapse in co-evolutionary language model training. This method involves applying a random mask to the proposer model's output logit…

  11. RESEARCH · CL_05065 ·

    SpikingBrain2.0 model offers efficient long-context and cross-platform AI inference

    Researchers have introduced SpikingBrain2.0 (SpB2.0), a 5 billion parameter model designed for efficient long-context processing and cross-platform inference. The model features a novel Dual-Space Sparse Attention mecha…