PulseAugur
实时 21:57:17
实体 Qwen2.5

Qwen2.5

PulseAugur coverage of Qwen2.5 — every cluster mentioning Qwen2.5 across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
12
90 天内 12
发布 · 30天
0
90 天内 0
论文 · 30天
12
90 天内 12
层级分布 · 90 天
情绪 · 30 天

4 天有情绪数据

最近 · 第 1/1 页 · 共 12 条
  1. RESEARCH · CL_44784 ·

    New methods enhance on-policy distillation for LLM training

    Researchers have developed new methods to improve on-policy distillation (OPD), a technique for training smaller language models using larger ones. One approach, TIP, identifies informative tokens by analyzing student e…

  2. TOOL · CL_35797 ·

    Fine-tuning Qwen2.5 with LoRA yields structured, not necessarily correct, outputs

    This article explores the process of fine-tuning the Qwen2.5 model using the LoRA technique. It demonstrates that while fine-tuning can lead to more structured outputs, this does not necessarily equate to improved reaso…

  3. TOOL · CL_36540 ·

    LLM judge circuits revealed in Gemma, Qwen, Llama models

    Researchers have identified a generalized 'Latent Evaluator' sub-graph within large language models like Gemma-3, Qwen2.5, and Llama-3 that is responsible for making judgments. This sub-graph is located in the mid-to-la…

  4. TOOL · CL_29136 ·

    Tiny models outperform frontier AI in agent coding benchmark

    A recent agent coding benchmark revealed that smaller, more efficient models are outperforming larger, frontier models. The SmolLM3 3B model, capable of running on a laptop, achieved a score of 93.3, significantly surpa…

  5. RESEARCH · CL_21935 ·

    Apple's RVPO framework enhances LLM alignment by penalizing reward variance

    Researchers have introduced Reward-Variance Policy Optimization (RVPO), a novel framework designed to improve the alignment of large language models with multiple objectives. Unlike existing methods that average rewards…

  6. TOOL · CL_20626 ·

    Mistral, QWen models show divergent strategies in biomedical text simplification

    A new research paper compares the text simplification strategies of Mistral-Small and QWen2.5 when applied to biomedical information. The study found that Mistral-Small effectively balances readability and accuracy, per…

  7. TOOL · CL_15849 ·

    Component-aware self-speculative decoding boosts hybrid language model inference

    Researchers have developed a new method called component-aware self-speculative decoding, which enhances the efficiency of hybrid language models. This technique leverages the internal architectural differences within t…

  8. RESEARCH · CL_15547 ·

    HeadQ: Model-Visible Distortion and Score-Space Correction for KV-Cache Quantization

    Researchers are developing several novel methods to optimize the Key-Value (KV) cache in large language models, which is a major bottleneck for long-context processing. These approaches include training models to inhere…

  9. RESEARCH · CL_11730 ·

    LLMs compute Nash equilibrium but suppress it via final-layer overrides

    Researchers have investigated why large language models (LLMs) deviate from Nash equilibrium play in strategic interactions. By examining open-source models like Llama-3 and Qwen2.5, they found that while opponent histo…

  10. RESEARCH · CL_09890 ·

    CoQuant paper introduces joint projection for efficient LLM mixed-precision quantization

    Researchers have introduced CoQuant, a novel method for mixed-precision quantization in Large Language Models (LLMs). This technique addresses limitations in existing approaches by jointly considering both weight and ac…

  11. RESEARCH · CL_06709 ·

    Diffusion LLMs show greater representational redundancy, enabling compression

    A new paper analyzes the internal representations of autoregressive (AR) and diffusion language models (dLLMs). Researchers found that diffusion models create more global representations with early-layer redundancy, unl…

  12. RESEARCH · CL_05005 ·

    New metrics reveal RLVR doesn't guarantee reliable reasoning in LLMs

    A new paper questions the effectiveness of Reinforcement Learning from Verifiable Rewards (RLVR) in ensuring that language models' reasoning chains accurately reflect their problem-solving processes. Researchers introdu…