ENTITY Qwen2.5-72B

Qwen2.5-72B

PulseAugur coverage of Qwen2.5-72B — every cluster mentioning Qwen2.5-72B across labs, papers, and developer communities, ranked by signal.

Total · 30d

9 over 90d

Releases · 30d

0 over 90d

Papers · 30d

6 over 90d

SENTIMENT · 30D

3 days with sentiment data

RECENT · PAGE 1/1 · 9 TOTAL

TOOL · CL_178397 · Aug 3 · 04:00

New 'Chain-of-Models' method audits LLM bias

A new research paper introduces "Chain-of-Models" (CoM), a method for auditing Large Language Models (LLMs) for bias. CoM uses a second LLM to inspect the reasoning trace of a primary LLM before it delivers a final judg…

TOOL · CL_144285 · Jul 15 · 10:16

DeepSeek, GLM, and Qwen: Chinese LLMs Compared for Free API Use

Three leading Chinese AI labs, DeepSeek, Zhipu AI (GLM), and Alibaba Cloud (Qwen), offer powerful, free LLM APIs that cater to different project needs. DeepSeek-V2, with its Mixture-of-Experts architecture, provides the…

TOOL · CL_141836 · Jul 12 · 02:01

New framework unifies context engineering and fine-tuning for MMEA

Researchers have developed PTFEA, a novel framework that bridges the gap between context engineering and model fine-tuning for Multimodal Entity Alignment (MMEA). This framework theoretically demonstrates that prompt co…

RESEARCH · CL_128507 · Jul 4 · 00:00

New benchmarks and methods tackle LLM agent tool-use failures

Researchers are developing new methods to identify and mitigate failures in large language model (LLM) agents that use external tools. One approach, "Reason Less, Verify More," introduces deterministic pre-execution gat…

RESEARCH · CL_108834 · Jun 22 · 04:27

New speculative decoding methods boost LLM inference speed and safety

Researchers are developing advanced speculative decoding techniques to accelerate large language model inference. HyperDFlash optimizes decoding for DeepSeek-V4's multi-hyper-connection architecture, improving draft acc…

RESEARCH · CL_93583 · Jun 15 · 10:30

New DoubtProbe defense significantly reduces LLM jailbreaks

Researchers have developed DoubtProbe, a novel defense mechanism designed to counter jailbreaking attempts on large language models (LLMs) in black-box scenarios. This dual-branch framework combines structural verificat…

RESEARCH · CL_06733 · Apr 28 · 04:00

AgentHER framework boosts LLM agent training with failed trajectory relabeling

Researchers have developed AgentHER, a new framework designed to improve the training of LLM agents by repurposing failed trajectories. The system adapts Hindsight Experience Replay to natural language, identifying alte…

RESEARCH · CL_05137 · Apr 27 · 04:00

HACHIMI generates 1M student personas for educational LLMs using orchestrated agents

Researchers have developed HACHIMI, a novel multi-agent framework designed to generate scalable and controllable student personas for educational large language models. This system addresses limitations in prior methods…

TOOL · CL_47672 · Jan 12 · 00:00

Multi-node training enables scaling foundation models across GPU clusters

Training large foundation models necessitates distributing the workload across numerous GPUs housed in multiple interconnected machines, a process known as multi-node training. This approach is essential for handling mo…
