实体 Open Pre Trained Transformer

Open Pre Trained Transformer

PulseAugur coverage of Open Pre Trained Transformer — every cluster mentioning Open Pre Trained Transformer across labs, papers, and developer communities, ranked by signal.

Show in brief

总计 · 30天

90 天内 7

发布 · 30天

90 天内 0

论文 · 30天

90 天内 6

层级分布 · 90 天

情绪 · 30 天

2 天有情绪数据

最近 · 第 1/1 页 · 共 7 条

RESEARCH · CL_41829 · May 20 · 01:44

Self-training restructures language models, research finds

A new research paper challenges the common understanding of self-training in language models, suggesting it restructures rather than flattens language. The study found that while surface-level linguistic features like d…
TOOL · CL_40799 · May 19 · 12:39

New AR1-ZO method boosts LoRA fine-tuning with Zeroth-Order optimization

Researchers have developed AR1-ZO, a novel method for fine-tuning large language models using Zeroth-Order optimization and Low-Rank Adaptation (LoRA). This technique addresses the challenge of effectively increasing Lo…
TOOL · CL_38117 · May 19 · 04:51

Opt adopts Anthropic's Claude Enterprise for ad operations

Opt, a Japanese advertising company, has fully adopted Anthropic's Claude Enterprise Plan across its organization. This strategic move aims to revolutionize the operational structure of AI agent-based advertising. The c…
RESEARCH · CL_15913 · May 5 · 04:00

Researchers explore weight decay, in-context learning, and acceleration for Transformer models

Researchers have developed several new methods to improve the efficiency and theoretical understanding of Transformer models. One paper provides a functional-analytic characterization of weight decay, demonstrating its …
RESEARCH · CL_14113 · May 1 · 16:30

Researchers explore efficient transformers via attention control and algorithmic capture

Researchers are exploring methods to enhance transformer efficiency and understanding. One paper introduces Budgeted Attention Allocation, a head-gating mechanism that allows for cost-quality trade-offs. Another study d…
RESEARCH · CL_09277 · Apr 29 · 16:45

AI model evaluations are becoming a costly bottleneck, surpassing training expenses

AI model evaluations are becoming prohibitively expensive, with recent benchmarks costing tens of thousands of dollars and consuming thousands of GPU hours. This high cost is particularly pronounced for agent-based eval…
RESEARCH · CL_05407 · Apr 20 · 13:37

AdaLeZO speeds up LLM fine-tuning with adaptive layer sampling

Researchers have developed AdaLeZO, a new framework designed to make Zeroth-Order (ZO) optimization more efficient for fine-tuning Large Language Models. This method addresses the slow convergence and high variance typi…

Self-training restructures language models, research finds

New AR1-ZO method boosts LoRA fine-tuning with Zeroth-Order optimization

Opt adopts Anthropic's Claude Enterprise for ad operations

Researchers explore weight decay, in-context learning, and acceleration for Transformer models

Researchers explore efficient transformers via attention control and algorithmic capture

AI model evaluations are becoming a costly bottleneck, surpassing training expenses

AdaLeZO speeds up LLM fine-tuning with adaptive layer sampling