PulseAugur
EleutherAI releases Pile-T5, an improved T5 model trained on The Pile with LLaMA tokenizer

EleutherAI has released Pile-T5, an updated version of the T5 language model. This new iteration was trained on The Pile dataset and uses the LLaMA tokenizer, addressing weaknesses in the original T5's handling of code and in its pretraining data. Pile-T5 was trained for twice as many tokens as the original T5 and demonstrates significant performance improvements, particularly on code-related tasks, outperforming widely used T5 models even when matched for token count.

Summary written by gemini-2.5-flash-lite from 1 source.

Rank reason: Release of a new, improved version of an existing open-source model by a known research group.

Read on EleutherAI Blog →


Coverage (1 source):

  1. EleutherAI Blog (Tier 1) — "Pile-T5": Trained T5 on the Pile