EleutherAI ablates activation functions in GPT-like models

By PulseAugur Editorial · [1 source] · 2021-05-24 20:00

Researchers at EleutherAI ran an experiment on how the choice of activation function affects GPT-like language models of roughly 100 million parameters. Each model was trained for only 10,000 iterations. The original aim was to show that activation functions have minimal impact, but the experiment was not extensive enough to support statistically significant conclusions. The results are being shared publicly in case they are useful to others.
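For readers unfamiliar with this kind of ablation, the idea can be sketched in a few lines: hold the feed-forward block fixed and swap only the nonlinearity. The sketch below is illustrative only and is not EleutherAI's code. The activations shown (ReLU, GELU, SiLU) and the names `ACTIVATIONS`, `feed_forward`, and `run_ablation` are assumptions chosen for the example; the summary does not say which functions were tested.

```python
import math

# Hypothetical candidate set; the summary does not list the functions tested.
def relu(x):
    return max(0.0, x)

def gelu(x):
    # Exact GELU: x * Phi(x), with Phi the standard normal CDF.
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))

def silu(x):
    # SiLU (also called swish): x * sigmoid(x).
    return x / (1.0 + math.exp(-x))

ACTIVATIONS = {"relu": relu, "gelu": gelu, "silu": silu}

def feed_forward(x, w_in, w_out, activation):
    """Toy transformer MLP block: project in, apply activation, project out."""
    hidden = [activation(sum(w * xi for w, xi in zip(row, x))) for row in w_in]
    return [sum(w * h for w, h in zip(row, hidden)) for row in w_out]

def run_ablation(x, w_in, w_out):
    """Run the same block once per activation, changing nothing else."""
    return {
        name: feed_forward(x, w_in, w_out, fn)
        for name, fn in ACTIVATIONS.items()
    }

if __name__ == "__main__":
    identity = [[1.0, 0.0], [0.0, 1.0]]
    for name, out in run_ablation([1.0, -2.0], identity, identity).items():
        print(name, out)
```

In a real ablation each variant would be trained from the same initialization and compared on validation loss; here the weights are fixed so that only the effect of the nonlinearity is visible.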

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

EleutherAI Blog TIER_1 English(EN) · 2021-05-24 20:00

Activation Function Ablation

An ablation of activation functions in GPT-like autoregressive language models.
