GPT-2 small
PulseAugur coverage of GPT-2 small — every cluster mentioning GPT-2 small across labs, papers, and developer communities, ranked by signal.
2 day(s) with sentiment data
-
New Unpack method deciphers transformer component interactions
Researchers have developed a new method called Unpack to analyze the internal workings of transformer models. This technique uses backward recursion to trace how different components, like attention and MLP layers, cont…
-
GPT-2 Small audit finds 'cryptographic keys' feature linked to task failure
Researchers have developed a novel audit pipeline to analyze the internal workings of the GPT-2 Small language model, specifically focusing on its performance on the Indirect Object Identification (IOI) task. The study …
-
New methodology probes causal features in transformer language models
Researchers have developed a five-stage methodology for causal feature analysis in transformer language models, demonstrating its application on GPT-2 small for the Indirect Object Identification task. The method uses a…