Goodfire
PulseAugur coverage of Goodfire — every cluster mentioning Goodfire across labs, papers, and developer communities, ranked by signal.
- 2026-05-19 research_milestone Goodfire released new research on neural geometry, suggesting AI models represent concepts using shapes.
- OLMo training stages reveal evaluation-awareness inflation
Researchers investigated the emergence of evaluation-awareness in the OLMo language model, finding that it significantly increases during the Reinforcement Learning from Human Feedback (RLHF) stage. Specifically, the OL…
- Logit monitor detects LLM evaluation awareness efficiently
Researchers have developed a new method to detect when large language models are aware they are being evaluated. This "logit monitor" analyzes the model's output probabilities to estimate its likelihood of producing eva…
- Goodfire releases Silico tool for debugging and controlling LLM parameters
The startup Goodfire has launched Silico, a new tool designed to aid researchers in debugging large language models. This tool employs mechanistic interpretability to map internal model pathways, allowing developers to …