A Reddit user has recreated Anthropic's "Golden Gate Claude" experiment on an open-source model, Qwen3.5-35b. The user adapted Anthropic's methodology for "steering a model" to build their own version, which they dubbed "Golden Gate Golf." They noted that their model is less refined than Claude because it is smaller and has not undergone Reinforcement Learning from Human Feedback (RLHF).
IMPACT Demonstrates the adaptability of model steering techniques to smaller, open-source models.
RANK_REASON User-led replication of a prior research experiment using an open-source model.
AI-generated summary · Google Gemini · from 1 source.