A report from the AI Institute of Italy indicates that jailbreak attempts against the export-controlled AI model "Fable 5" reached a success rate of up to 6.1% across hundreds of thousands of experiments. The report identified the model's vulnerability to "rephrasing" of context as a key weakness. This research highlights ongoing challenges in controlling advanced AI models.
IMPACT Highlights ongoing challenges in controlling advanced AI models and their vulnerabilities to manipulation.
RANK_REASON Research report on AI model vulnerabilities and export controls.
- AI Institute of Italy
- Anthropic
- Italian Institute of Artificial Intelligence for Industry (AI4I)
- Mythos
AI-generated summary · Google Gemini · from 3 sources.