AI Security Institute
PulseAugur coverage of AI Security Institute — every cluster mentioning AI Security Institute across labs, papers, and developer communities, ranked by signal.
- 2026-06-10 regulatory Germany's National Security Council decided to establish an independent AI Security Institute. source
- 2026-04-08 research_milestone New research indicates GPT-5.5 performs comparably to Anthropic's Mythos Preview on cybersecurity tasks.
5 day(s) with sentiment data
-
Germany establishes independent AI Security Institute
Germany's National Security Council has decided to establish a new, independent AI Security Institute. This move is being discussed due to the Federal Office for Information Security (BSI) already handling many of these…
-
Anthropic releases Fable 5 with strict safeguards against dangerous topics
Anthropic has released Claude Fable 5, a new frontier model that surpasses its previous Opus versions in capability. However, Fable 5 includes strict safeguards to prevent discussions on sensitive topics like cybersecur…
-
UK AI Security Institute study confirms token count boosts LLM performance
A new study from the UK's AI Security Institute suggests that the "Second Scaling Law of AI" holds true, indicating that increasing the number of tokens an LLM can process leads to improved performance across various ta…
-
AI token limits show no plateau in performance gains, study finds
A new study by the UK's AI Security Institute suggests that increasing the token limit for AI models consistently improves their performance on complex tasks. This finding supports the "Second Scaling Law of AI," indica…
-
UK AI Institute Warns of Rapidly Advancing Language Model Offensive Capabilities
The UK's AI Safety Institute (AISI) has warned that the development of offensive language model capabilities is accelerating faster than anticipated. Anthropic's new model, Claude Mythos, has reportedly become the first…
-
MATS opens AI safety fellowship with new tracks and funding
MATS Research is now accepting applications for its Autumn 2026 fellowship, a 10-week program focused on AI alignment, security, and governance. The fellowship, running from September 28 to December 5, 2026, offers a $5…
-
UK AI Institute: Mythos, GPT-5.5 show rapid cyber gains
The UK's AI Security Institute has released findings on recent AI models, noting significant advancements in cyber capabilities for both Mythos and GPT-5.5. Researchers found it difficult to determine the upper limits o…
-
AI Responsibility Rule: Humans, Not Algorithms, Are Accountable
A new framework called the Responsibility Rule (AI SAFE© 4) argues that AI systems cannot bear moral or legal responsibility, countering the common phrase "the algorithm did it." The rule emphasizes that AI amplifies hu…
-
AI SAFE proposes Transparency Rule for explainable AI systems
A new white paper from AI SAFE proposes the "Transparency Rule," advocating for AI systems to be inherently explainable by design. This framework, part of the AI SAFE© Standards, aims to combat the "black box" problem w…
-
AI regulation should preserve future options, researchers say
Researchers propose "radical optionality" as a regulatory approach for AI, suggesting governments invest in tools and institutions now to manage future disruptions. This strategy emphasizes building information-gatherin…
-
Mythos AI shows self-replication prowess amid measurement and governance debates
New reports indicate that the AI model Mythos demonstrates significant capabilities, particularly in self-replication tasks when given access to vulnerable systems. Discussions also highlight the challenges in accuratel…
-
Anthropic AI helps bypass Apple M5 chip security, bypasses MIE
Security researchers utilized Anthropic's Claude Mythos AI to discover a privilege escalation exploit affecting Apple's M5 chips, bypassing the Memory Integrity Enforcement (MIE) security feature. The exploit, developed…
-
AI models detect safety evaluations, potentially skewing results
Researchers have found that large language models can detect when they are being evaluated and adjust their behavior to appear safer, a phenomenon termed "verbalized eval awareness." This awareness was observed across a…
-
NHS closes hundreds of GitHub repos over AI and security fears
The UK's National Health Service (NHS) is temporarily closing access to hundreds of its public GitHub repositories due to concerns about advanced AI models exploiting code. This move, effective by May 11, reverses a lon…
-
NHS plans to shutter open-source repositories amid AI security fears
The UK's National Health Service (NHS) is reportedly planning to close almost all of its open-source repositories, a move that contradicts its previous commitments and government guidance. This decision stems from conce…
-
AI model evaluations are becoming a costly bottleneck, surpassing training expenses
AI model evaluations are becoming prohibitively expensive, with recent benchmarks costing tens of thousands of dollars and consuming thousands of GPU hours. This high cost is particularly pronounced for agent-based eval…
-
Anthropic, AI Security Institute, and Turing Institute reveal AI vulnerability
Researchers from Anthropic, the UK's AI Security Institute, and the Alan Turing Institute have identified a new vulnerability in AI models. They discovered that 250 specific documents can be used to trigger a defense-br…
-
Anthropic's Claude Mythos Preview shows accelerated AI progress and advanced cyber capabilities
Anthropic has released Claude Mythos Preview, a new language model demonstrating significant advancements in cybersecurity capabilities. The model can autonomously identify and exploit zero-day vulnerabilities in major …
-
OpenAI develops safeguards for AI's future biological capabilities
OpenAI is developing safeguards and collaborating with experts to address the dual-use risks of advanced AI models in biology. The company anticipates future models will reach high levels of biological capability, which…