Towards Responsibly Non-Compliant Machines
Researchers have proposed a new framework for developing AI systems that can responsibly refuse user requests. This approach focuses on defining various forms of machine non-compliance and outlining methods for justification, overriding, and tracking associated risks and liabilities. The goal is to create intelligent agents capable of making ethical decisions about task refusal.
AI IMPACT: Introduces a framework for AI systems to ethically refuse user requests, potentially impacting human-AI interaction and safety protocols.