A shared playbook for trustworthy third-party evaluations
OpenAI shares guidance on third-party AI evaluations, covering how to assess model capabilities, safeguards, and validity for frontier systems.