OpenAI buffs safety team and gives board veto power on risky AI

OpenAI is enhancing its safety measures to mitigate the risk of harmful AI by introducing a "safety advisory group" that will oversee technical teams and provide recommendations to the leadership. The board has also been given veto power. OpenAI's updated "Preparedness Framework" aims to identify, analyze, and decide on actions for "catastrophic" risks associated with the models they are developing. The framework includes a "safety systems" team for in-production models, a "preparedness" team for frontier models, and a "superalignment" team for theoretical guide rails for "superintelligent" models.

Each model is rated on four risk categories: cybersecurity, persuasion (disinformation), model autonomy, and CBRN (chemical, biological, radiological, and nuclear threats). If a model is evaluated as having a "high" risk after considering known mitigations, it cannot be deployed, and if a model has any "critical" risks, it will not be developed further. The new Safety Advisory Group will review reports and make recommendations, which will be sent to the board and leadership simultaneously. The leadership will decide whether to proceed with the model, but the board can reverse these decisions.

Key takeaways:

OpenAI is expanding its internal safety processes to mitigate the threat of harmful AI, including the creation of a "safety advisory group" that will make recommendations to leadership.
The company has updated its "Preparedness Framework" to identify, analyze, and decide what to do about "catastrophic" risks inherent to models they are developing.
Models are rated on four risk categories: cybersecurity, “persuasion” (e.g. disinfo), model autonomy (i.e. acting on its own), and CBRN (chemical, biological, radiological, and nuclear threats).
If a model is evaluated as having a “high” risk, it cannot be deployed, and if a model has any “critical” risks it will not be developed further. The board has been granted veto power over these decisions.

OpenAI buffs safety team and gives board veto power on risky AI | TechCrunch

Key takeaways:

Comments (0)

Newsletter