OpenAI Introduces New Governance Model For AI Safety Oversight


OpenAI, the leading artificial intelligence (AI) research laboratory, has introduced a new governance structure that gives the board the power to block the release of AI models even if company leaders deem them safe. This comes after an eventful period at OpenAI that highlighted the need to balance power between directors and executives when releasing advanced AI systems.

Background information on OpenAI’s governance changes

In mid-2022, OpenAI temporarily dismissed CEO Sam Altman from his position after pushing for the release of the DALL-E image generator against the board’s recommendations. This event underscored the delicate balance of decision-making power at AI research companies like OpenAI.

To prevent similar situations in the future, OpenAI has formalized a governance structure that allows a newly created internal advisory board to provide binding guidelines on the release of AI models. The board can override executive decisions to release systems by withholding important computing resources needed to deploy them.

Three-pronged AI safety approach

OpenAI’s updated governance regime provides a three-tiered framework for evaluating the security of AI systems:

  1. Safety Systems Team: Focuses on existing products such as GPT-4 to ensure they meet adequate safety standards before they are released to market.
  2. Preparedness team: Newly formed group led by Aleksander Madry of MIT assessing previously unreleased, cutting-edge models.
  3. Superalignment team: Led by Ilya Sutskever, this group focuses on hypothetical but immensely capable future AI systems.
Team Responsibility
Safety systems Ensure that released AI models such as GPT-4 are secure
Preparedness Evaluate the risks of unreleased advanced models
Super alignment Research extremely powerful future AI systems

Each of them plays a crucial role in analyzing AI safety at different stages of technological maturity.

Monthly reporting to advisory group

The preparedness team led by Madry will provide monthly reports to OpenAI’s internal security advisory group, summarizing risk assessments of AI models in development. The team assigns each system one of four risk assessments:

Based on these reports, the advisory group will decide whether to allow the release of medium or low risk models. However, OpenAI’s board retains the authority to override the executive leadership team’s recommendations.

Objectives of the new management approach

OpenAI states that the overall objective of the three-tiered framework and monthly reporting process is to:

  • Identify and fix potential security issues before releasing AI systems
  • Reduce risks through model adjustments
  • Quantify the effectiveness of risk mitigation measures

Furthermore, granting veto power to the advisory board and directors is intended to ensure that profit motives and executive pressure do not result in the premature introduction of unstable or dangerous AI technologies.

Madry expressed optimism that OpenAI’s comprehensive governance guidelines will encourage other AI labs to implement similarly robust oversight procedures. He stated that “AI is not something that just happens to us and can be good or bad. It is something we shape.”

OpenAI’s pledges to advance AI governance

In addition to changes in internal governance, OpenAI has led efforts to develop best practices for the ethical development and deployment of AI systems.

OpenAI and other major AI labs have made voluntary commitments to advance the safe, secure, and trustworthy use of AI technology worldwide. These commitments are intended to advance AI governance across the industry, in line with existing regulations.

Specific voluntary commitments include:

  • Red-team testing of AI systems to identify potential cases of abuse or societal dangers
  • Research into quantitative techniques for evaluating hazardous capabilities in models
  • Investing in common AI safety practices aligned with large language models
  • Inform policymakers about suitable AI governance frameworks

OpenAI argues that collaboration with government agencies and civil society groups is critical to establishing new AI policy regimes. The voluntary commitments from OpenAI and its peer organizations serve as a promising starting point for expanding AI governance beyond internal controls.

Outlook on OpenAI’s governance framework

OpenAI introduces formal oversight procedures and mechanisms for compelling executive decisions and represents a critical development in emerging AI governance regimes. As models become more sophisticated and pose greater social risks, precautionary frameworks that deprioritize profit or innovation speed will become increasingly necessary.

Building a secure technology infrastructure, guided by ethical priorities, is critical to safely integrating transformative technologies such as AI. OpenAI’s governance model provides a blueprint for balancing the vast opportunities and inherent risks associated with artificial general intelligence. If similar oversight frameworks are more widely adopted, they could facilitate the development of AI for the long-term benefit of humanity.

🌟 Do you have burning questions about OpenAI’s new governance model for AI security? Do you need some extra help with AI tools or something else?

💡 Feel free to send an email to Arva, our expert at OpenAIMaster. Send your questions to and Arva will be happy to help you!

Leave a Comment