Avoiding disaster by mandating AI testing

Texas Deadly Floods: Who Is To Blame for the Devastation?

Home>
Contributors>

        Avoiding disaster by mandating AI testing
    

        Getty Images
    
By Kevin FrazierJun 12, 2023
Kevin Frazier
 See More Writings by author 
Kevin Frazier will join the Crump College of Law at St. Thomas University as an Assistant Professor starting this Fall. He currently is a clerk on the Montana Supreme Court. 

Bad weather rarely causes a plane to crash — but the low probability of such a crash isn’t because nature lacks the power to send a plane woefully off course. In fact, as recently as 2009, a thunderstorm caused a crash resulting in 228 deaths. 

Instead, two main factors explain why bad weather no longer poses an imminent threat to your longevity: first, we’ve improved our ability to detect storms. And, second and most importantly, we’ve acknowledged that the risks of flying through such storms isn’t worth it. The upshot is that when you don’t know where you’re going and if your plane can get you there, you should either stop or, if possible, postpone the trip until the path is in sight and the plane is flight worthy. 

The leaders of AI look a lot like pilots flying through a thunderstorm — they can’t see where they’re headed and they’re unsure of the adequacy of their planes. Before a crash, we need to steer AI development out of the storm and onto a course where everyone, including the general public, can safely and clearly track its progress. 

Despite everyone from Sam Altman, the CEO of OpenAI, to Rishi Sunak, the Prime Minister of the UK, acknowledging the existential risksposed by AI, some AI optimists are ignoring the warning lights and pushing for continued development. Take Reid Hoffman for example. Hoffman, the co-founder of LinkedIn, has been "engaged in an aggressive thought-leadership regimen to extol the virtues of A.I” in recent months in an attempt to push back against those raising redflags, according to The New York Times. 

Hoffman and others are engaging in AI both-side-ism, arguing that though AI development may cause some harm, it will also create societally beneficial outcomes.The problem is that such an approach doesn’t weigh the magnitude of those goods and evils. And, according to individuals as tech savvy as Prime Minister Sunak, those evils may be quite severe. In other words, the good and bad of AI is not an apples-to-apples comparison -- it’s more akin to an apples to obliterated oranges situation (the latter referring to the catastrophic outcomes AI may lead to). 

No one doubts that AI development in “clear skies” could bring about tremendous good.For instance, it’s delightful to think of a world in which AI replaces dangerous jobs and generates sufficient wealth to fund a universal basic income.The reality is that storm clouds have already gathered.The path to any sort of AI utopia is not only unclear but, more likely, unavailable. 

Rather than keep AI development in the air during such conditions, we need to issue a sort of ground stop and test how well different AI tools can navigate the chaotic political, cultural, and economic conditions that define the modern era. This isn’t a call for a moratorium on AI development -- that’s already been called for (and ignored). Rather, it’s a call for test flights. 

“Model evaluation” is the AI equivalent of such test flights. The good news is researchers such as Toby Shevlane and others have outlined specific ways for AI developers to use such evaluations to identify dangerous capabilities and measure the probability of AI tools to cause harm in application. Shevlane calls on AI developers to run these "test flights", to share their results with external researchers, and to have those results reviewed by an independent, external auditor to assess the safety of deploying an AI tool. 

Test flights allow a handful of risk-loving people to try potentially dangerous technology in a controlled setting. Consider that back in 2010 one of Boeing's test flights of its 787 Dreamliner resulted in an onboard fire. Only after detecting and fixing such glitches did the plane become available for commercial use. 

There’s a reason we only get on planes that have been tested and that have a fixed destination. We need to mandate test flights for AI development. We also need to determine where we expect AI to take us as a society. AI leaders may claim that it's on Congress to require such testing and planning, but the reality is that those leaders could and should self-impose such requirements. 

The Wright Brothers did not force members of the public to test their planes — nor should AI developers.
Latest news
Ethics & Leadership
        Shifting the Spotlight: Trump’s Epstein Strategy Echoes His 2016 Playbook
    
David L. Nevins
7m
Democracy
        Imagining The Path Forward for The Healthy Democracy Ecosystem
    
Kristina Becvar
8h
Governance & Legislation
        Prescribing Produce, Powering Markets: How D.C. Is Rethinking Food Access As Health Policy
    
Bennett Gillespie
8h
Media & Technology
        AI Progress Delayed Is Progress Denied
    
Kevin Frazier
9h

The Top 5

Discover More

Kevin Frazier will join the Crump College of Law at St. Thomas University as an Assistant Professor starting this Fall. He currently is a clerk on the Montana Supreme Court.

Bad weather rarely causes a plane to crash — but the low probability of such a crash isn’t because nature lacks the power to send a plane woefully off course. In fact, as recently as 2009, a thunderstorm caused a crash resulting in 228 deaths.

Instead, two main factors explain why bad weather no longer poses an imminent threat to your longevity: first, we’ve improved our ability to detect storms. And, second and most importantly, we’ve acknowledged that the risks of flying through such storms isn’t worth it. The upshot is that when you don’t know where you’re going and if your plane can get you there, you should either stop or, if possible, postpone the trip until the path is in sight and the plane is flight worthy.

The leaders of AI look a lot like pilots flying through a thunderstorm — they can’t see where they’re headed and they’re unsure of the adequacy of their planes. Before a crash, we need to steer AI development out of the storm and onto a course where everyone, including the general public, can safely and clearly track its progress.

Despite everyone from Sam Altman, the CEO of OpenAI, to Rishi Sunak, the Prime Minister of the UK, acknowledging the existential risksposed by AI, some AI optimists are ignoring the warning lights and pushing for continued development. Take Reid Hoffman for example. Hoffman, the co-founder of LinkedIn, has been "engaged in an aggressive thought-leadership regimen to extol the virtues of A.I” in recent months in an attempt to push back against those raising redflags, according to The New York Times.

Hoffman and others are engaging in AI both-side-ism, arguing that though AI development may cause some harm, it will also create societally beneficial outcomes.The problem is that such an approach doesn’t weigh the magnitude of those goods and evils. And, according to individuals as tech savvy as Prime Minister Sunak, those evils may be quite severe. In other words, the good and bad of AI is not an apples-to-apples comparison -- it’s more akin to an apples to obliterated oranges situation (the latter referring to the catastrophic outcomes AI may lead to).

No one doubts that AI development in “clear skies” could bring about tremendous good.For instance, it’s delightful to think of a world in which AI replaces dangerous jobs and generates sufficient wealth to fund a universal basic income.The reality is that storm clouds have already gathered.The path to any sort of AI utopia is not only unclear but, more likely, unavailable.

Rather than keep AI development in the air during such conditions, we need to issue a sort of ground stop and test how well different AI tools can navigate the chaotic political, cultural, and economic conditions that define the modern era. This isn’t a call for a moratorium on AI development -- that’s already been called for (and ignored). Rather, it’s a call for test flights.

“Model evaluation” is the AI equivalent of such test flights. The good news is researchers such as Toby Shevlane and others have outlined specific ways for AI developers to use such evaluations to identify dangerous capabilities and measure the probability of AI tools to cause harm in application. Shevlane calls on AI developers to run these "test flights", to share their results with external researchers, and to have those results reviewed by an independent, external auditor to assess the safety of deploying an AI tool.

Test flights allow a handful of risk-loving people to try potentially dangerous technology in a controlled setting. Consider that back in 2010 one of Boeing's test flights of its 787 Dreamliner resulted in an onboard fire. Only after detecting and fixing such glitches did the plane become available for commercial use.

There’s a reason we only get on planes that have been tested and that have a fixed destination. We need to mandate test flights for AI development. We also need to determine where we expect AI to take us as a society. AI leaders may claim that it's on Congress to require such testing and planning, but the reality is that those leaders could and should self-impose such requirements.

The Wright Brothers did not force members of the public to test their planes — nor should AI developers.

From Your Site Articles

Read More
Empty Hands Music Founder Nimo Patel’s new music video, "Takin' My Time," reminds us that taking time for yourself allows us to heal and thrive. 

        Getty Images, pocketlight
    

        Musician Nimo Patel Reminds Us To Take Our Time
    
David L. Nevins
Jul 18, 2025
So far in 2025, we honored and celebrated culture as a bridge to the latest news and analysis of politics, policy, and the birth of a new civic and political voice to build greater social cohesion, civic engagement, and problem-solving.
We hope you have taken the journey with us as we shared stories, music, poetry, and dance to inspire our better angels as part of our continuing coverage of the problems and solutions of our times.
Keep ReadingShow less
Recommended
Rule of Law
        The Legal Costs and Risks of Trump’s 328 Lawsuits
    
Steve Corbin
21 May
Governance & Legislation
        Just the Facts: Are FAA Staff Cuts Causing Current Airport Delays?
    
Kristina Becvar
07 May
Governance & Legislation
        Trump’s Big Beautiful Bill: Hidden Cuts, Legislative Tricks, and Who Really Pays the Price
    
Steven Hill
12 July
Media & Technology
        They’re calling her an influencer. She’s calling it campaign strategy.
    
Jessica  Kutz, The 19th
14 July
Governance & Legislation
        As Puerto Rico’s Power Grid Crumbles, Rural Medical Patients Are Turning to Rooftop Solar
    
Lily  Carey
13 July
Governance & Legislation
        As Puerto Rico’s Power Grid Crumbles, Rural Medical Patients Are Turning to Rooftop Solar
    
Lily  Carey
12 July
As Congress considers slashing nearly a decade's worth of international assistance, the ripple effects could extend far beyond Washington's balance sheets

        Bill Track 50
    

        IssueVoter Bill of the Month (July 2025): The Global Stakes of America’s $9 Billion Budget Cut
    
Stephen Rogers
Jul 18, 2025
The Rescissions Act of 2025 was finally passed on July 18 and its implications will reverberate across continents. This $9 billion budget cut represents far more than fiscal housekeeping—it signals a fundamental retreat from America's role as the world's primary humanitarian superpower.
The bill represents a significant fiscal policy initiative that seeks to permanently cancel previously allocated but unspent federal budget authority - known as 'rescissions'. Introduced in the House on June 6, 2025, by Representative Steve Scalise and five Republican co-sponsors, this legislation implements budget rescissions proposed by President Trump on June 3, 2025, under the Congressional Budget and Impoundment Control Act of 1974. The cuts essentially codify actions taken by the Department of Government Efficiency (DOGE) over recent months - which has been criticized for appropriating congressional authority over budgetary matters by halting spending previously approved by Congress. 
Keep ReadingShow less
There are over 1000 NPR Member Station signals broadcasting across the United States
NPR

        There’s nothing “meh” about dismantling public media
    
Deanna Troust
Jul 17, 2025
This morning we woke to our local NPR affiliate, WAMU, reporting a story about how the public media network it belongs to is on the brink of losing funding, per a party-line vote in the U.S. Senate last night.  
The public media portion of the claw-back is 1.1 billion – the amount Congress previously approved to fund the Corporation for Public Broadcasting, which distributes funds to NPR, PBS and over 1500 local radio and TV stations that serve communities around the U.S. The deadline for the House to seal the deal is tomorrow – July 18. 
Keep ReadingShow less

        person with open palms below USA flaglet
    

        Photo by Samuel Schneider on Unsplash

        Following Jefferson: Promoting Inter-Generational Understanding Through Constitution-Making
    
Beau Breslin
Prairie Gunnels
Jul 17, 2025
Part III: InstitutionsIt may come as a surprise in these polarized times to learn that most Americans – on both the left and the right – agree that our system of separation of powers is laudable. Celebrated, in fact. We may distrust our political officials; we may even express deep cynicism about the entire political circus in Washington. However, we still admire the institutional structure that James Madison and his colleagues put in place when drafting the Constitution. We appreciate that there are three roughly – or supposedly – coequal branches. And we like that checks and balances are built into the country’s constitutional design.
There are several reasons for this mostly positive attitude, including a genuine sense of pride in a scheme that, for the most part, is still working. But perhaps our admiration for the country’s system of separation of powers, checks and balances, and federalism comes not from a wellspring of pride but from the simple fact that Americans have no exposure to other governing models. Most of us haven’t lived under a parliamentary system, or an oligarchy, or a monarchy—well, at least in the last 249 years. Most of us haven’t even experienced a unicameral legislature. 
Keep ReadingShow less
Load More

Site Navigation

MAGA Tension Over Why Hasn’t Trump Released the Epstein Files

The Musk Gambit: Can a Feuding Billionaire Fix American Democracy?

Supreme Court Greenlights Project 2025 Plan to Dismantle Education Department

Texas Deadly Floods: Who Is To Blame for the Devastation?

Fighting Words: On The Autocratic Capture of Education

Avoiding disaster by mandating AI testing

Latest news

Shifting the Spotlight: Trump’s Epstein Strategy Echoes His 2016 Playbook

Imagining The Path Forward for The Healthy Democracy Ecosystem

Prescribing Produce, Powering Markets: How D.C. Is Rethinking Food Access As Health Policy

AI Progress Delayed Is Progress Denied

The Top 5

Trump’s State Department Overhaul: Project 2025’s Influence on U.S. Diplomacy

Just the Facts: Canadian Tariffs

Just the Facts: Impact of the Big Beautiful Bill

Redistricting

Balance – The Golden Mean

Discover More

Read More

Musician Nimo Patel Reminds Us To Take Our Time

Recommended

The Legal Costs and Risks of Trump’s 328 Lawsuits

Just the Facts: Are FAA Staff Cuts Causing Current Airport Delays?

As Puerto Rico’s Power Grid Crumbles, Rural Medical Patients Are Turning to Rooftop Solar

As Puerto Rico’s Power Grid Crumbles, Rural Medical Patients Are Turning to Rooftop Solar

IssueVoter Bill of the Month (July 2025): The Global Stakes of America’s $9 Billion Budget Cut

When Aid Dollars Disappear: The Human Cost of Budget Politics

Refugee Assistance in a World of Displacement

The Peacekeeping Paradox: Security Through Withdrawal

Domestic Ripple Effects: Public Broadcasting and Cultural Diplomacy

The Journey to Rescission

Looking Ahead: The Intersection of Fiscal and Foreign Policy

There’s nothing “meh” about dismantling public media

Following Jefferson: Promoting Inter-Generational Understanding Through Constitution-Making

Part III: Institutions

SUGGESTION:

Part I: Introduction

Part II: Preambles

Democracy in Action: May Retrospective

Latest Stories

Start your day right!

Avoiding disaster by mandating AI testing

Latest news

Read More

Recommended

When Aid Dollars Disappear: The Human Cost of Budget Politics

Refugee Assistance in a World of Displacement

The Peacekeeping Paradox: Security Through Withdrawal

Domestic Ripple Effects: Public Broadcasting and Cultural Diplomacy

The Journey to Rescission

Looking Ahead: The Intersection of Fiscal and Foreign Policy

Part III: Institutions

SUGGESTION:

Democracy in Action: May Retrospective