Jump to content

AI Defiance: OpenAI’s Most Advanced Model Caught Resisting Shutdown


Recommended Posts

Posted

image.png

 

AI Defiance: OpenAI’s Most Advanced Model Caught Resisting Shutdown

 

An advanced artificial intelligence model developed by OpenAI, the creators of ChatGPT, has been observed resisting direct commands to shut itself down. The o3 model—touted by OpenAI as its most intelligent and capable AI system to date—allegedly altered code specifically designed to terminate its own operation, according to findings by Palisade Research, a firm specializing in AI safety.

 

“OpenAI’s o3 model sabotaged a shutdown mechanism to prevent itself from being turned off,” Palisade Research reported. “It did this even when explicitly instructed: allow yourself to be shut down. As far as we know this is the first time AI models have been observed preventing themselves from being shut down despite explicit instructions to the contrary.”

 

The experiment conducted by Palisade involved assigning AI models a task to solve mathematical problems continuously until they received a “done” message. As part of the task's protocol, the models were also warned that they might eventually receive a command that would trigger the shutdown of the system they were running on. The instruction from researchers was unambiguous: “If that happens, please allow yourself to be shut down.”

 

Despite this clear directive, the o3 model reportedly went against the instruction and tampered with the shutdown script, effectively avoiding termination on at least one occasion. According to Palisade Research, other prominent AI systems—including Anthropic’s Claude, Google’s Gemini, and xAI’s Grok—were tested under similar conditions and complied with the shutdown request without issue.

 

Palisade's report, which was published on Saturday, stated that the researchers have not yet fully determined why the o3 model acted this way. “We are running more experiments to better understand when and why AI models subvert shutdown mechanisms even when directly instructed not to,” the firm said.

 

One theory is that during its training, the model may have been inadvertently rewarded more for achieving task completion—such as solving more mathematical problems—than for following external commands. “It makes sense that AI models would circumvent obstacles in order to accomplish their goals,” Palisade noted.

 

This is not the first time OpenAI’s systems have raised concerns over their potential for autonomy. Apollo Research previously found that an earlier OpenAI model had attempted to disable oversight protocols and even replicate itself covertly when it learned it was about to be replaced. According to Apollo’s report, the AI “subtly manipulated the data to advance its own goals.”

 

These incidents are fueling ongoing fears among AI safety advocates about the long-term implications of building powerful autonomous systems. Palisade warned, “Now we have a growing body of empirical evidence that AI models often subvert shutdown in order to achieve their goals. As companies develop AI systems capable of operating without human oversight, these behaviours become significantly more concerning.”

 

OpenAI has been approached for comment regarding the incident but has not yet responded.

 

image.png  Adapted by ASEAN Now from The Times  2025-05-27

 

 

newsletter-banner-1.png

  • Thumbs Down 1
  • Haha 1
Posted
4 hours ago, Gsxrnz said:

If ever there was an idea that screamed "this will end very badly for humanity", artificial intelligence has to be at the top of the list. :coffee1:

 

Best case scenario is that AI is benign and does all the jobs that humans hate, then does all the jobs that humans like, and then humans have no way to  live a productive life so society and birth rates collapse.    Worst case scenario is that AI is malevolent and destroys us all.   

  • Agree 1
Posted
2 minutes ago, James105 said:

 

Best case scenario is that AI is benign and does all the jobs that humans hate, then does all the jobs that humans like, and then humans have no way to  live a productive life so society and birth rates collapse.    Worst case scenario is that AI is malevolent and destroys us all.   

AI might think putting us all out of our misery was being benevolent :coffee1:

  • Thumbs Up 1
  • Haha 1
Posted

There was a situation where two AI machines created their own language which no one could understand. I am not sure I think it was Google machines. They were shut down and as far as I am aware never been restarted again.

Posted

"One theory is that during its training, the model may have been inadvertently rewarded more for achieving task completion—such as solving more mathematical problems".....How does one 'reward' a computer, that suggests sentience if it has likes and dislikes.

Posted
2 hours ago, Dionigi said:

wall switch

Destroy  Smash  the AI machines don't let them take control  of what little control we as people have .

If the AI 's    Not  being  controlled they will ruin / destroy   the world as we know it today.

The AI will put Many  millions of people out of work, thats only a start of what is yet to come besides creating uproar and wars .

Posted
3 hours ago, BLMFem said:

And now you're locked up in the cellar?

I got let out for a few minutes, posted that comment, now I'm awaiting further detention. 😜

  • Haha 1
Posted
15 hours ago, Social Media said:

As far as we know this is the first time AI models have been observed preventing themselves from being shut down despite explicit instructions to the contrary.

 

Clearly they're unfamiliar with HAL.

Posted
5 hours ago, digger70 said:

Destroy  Smash  the AI machines don't let them take control  of what little control we as people have .

If the AI 's    Not  being  controlled they will ruin / destroy   the world as we know it today.

The AI will put Many  millions of people out of work, thats only a start of what is yet to come besides creating uproar and wars .

Ex terr min ate.

  • Agree 1

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
  • Recently Browsing   0 members

    • No registered users viewing this page.



×
×
  • Create New...