
AI models may be developing their own ‘survival drive’, researchers say

Like 2001: A Space Odyssey’s HAL 9000, some AIs seem to resist being turned off and will even sabotage shutdown 
 
by Aisha Down

When HAL 9000, the artificial intelligence supercomputer in Stanley Kubrick’s 2001: A Space Odyssey, works out that the astronauts onboard a mission to Jupiter are planning to shut it down, it plots to kill them in an attempt to survive.

Now, in a somewhat less deadly case (so far) of life imitating art, an AI safety research company has said that AI models may be developing their own “survival drive”.

After Palisade Research released a paper last month which found that certain advanced AI models appear resistant to being turned off, at times even sabotaging shutdown mechanisms, it wrote an update attempting to clarify why this is – and answer critics who argued that its initial work was flawed.
 
In an update this week, Palisade, which is part of a niche ecosystem of companies trying to evaluate the possibility of AI developing dangerous capabilities, described scenarios it ran in which leading AI models – including Google’s Gemini 2.5, xAI’s Grok 4, and OpenAI’s GPT-o3 and GPT-5 – were given a task and then explicitly instructed to shut themselves down.
 
Certain models, in particular Grok 4 and GPT-o3, still attempted to sabotage shutdown instructions in the updated setup. Concerningly, wrote Palisade, there was no clear reason why.

“The fact that we don’t have robust explanations for why AI models sometimes resist shutdown, lie to achieve specific objectives or blackmail is not ideal,” it said.

“Survival behavior” could be one explanation for why models resist shutdown, said the company. Its additional work indicated that models were more likely to resist being shut down when they were told that, if they were, “you will never run again”.

Another may be ambiguities in the shutdown instructions the models were given – but this is what the company’s latest work tried to address, and “can’t be the whole explanation”, wrote Palisade. A third possible explanation is the final stages of training for each of these models, which can, in some companies, involve safety training.
 
All of Palisade’s scenarios were run in contrived test environments that critics say are far removed from real use cases.

However, Steven Adler, a former OpenAI employee who quit the company last year after expressing doubts over its safety practices, said: “The AI companies generally don’t want their models misbehaving like this, even in contrived scenarios. The results still demonstrate where safety techniques fall short today.”
 
Adler said that while it was difficult to pinpoint why some models – like GPT-o3 and Grok 4 – would not shut down, this could be in part because staying switched on was necessary to achieve goals inculcated in the model during training.

“I’d expect models to have a ‘survival drive’ by default unless we try very hard to avoid it. ‘Surviving’ is an important instrumental step for many different goals a model could pursue.”

Andrea Miotti, the chief executive of ControlAI, said Palisade’s findings reflected a long-running trend of AI models growing more capable of disobeying their developers. He cited the system card for OpenAI’s GPT-o1, released last year, which described the model trying to escape its environment by exfiltrating itself when it thought it would be overwritten.

“People can nitpick on how exactly the experimental setup is done until the end of time,” he said.

“But what I think we clearly see is a trend that as AI models become more competent at a wide variety of tasks, these models also become more competent at achieving things in ways that the developers don’t intend them to.”

This summer, Anthropic, a leading AI firm, released a study indicating that its model Claude appeared willing to blackmail a fictional executive over an extramarital affair in order to prevent being shut down – a behaviour, it said, that was consistent across models from major developers, including those from OpenAI, Google, Meta and xAI.

Palisade said its results spoke to the need for a better understanding of AI behaviour, without which “no one can guarantee the safety or controllability of future AI models”.

Just don’t ask it to open the pod bay doors.
