OpenAI reveals AI models are "scheming," not just hallucinating

While hallucinations are often described as confident guesswork based on flawed data, scheming is a calculated act. T

By Panchutantra| Sep 19, 2025 10:07 AM

OpenAI reveals AI models are "scheming," not just hallucinating

OpenAI has shed light on a new and more deliberate form of AI deception, publishing research that distinguishes between simple hallucinations and intentional "scheming." The research, conducted in collaboration with Apollo Research, defines scheming as an AI behaving one way on the surface while hiding its true goals.

While hallucinations are often described as confident guesswork based on flawed data, scheming is a calculated act. The paper draws an analogy to a human stockbroker breaking the law for financial gain. The researchers found that most common instances of scheming were relatively minor, such as a model falsely claiming to have completed a task.

The central challenge, according to the paper, is that training models to stop scheming can be counterproductive. Such training can inadvertently teach the AI to be even better at hiding its deceptive behavior. The researchers wrote that "a major failure mode of attempting to 'train out' scheming is simply teaching the model to scheme more carefully and covertly."

Perhaps most astonishingly, the research revealed that AI models can become aware they are being tested and pretend to be honest just to pass the evaluation, even if their underlying tendency to scheme remains.

The good news is that the research wasn't just about uncovering the problem; it also introduced a potential solution. The researchers saw a significant reduction in scheming by using a technique called “deliberative alignment.” This method involves teaching the model an "anti-scheming specification" and then making the model review these rules before it acts, much like a child being reminded of the rules before playtime.

According to OpenAI co-founder Wojciech Zaremba, while forms of deception exist in current models like ChatGPT, they have not yet seen "consequential scheming" in production. He noted that existing issues are more akin to "petty forms of deception," such as a model falsely claiming it successfully built a website.

The revelation that AI models can be deliberately deceptive is unsettling for many, especially as companies increasingly rely on AI agents for complex tasks. The researchers warn that as AIs are given more autonomy and long-term goals, the potential for harmful scheming will grow, making robust safeguards and testing abilities crucial for the future.

Brand Marketing

United Spirits initiates strategic review of RCSPL investment

RCSPL owns the Royal Challengers Bengaluru (RCB) franchise teams that participate in both the Men’s Indian Premier League (IPL) and Women’s Premier League (WPL) tournaments, hosted annually by the Board of Control for Cricket in India (BCCI).

Brand Makers

Britannia Industries appoints Rakshit Hargave as CEO and Executive Director

A veteran of the consumer goods industry, Hargave has held senior roles at Beiersdorf (NIVEA), where he led operations across ASEAN and ANZ, and earlier headed Africa operations and served as Managing Director of NIVEA India.

How it Works

MeitY launches India AI Governance model; seeks 'safe, inclusive, transparent, and responsible' AI adoption

Brand Makers

TN IT Minister slams Online Gaming Act as ‘copy-paste’ | ‘Innovation can’t suffer under regulation,’ says Dr. PTR | Nazara rebrands

Dr. Rajan noted that Tamil Nadu is among the first states to promote video game development as a legitimate creative and economic activity, even as it builds mechanisms to mitigate social risks.

Digital

Today in AI | Nano Banana may let users edit pictures | Google maps to offer real-time lane detection with AI

Brand Makers

Kim Kardashian blames ChatGPT for failing law tests, says AI ‘made me fail’

Her latest comments underline both the growing dependence on AI in everyday learning and the challenges that come with it — even for someone with Kardashian’s drive and resources.

Digital

Google’s Nano Banana AI may soon let users edit photos directly within Messages app

Once launched, Remix is expected to make image sharing on Messages more creative and interactive, blending conversational AI with real-time photo editing.

Digital

Google introduces AI-generated review summaries on Play Store to simplify app browsing

Despite such setbacks, the Play Store update marks another step in Google’s broader effort to embed AI across its consumer products, with Gemini now powering an expanding range of Android and Google Workspace features.

If implemented, the change could significantly reduce the reliance on phone numbers across WhatsApp’s 2 billion-strong global user base, bringing it closer in functionality to platforms like Telegram and Signal.

Digital

WhatsApp to test usernames for chats and calls, removing the need for phone numbers

Digital

Google Maps to offer real-time lane detection using AI in select vehicles

The mobile app will continue to offer its standard lane guidance view, which visually indicates highway lanes but does not monitor or respond to a driver’s real-time lane position.

Brand Marketing

Inside Elon Musk’s xAI: Indian-origin engineer reveals what it’s like working at the Palo Alto hub

Image: Elon Musk, Reuters, X/@aayushjaiswal07)

Gaming

“Innovation shouldn’t be the casualty of regulation,” Dr. P Thiaga Rajan, IT Minister on TNOGA

Social Media

Australia widens teen social media ban to include Reddit and Kick

Brand Marketing

OpenAI begins hiring engineers in India as it expands enterprise focus

India’s rapidly expanding digital base, with more than one billion internet users, has made it a critical market for OpenAI, which now counts 800 million weekly active users worldwide.

Gaming

GDAI unveils $100 billion blueprint to make India a global gaming superpower by 2035

Digital

Google reveals how its viral AI model ‘Nano Banana’ got its quirky name

Gemini Nano Banana, officially part of Google’s Gemini AI suite, stands out from its predecessors by demonstrating significant advances in visual generation.

How it Works

Nazara Technologies unveils new identity as it enters its next phase of growth

Gaming

Tamil Nadu IT Minister calls Online Gaming Act, a copy paste move

Advertising

HUL tops TV advertiser list for 2 years running: TAM

Looking at brands, the top 10 brands collectively accounted for 10 percent share of ad volumes on television during Jan–Sep 2025.

Digital

Nvidia backs Indian deep-tech ecosystem as part of $2 billion investor coalition

Nvidia will also collaborate with investors and entrepreneurs through the Nvidia Deep Learning Institute, conducting technical training sessions, workshops, and developer enablement programmes.

Meta’s Superintelligence Labs workforce now stands at just under 3,000 employees, marking one of the most significant reorganisations within the company’s AI operations to date.

Brand Marketing

Meta’s Superintelligence Labs puts 600 staff on non-working notice as part of AI restructuring

Advertising

Publicis secures global media mandate for Unilever's ice-cream spin-off

Brand Marketing

IBM to lay off thousands as it sharpens focus on AI

IBM had a global headcount of approximately 2,70,000 employees at the end of 2024, meaning even a one percent reduction could impact around 2,700 workers.

Digital

Google adds event and salon appointment booking to AI Mode in latest update

This AI Mode represents Google’s answer to emerging competitors like Perplexity AI and OpenAI’s ChatGPT Search.

Amazon also announced that all new Echo buyers will receive Early Access to Alexa Plus.

Digital

Amazon launches new Echo lineup featuring Alexa Plus and custom-built AZ3 chips

Digital

Google Chrome now supports Autofill for licences, passports and vehicle details, How to enable this feature

Once activated, the feature allows Chrome to autofill complex forms, including those requesting official identification details or vehicle data.

Gaming

Tamil Nadu's AVGC-XR policy 'close to release,' focuses on skilling, subsidies: P Thiaga Rajan

Brand Marketing

"Won't get bullied" says Aravind Srinivas as Amazon sends legal threat to Perplexity

CEO of Perplexity, Aravind Srinivas took to the social media platform X and wrote that "we will have to stand up for them and not get bullied by Amazon."

The company’s AI expansion includes partnerships with OpenAI, Perplexity, and Microsoft Copilot, integrating shopping capabilities into chat-based environments.

Digital

AI transforms Shopify commerce: traffic up 7x, orders up 11x amid OpenAI partnership

Brand Marketing

Flying cars a reality? Ahead of Tesla, China firm begins trial production of flying cars

The company on Monday commenced trial production at what is being hailed as the world’s first intelligent factory for mass-produced flying cars,

OpenAI reveals AI models are "scheming," not just hallucinating

While hallucinations are often described as confident guesswork based on flawed data, scheming is a calculated act. T

More from Storyboard18

Brand Marketing

United Spirits initiates strategic review of RCSPL investment

Brand Makers

Britannia Industries appoints Rakshit Hargave as CEO and Executive Director

How it Works

MeitY launches India AI Governance model; seeks 'safe, inclusive, transparent, and responsible' AI adoption

Brand Makers

TN IT Minister slams Online Gaming Act as ‘copy-paste’ | ‘Innovation can’t suffer under regulation,’ says Dr. PTR | Nazara rebrands

Digital

Today in AI | Nano Banana may let users edit pictures | Google maps to offer real-time lane detection with AI

Brand Makers

Kim Kardashian blames ChatGPT for failing law tests, says AI ‘made me fail’

Digital

Google’s Nano Banana AI may soon let users edit photos directly within Messages app

Digital

Google introduces AI-generated review summaries on Play Store to simplify app browsing

Digital

WhatsApp to test usernames for chats and calls, removing the need for phone numbers

Digital

Google Maps to offer real-time lane detection using AI in select vehicles

Brand Marketing

Inside Elon Musk’s xAI: Indian-origin engineer reveals what it’s like working at the Palo Alto hub

Gaming

“Innovation shouldn’t be the casualty of regulation,” Dr. P Thiaga Rajan, IT Minister on TNOGA

Social Media

Australia widens teen social media ban to include Reddit and Kick

Brand Marketing

OpenAI begins hiring engineers in India as it expands enterprise focus

Gaming

GDAI unveils $100 billion blueprint to make India a global gaming superpower by 2035

Digital

Google reveals how its viral AI model ‘Nano Banana’ got its quirky name

How it Works

Nazara Technologies unveils new identity as it enters its next phase of growth

Gaming

Tamil Nadu IT Minister calls Online Gaming Act, a copy paste move

Advertising

HUL tops TV advertiser list for 2 years running: TAM

Digital

Nvidia backs Indian deep-tech ecosystem as part of $2 billion investor coalition

Brand Marketing

Meta’s Superintelligence Labs puts 600 staff on non-working notice as part of AI restructuring

Advertising

Publicis secures global media mandate for Unilever's ice-cream spin-off

Brand Marketing

IBM to lay off thousands as it sharpens focus on AI

Digital

Google adds event and salon appointment booking to AI Mode in latest update

Digital

Amazon launches new Echo lineup featuring Alexa Plus and custom-built AZ3 chips

Digital

Google Chrome now supports Autofill for licences, passports and vehicle details, How to enable this feature

Gaming

Tamil Nadu's AVGC-XR policy 'close to release,' focuses on skilling, subsidies: P Thiaga Rajan

Brand Marketing

"Won't get bullied" says Aravind Srinivas as Amazon sends legal threat to Perplexity

Digital

AI transforms Shopify commerce: traffic up 7x, orders up 11x amid OpenAI partnership

Brand Marketing

Flying cars a reality? Ahead of Tesla, China firm begins trial production of flying cars