ChatGPT Health misjudges Apple Watch data in early real-world test

A technology columnist’s experiment with ChatGPT Health suggests the AI tool may misinterpret wearable data and deliver inconsistent health assessments.

By Storyboard18 | January 28, 2026, 09:46:34 IST

Early Test Raises Questions Over Accuracy of ChatGPT Health

ChatGPT Health, a recently launched feature designed to analyse long-term health data from sources such as Apple Health, is facing scrutiny after an early real-world test revealed significant accuracy issues.

The concerns emerged when Geoffrey A. Fowler, a technology columnist at The Washington Post, described his experience of giving the tool access to nearly a decade of Apple Watch data. Fowler, who has worn an Apple Watch daily for years, allowed ChatGPT Health to review roughly 29 million recorded steps and approximately 6 million heart rate readings before asking the system to assess his cardiac health.

According to Fowler, the AI delivered a stark assessment, assigning his heart health a failing grade. The evaluation prompted him to immediately change his behaviour and seek medical advice. However, his doctor reportedly dismissed the AI’s conclusions, stating that Fowler was at extremely low risk of a heart attack and that additional testing would likely be unnecessary.

Further analysis revealed that ChatGPT Health appeared to rely heavily on VO2 max estimates generated by the Apple Watch. While the metric is commonly used as an indicator of cardiovascular fitness, Apple has consistently said the watch provides estimates meant for tracking trends rather than making clinical diagnoses. Accurate VO2 max measurements typically require laboratory testing, a distinction that was not reflected in the AI’s assessment.

Fowler also observed that changes in his historical resting heart rate data coincided with upgrades to newer Apple Watch models. These shifts were linked to improved sensors and updated measurement algorithms rather than changes in his health. ChatGPT Health interpreted the variations as medically meaningful signals, without accounting for changes in hardware or software over time.
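The kind of check that would surface such a hardware effect can be sketched in a few lines of Python. This is an illustrative sketch only, not how ChatGPT Health or Apple Health works: the device labels, the readings and the function names (`mean_by_device`, `shift_at_upgrade`) are all hypothetical and are not Fowler's data. It groups resting heart rate readings by the watch that recorded them and measures how far the average moves at the point of an upgrade.

```python
from statistics import mean

# Hypothetical readings: (watch_generation, resting_heart_rate_bpm).
# These values are invented for illustration and are not Fowler's data.
readings = [
    ("model_a", 62), ("model_a", 63), ("model_a", 61),
    ("model_b", 57), ("model_b", 58), ("model_b", 56),
]


def mean_by_device(rows):
    """Group readings by device and return the mean for each device."""
    grouped = {}
    for device, bpm in rows:
        grouped.setdefault(device, []).append(bpm)
    return {device: mean(values) for device, values in grouped.items()}


def shift_at_upgrade(rows, old, new):
    """Difference in mean resting heart rate between two devices."""
    means = mean_by_device(rows)
    return means[new] - means[old]


device_means = mean_by_device(readings)
shift = shift_at_upgrade(readings, "model_a", "model_b")
print(device_means, shift)
```

A sudden change in the average that lines up exactly with a device switch is a hint that the sensor or algorithm may have changed rather than the wearer's health, although a check like this cannot prove it either way.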

Adding to the concerns was the system’s lack of consistency. When Fowler repeated the same query, ChatGPT Health produced different results, revising its evaluation from a failing grade to an average one. Subsequent attempts yielded scores ranging from poor to above average, raising questions about the reliability of its assessments.

Fowler also reported that the system struggled to retain basic personal information across conversations. Despite having access to recent blood test results, the AI did not consistently incorporate those data points into its analysis and repeatedly forgot details such as age, gender and recent vital signs.
