BACK TO MAGAZINE
News9 September 2025

OpenAI Admits GPT-5 Still Hallucinates: Why Even the Most Advanced AI Gets Things Wrong

Even the most powerful AI models aren’t immune to mistakes. OpenAI has openly acknowledged that GPT-5—the company’s latest and most advanced large language model—still suffers from hallucinations, where the AI generates answers that sound credible but are factually incorrect. What “Hallucinations” Mean in AI In AI terms, a hallucination occurs when a chatbot confidently provides […]

OpenAI Admits GPT-5 Still Hallucinates: Why Even the Most Advanced AI Gets Things Wrong

Even the most powerful AI models aren’t immune to mistakes. OpenAI has openly acknowledged that GPT-5—the company’s latest and most advanced large language model—still suffers from hallucinations, where the AI generates answers that sound credible but are factually incorrect.

What “Hallucinations” Mean in AI

In AI terms, a hallucination occurs when a chatbot confidently provides an answer that looks and feels correct but is simply untrue. OpenAI explains it this way: “GPT-5 has significantly fewer hallucinations, especially when reasoning⁠, but they still occur. Hallucinations remain a fundamental challenge for all large language models.”

That means even a casual question—like asking about an author’s dissertation title or birth date—can trip the system into giving multiple, conflicting, and ultimately wrong answers.

Why GPT-5 Still Gets Things Wrong

According to OpenAI, hallucinations stem from the way AI models are trained. Large language models like GPT-5 are built by predicting the next word in a sentence using massive amounts of text data. While this process helps them master grammar, spelling, and sentence structure, it doesn’t verify whether the information is actually true. As a result, rare or specific facts are especially error-prone.

The company also points to the way models are evaluated. Current benchmarks reward accuracy but don’t penalize confident mistakes. This sets up a perverse incentive: rather than admitting “I don’t know,” models are nudged to make their best guess—much like a student guessing on a multiple-choice test, since leaving the answer blank would score zero.

“When models are graded only on accuracy, they are encouraged to guess rather than say ‘I don’t know,’” OpenAI explained.

The Bigger Picture: Accuracy vs. Confidence

OpenAI argues that if evaluation methods continue rewarding “lucky guesses,” AI systems will keep producing hallucinations. The solution, they suggest, is to rethink evaluation itself—penalizing overconfident errors more heavily and rewarding expressions of uncertainty when appropriate.

That shift could help align AI with human expectations, making models more trustworthy in high-stakes areas like healthcare, legal advice, or scientific research where confidently wrong answers can do real harm.

Why It Matters

For everyday users, GPT-5’s hallucinations highlight an important reality: no matter how fluent or convincing an AI sounds, its answers should never be taken as unquestionable truth. At its core, a chatbot is a powerful word-predictor—not a fact-checking oracle.

The takeaway? AI continues to evolve, but skepticism and human oversight remain critical. The more we understand the limits of these systems, the better we can use them responsibly.

Do you think AI should be trained to say “I don’t know” more often—even if it feels less human-like? Share your thoughts in the comments.

0
INTELLIGENCE SOURCE:INVENTRIUM RESEARCH
MORE INTELLIGENCE

Continue the Exploration

Fewer Handshakes, Bigger Cheques: Inside Africa's $887M Sprint Toward a $1 Billion Half-Year
11 June 2026

Fewer Handshakes, Bigger Cheques: Inside Africa's $887M Sprint Toward a $1 Billion Half-Year

$1.3 Billion in Six Months: Africa's Tech Ecosystem Is Playing a Completely Different Game in 2026
9 June 2026

$1.3 Billion in Six Months: Africa's Tech Ecosystem Is Playing a Completely Different Game in 2026

$920 Million a Month: Why Google Is Renting Elon Musk's Computers to Power Its AI
9 June 2026

$920 Million a Month: Why Google Is Renting Elon Musk's Computers to Power Its AI