Tech

Why AI Chatbots Hallucinate, According to OpenAI Researchers

By arthursheikin@gmail.com · September 5, 2025 · 2 Mins Read


OpenAI researchers claim they’ve cracked one of the biggest obstacles to large language model performance — hallucinations.

Hallucinations occur when a large language model generates inaccurate information that it presents as fact. They plague the most popular LLMs, from OpenAI’s GPT-5 to Anthropic’s Claude.

OpenAI’s baseline finding, made public in a paper released on Thursday, is that large language models hallucinate because standard training and evaluation methods reward guessing over admitting uncertainty.

In other words, LLMs are being told to fake it till they make it. Some are better than others, however. In a blog post last month, OpenAI said that Claude models are more “aware of their uncertainty and often avoid making statements that are inaccurate.” It also noted that Claude’s high refusal rates risked limiting its utility.

“Hallucinations persist due to the way most evaluations are graded — language models are optimized to be good test-takers, and guessing when uncertain improves test performance,” the researchers wrote in the paper.

Large language models are essentially always in “test-taking mode,” answering questions as if everything in life were binary — right or wrong, black or white.

In many ways, they’re not equipped for the realities of life, where uncertainty is more common than certainty, and true accuracy is not a given.


“Humans learn the value of expressing uncertainty outside of school, in the school of hard knocks. On the other hand, language models are primarily evaluated using exams that penalize uncertainty,” the researchers wrote.

The good news is that there is a fix, and it has to do with redesigning evaluation metrics.

“The root problem is the abundance of evaluations that are not aligned,” they wrote. “The numerous primary evaluations must be adjusted to stop penalizing abstentions when uncertain.”

In a blog post about the paper, OpenAI elaborated on what this type of adjustment would entail.

“The widely used, accuracy-based evals need to be updated so that their scoring discourages guessing. If the main scoreboards keep rewarding lucky guesses, models will keep learning to guess,” OpenAI said.
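The incentive problem the researchers describe can be made concrete with a toy expected-score calculation (an illustration, not taken from the paper): under accuracy-only grading, a guess with any nonzero chance of being right always beats abstaining, while a scheme that penalizes wrong answers and gives partial credit for "I don't know" flips the incentive.

```python
# Toy illustration (not from the paper): expected score for a model that is
# only p-confident in its answer, compared under two grading schemes.

def expected_score(p_correct: float, wrong_penalty: float, abstain_credit: float):
    """Return (expected score if guessing, score if abstaining)."""
    guess = p_correct * 1.0 + (1 - p_correct) * (-wrong_penalty)
    abstain = abstain_credit
    return guess, abstain

p = 0.3  # model is only 30% confident in its answer

# Accuracy-only grading: wrong answers cost nothing, abstaining earns nothing.
guess, abstain = expected_score(p, wrong_penalty=0.0, abstain_credit=0.0)
assert guess > abstain  # guessing always wins, so models learn to guess

# Grading that penalizes confident errors and credits admitting uncertainty.
guess, abstain = expected_score(p, wrong_penalty=1.0, abstain_credit=0.1)
assert abstain > guess  # now expressing uncertainty is the better strategy
```

The penalty and credit values here are arbitrary; the point is only that whenever wrong answers are free, guessing strictly dominates abstaining, which is the misalignment the paper says mainstream benchmarks must correct.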

OpenAI did not immediately respond to a request for comment from Business Insider.
