Here’s how OpenAI plans to cleanse ChatGPT of false information

OpenAI aims to reduce AI hallucinations in ChatGPT by enhancing math skills, as process supervision shows promise in improving accuracy.

On May 31, OpenAI announced its efforts to enhance ChatGPT’s mathematical problem-solving capabilities, aiming to reduce instances of artificial intelligence (AI) hallucinations. OpenAI emphasized mitigating hallucinations as a crucial step toward developing aligned AI.

In March, the introduction of the latest version of ChatGPT — ChatGPT-4 — further propelled AI into the mainstream. However, generative AI chatbots have long grappled with factual accuracy, occasionally generating false information, commonly referred to as “hallucinations.“ The efforts to reduce these AI hallucinations were announced through a post on OpenAI’s website.

AI hallucinations refer to instances where artificial intelligence systems generate factually incorrect outputs, misleading or unsupported by real-world data. These hallucinations can manifest in various forms, such as generating false information, making up nonexistent events or people, or providing inaccurate details about certain topics.

OpenAI conducted research to examine the effectiveness of two types of feedback: “outcome supervision” and “process supervision.“ Outcome supervision involves feedback based on the final result, while process supervision provides input for each step in a chain of thought. OpenAI evaluated these models using math problems, generating multiple solutions and selecting the highest-ranked solution according to each feedback model.

After thorough analysis, the research team found that process supervision yielded a superior performance as it encouraged the model to adhere to a human-approved process. In contrast, outcome supervision proved more challenging to scrutinize consistently.

OpenAI recognized that the implications of process supervision extend beyond mathematics, with further investigation necessary to understand its effects in different domains. It expressed the possibility that if the observed outcomes hold in broader contexts, process supervision could offer a favorable combination of performance and alignment compared with outcome supervision. To facilitate research, the company publicly released the complete data set of process supervision, inviting exploration and study in this area.

Although OpenAI did not provide explicit instances that prompted its investigation into hallucinations, two recent occurrences exemplified the problem in real-life scenarios.

In a recent incident, lawyer Steven Schwartz in the Mata vs. Avianca Airlines case acknowledged relying on the chatbot as a research resource. However, the information provided by ChatGPT turned out to be entirely fabricated, highlighting the issue at hand.

OpenAI’s ChatGPT is not the only example of artificial intelligence systems encountering hallucinations. During a demonstration of its chatbot technology in March, Microsoft’s Bing AI chatbot examined earnings reports and generated inaccurate figures for companies like Gap and Lululemon.

Magazine: 25K traders bet on ChatGPT’s stock picks, AI sucks at dice throws, and more

M	T	W	T	F	S	S
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31

SEC Unveils Pro-Crypto Task Force As Trump Takes Office

Is It Too Late to Invest in Crypto? Here’s What Experts Say

Top Cryptos for Long-Term Investment: Why RWAs Are a Winning Bet

Exploring What Is a BTC Wallet Address and Its Significance

eToro Has Filed Confidentially for an IPO

VPTrade Review: A Detailed Look at Its Offerings for 2025

Hidden fees in Crypto Trading and How They’re Hurting your Portfolio

Zignaly Founders Extend $ZIG Token Lock-Up for Another Year

Orbitt: Revolutionizing Trading & Project Development on Solana With AI

Nuant Introduces Comprehensive Digital Asset Management Platform

Why is QuantWise the Best AI Crypto Trading Tool?

Best Bitcoin & Crypto Trading Bots: The Ultimate Guide

You may have missed

Crypto Meets Real Estate: Propy Lets You Buy Homes With Bitcoin and Ethereum

Tornado Cash Co-Founder Speaks Out: ‘This Is a Fight for Every Developer’ as DOJ Threatens 45-Year Prison Sentence

XRP Market Update: Traders Brace for Action as Key Levels Tighten

NFTs Hit $187M: Ethereum Sales Soar While Bitcoin Slips in a Tumultuous Week

New Meme Coin Flockerz to Launch Tomorrow After $13M Presale – Last Chance to Buy FLOCK Before Listing

More Stories

Leave a Reply Cancel reply

You may have missed