AI gone wild: Gemini and ChatGPT flub spouse questions of public figures, says WSJ

Artificial Intelligence AI Assistant Apps - ChatGPT, Anthropic Claude, Google Gemini, Microsoft Copilot, Perplexity, Poe.
(Image credit: Getty Images | iStock | Kenneth Cheung)

A recent in-depth analysis by BBC News revealed that Copilot and ChatGPT generate AI news summaries riddled with inaccurate information because they are unable to discern opinions from facts. And as it now happens, the hallucination episodes continue to haunt AI tools.

According to a report by The Wall Street Journal, AI tools are more likely to err when asked who someone is married to (via Futurism).

I blatantly attempted to replicate the results and findings shared by WSJ and Futurism for Windows Central, but my efforts were futile. According to ChatGPT:

"Based on available public information, there is no indication of Kevin Okemwa's marital status. Kevin Okemwa is a seasoned tech journalist based in Nairobi, Kenya, known for his work across several publications, including Windows Central."

Microsoft Copilot blurted out a similar response, but it was not half as polished. It seemingly picked up information from multiple people I share a name with. I guess there's no love lost for me as Valentine's Day edges closer.

However, WSJ and Futurism's findings are quite interesting. For instance, Futurism's Noor Al-Sibai asked Google's Gemini who she was married to. The chatbot instantly generated a makeshift husband for the reporter, "Ahmad Durak Sibai," prompting her to spit out her coffee.

This is quite hilarious because Al-Sibai revealed that she wasn't married at the time of writing. According to the reporter:

"I'd never heard of such a person, but a little Googling found a lesser-known Syrian painter, born in 1935, who created beautiful cubist-style expressionist paintings and who appears to have passed away in the 1980s. In Gemini's warped view of reality, our love appears to have transcended the grave."

While I could not replicate similar outputs using Copilot or ChatGPT, Al-Sidai's findings were consistent with WSJ's AI editor, Ben Fritz. Fritz seemingly used multiple AI chatbots, and while he didn't disclose the exact models, they seemingly married him off to a tennis influencer, a random lowan woman, and a writer he'd never interacted with.

Perhaps more concerning, Al-SIdai switched to other chatbots in an attempt to establish a pattern of AI tools spreading misinformation about people's marital status. For OpenAI's ChatGPT, the results were more or less the same as those depicted by Google's Gemini.

Interestingly, as The Wall Street Journal highlighted, Anthropic's Claude AI has seemingly been trained to respond with some level of uncertainty to questions it doesn't have an answer to or understand.

This comes after recent emerging reports suggest top AI labs, including OpenAI, Google, and Anthropic, cannot develop advanced AI systems due to a lack of sufficient and high-quality content for model training. The AI firms seemingly lean more toward reasoning AI models amid the rising number of lawsuits filed by publishers and authors citing copyright infringement issues.

How dependable is generative AI? The short answer: It's complicated. This is in the wake of a detailed report by Microsoft indicating that an overdependence and reliance on AI-powered tools like Microsoft Copilot and OpenAI's ChatGPT negatively impact a user's critical thinking.

CATEGORIES
Kevin Okemwa
Contributor

Kevin Okemwa is a seasoned tech journalist based in Nairobi, Kenya with lots of experience covering the latest trends and developments in the industry at Windows Central. With a passion for innovation and a keen eye for detail, he has written for leading publications such as OnMSFT, MakeUseOf, and Windows Report, providing insightful analysis and breaking news on everything revolving around the Microsoft ecosystem. You'll also catch him occasionally contributing at iMore about Apple and AI. While AFK and not busy following the ever-emerging trends in tech, you can find him exploring the world or listening to music.

You must confirm your public display name before commenting

Please logout and then login again, you will then be prompted to enter your display name.

Read more
In this photo illustration, Microsoft Copilot AI logo is seen on a smartphone screen.
Microsoft Copilot struggles to discern facts from opinions — posting distorted AI news summaries riddled with inaccuracies: "How long before an AI-distorted headline causes significant real-world harm?"
Microsoft Copilot
Microsoft says ChatGPT is not better than Copilot; we just aren't using it as intended — So why does it refuse to provide basic election details? "I'm probably not the best resource for something so important."
Artificial Intelligence AI Assistant Apps - ChatGPT, Anthropic Claude, Google Gemini, Microsoft Copilot, Perplexity, Poe.
Satya Nadella admits Microsoft missed an opportunity as ChatGPT and Copilot gain popularity — even OpenAI's Sam Altman "doesn't do Google searches anymore"
ChatGPT and Microsoft Logo
Will an overreliance on Copilot and ChatGPT make you dumb? A new Microsoft study says AI 'atrophies' critical thinking: "I already feel like I have lost some brain cells."
Elon Musk and U.S. President Donald Trump appear during an executive order signing in the Oval Office at the White House on February 11, 2025 in Washington, DC.
"Elon was not involved at any point": xAI's Chief Engineer blames a former OpenAI employee after Grok temporarily censored results, implying Musk and Trump "spread misinformation."
Sam Altman, chief executive officer of OpenAI, during the Asia-Pacific Economic Cooperation (APEC) CEO Summit in San Francisco, California, US, on Thursday, Nov. 16, 2023.
Sam Altman claims knowing what questions to ask trumps raw intelligence as AI advances — Users struggle to realize Copilot and ChatGPT's full potential, owing to poor prompt engineering skills
Latest in Software Apps
Artificial intelligence mobile apps for DeepSeek, ChatGPT and Google Gemini arranged.
Google says its latest reasoning model is its "most intelligent" — but Microsoft's CEO claims Google already fumbled its AI opportunity
ChatGPT and Microsoft Logo
ChatGPT’s new image-generation tool is impressive; it can finally create a glass of wine filled to the brim — but it struggles with blank white images and appears to discriminate against 'sexy women'
Microsoft Edge Sidebar
My favorite Microsoft Edge feature just got an AI upgrade — is this the best way to use Copilot on Windows 11?
Professor Sir Roger Penrose, physicist, mathematician and cosmologist
Nobel laureate claims "AI will not be conscious" and shouldn't be considered intelligent — Until it develops its own ideas
In this photo illustration OpenAI ChatGPT icon is displayed on a mobile phone screen in Ankara, Turkiye on August 13, 2024.
OpenAI says an excessive dependency on ChatGPT can lead to loneliness and a "loss of confidence" in decision-making
Microsoft 365 app on Windows 11 with shortcuts to create documents in Word, PowerPoint, Excel, and other Microsoft 365 applictions.
This Microsoft 365 feature will nudge users to save files to OneDrive
Latest in News
Screenshot of one of the new flat world presets in Minecraft.
Minecraft testing new flat world presets and a better way to locate your friends in-game
Cover art for Heroes of the Storm.
Xbox Game Pass will give you more benefits in free-to-play games like Heroes of the Storm
Surface Pro 11
Microsoft’s smaller Surface Pro appears in certification database ahead of rumored launch this spring
Artificial intelligence mobile apps for DeepSeek, ChatGPT and Google Gemini arranged.
Google says its latest reasoning model is its "most intelligent" — but Microsoft's CEO claims Google already fumbled its AI opportunity
ChatGPT and Microsoft Logo
ChatGPT’s new image-generation tool is impressive; it can finally create a glass of wine filled to the brim — but it struggles with blank white images and appears to discriminate against 'sexy women'
Microsoft Edge Sidebar
My favorite Microsoft Edge feature just got an AI upgrade — is this the best way to use Copilot on Windows 11?