ChatGPT and other AI chatbots are poor at summarizing the news, BBC study finds

Four of the world’s most popular AI chatbots make too many mistakes when summarizing news stories, a BBC study has found: more than half of their responses had significant problems.


In the experiment, BBC journalists asked OpenAI’s ChatGPT, Microsoft Copilot, Google Gemini, and Perplexity to summarize 100 of the broadcaster’s news stories, then rated the accuracy of each response. The study found that “51% of all AI responses to news-related questions were rated as having some form of significant problem.” Additionally, “19% of AI responses to BBC stories contained factual errors, such as incorrect factual statements, numbers, and dates.”

Google’s Gemini chatbot, in particular, radically distorted a statement from the UK’s National Health Service, while ChatGPT and Copilot described politicians who had already left office as if they were still serving. The careless handling of information is systemic, the British journalists point out: the chatbots had difficulty distinguishing opinion from fact, editorialized, and often left out important context. Not all AI systems performed equally in the study: “Microsoft Copilot and Google Gemini have more significant problems than OpenAI ChatGPT and Perplexity,” the BBC concluded. Earlier, Apple temporarily disabled the news summaries feature of its Apple Intelligence package in iOS 18.3.

The experiment has once again shown that information from AI chatbots should be taken with a grain of salt. AI is developing rapidly, new large language models are released almost every week, and with so much data being processed, errors are inevitable. On the other hand, “hallucinations,” that is, confidently stated but fabricated answers, are now less common in advanced systems than before. AI is progressing faster than Moore’s Law would suggest, OpenAI CEO Sam Altman recently wrote on his personal blog. But for now it is still too early to trust chatbots, especially when it comes to news.

Published by
admin
