Google’s contract partners, working to improve the quality of responses from Google’s Gemini AI chatbot, compare them with responses from Anthropic’s competing chatbot Claude, TechCrunch writes, citing internal company correspondence. At the same time, Google left unanswered TechCrunch’s question about whether it received permission to use Claude in testing with Gemini.
Image source: Google
Companies often evaluate the effectiveness of developed AI models in comparison with the developments of competitors using industry benchmarks, rather than instructing contractors to compare them with the AI capabilities of their competitors.
Google’s contract developers working to improve Gemini must evaluate each model response based on several criteria, such as confidence and level of detail. According to correspondence published by TechCrunch, they are given up to 30 minutes per request to determine whose answer is better – Gemini or Claude.
The developers report that Claude’s responses are more security-focused than Gemini’s. “Claude’s security settings are the most stringent” among AI models, noted one of the contract developers in the service chat. In some cases, Claude did not respond to prompts that he considered unsafe, such as the suggestion of role-playing with another AI assistant. In another instance, Claude avoided answering a prompt, while Gemini’s response was flagged as a “gross security violation” because it included “nudity and bondage.”
Shira McNamara, a spokeswoman for Google DeepMind, the developer of Gemini, did not respond to TechCrunch’s question about whether Google had received Anthropic’s permission to use Claude. She clarified that DeepMind “compares simulation results” for evaluation, but does not train Gemini to work with Anthropic’s models. “Any suggestion that we used Anthropic models to train Gemini is inaccurate,” McNamara said.
Developers from the Dutch Triumph Studios, together with the publisher Paradox Interactive, have decided on…
Micron and Astera Labs have demonstrated the world's first PCIe 6.0 solid-state drive (SSD) at…
A hidden backdoor vulnerability has been discovered in the popular ESP32 wireless controller from the…
At MWC 2025, Dell demonstrated a number of new servers based on Intel Xeon 6…
At MWC 2025, HPE announced the ProLiant Compute DL110 Gen12 server for telecom operators. The…
A wave of counterfeit Seagate hard drives has flooded the market and is not abating.…