1
Feature Story
Google is Using Anthropic's Claude To Improve Its Gemini AI - Slashdot
Dec 24, 2024 · slashdot.orgThe contractors involved in this evaluation process are responsible for rating the accuracy of Gemini's outputs based on various criteria, such as truthfulness and verbosity. They are allotted up to 30 minutes per prompt to decide which model, Gemini or Claude, provides a better response. This meticulous evaluation method underscores the effort to improve AI models by directly comparing them with rival technologies.
Key takeaways
- Contractors are comparing Google's Gemini AI outputs against Anthropic's Claude model.
- Google did not confirm if it obtained permission to use Claude for testing against Gemini.
- AI models are typically evaluated against competitors using industry benchmarks.
- Contractors have up to 30 minutes per prompt to assess the accuracy and quality of responses from Gemini and Claude.