Industry · The Decoder

GPT and Claude failed Bridgewater's finance tests because the right answers were never public

Bridgewater and Thinking Machines Lab, the startup from former OpenAI CTO Mira Murati, have fine-tuned a Qwen3-235B model for financial tasks. According to their own testing, the model reaches 84.7 percent accuracy, beating Gemini, Claude, and GPT at roughly one-fourteenth of the cost. No one outside the two companies has verified these numbers, though.

This is a summary curated by AIFuture. Read the complete article at the original source:

Read the full story on The Decoder

