Anthropic's new Claude Sonnet 5 closes the gap to the pricier Opus model series

Anthropic released Claude Sonnet 5, which beats its predecessor Sonnet 4.6 across all benchmarks and even edges past the larger Opus 4.8 on the GDPval-AA v2 knowledge work test with a score of 1,618. Anthropic is also quick to point out that the model scores far below the models the US government currently has blocked when it comes to cybersecurity tasks, a likely deliberate signal given the…
This is a summary curated by AIFuture. Read the complete article at the original source:
Read the full story on The Decoder