Optiver heeft dit gerepost
Claude Opus 4.8 is out, and we've been testing it on some of our real world trading problems as we introduced here: https://bit.ly/4uINgjw This chart shows how Opus 4.8 scores on the trading internship exam we use at Optiver, across different reasoning-effort settings and relative to previous generations of Claude models. It's exciting to see the continued progress on this exam, particularly at lower effort settings. Congrats to the Anthropic team on the release.