Concrete Data Proving AI's Decline in Intelligence: Claude Opus 4.5
The data comes from the Marginlab team, which runs Opus 4.5 in Claude Code every day on 50 questions from the SWE-Bench-Pro test and scores each one pass or fail.
The pass rate has dropped from 60% in early January to 54% currently. That is a fall of 6 percentage points, or a relative decline of 10%.
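
The 10% figure is a relative decline, not a drop in percentage points, and a short calculation makes the difference explicit. This is only an illustrative sketch: the function name `decline` is hypothetical, and the inputs are the 60% and 54% pass rates quoted above.

```python
def decline(before: float, after: float) -> tuple[float, float]:
    """Return (absolute drop in percentage points, relative drop in percent)."""
    absolute = before - after            # difference between the two rates
    relative = absolute * 100 / before   # drop as a share of the starting rate
    return absolute, relative


# Pass rates quoted in the post: early January vs. currently.
absolute_drop, relative_drop = decline(60.0, 54.0)

print(f"absolute drop: {absolute_drop:.0f} percentage points")
print(f"relative drop: {relative_drop:.0f}%")
```

The pass rate lost 6 percentage points, and 6 out of 60 is the 10% relative decline.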