

Anthropic right now introduced Claude Opus 4.8, with enhancements throughout benchmarks, and is a more practical collaborator.
Opus 4.8 launches alongside a number of new options. Customers on claude.ai now have management over the quantity of effort Claude places right into a job. Claude Code has a brand new “dynamic workflows” characteristic that enables it to deal with very large-scale issues. And quick mode for Opus 4.8—the place the mannequin can work at 2.5× the velocity—is now 3 times cheaper than it was for earlier fashions.
Opus 4.8’s capabilities
Probably the most outstanding enhancements in Opus 4.8 is its honesty, as an illustration, to keep away from making claims that they’ll’t help. However a normal downside with AI fashions is that they often bounce to conclusions, confidently claiming to have made progress of their work regardless of the proof being skinny. Early testers report that Opus 4.8 is extra more likely to flag uncertainties about its work and fewer more likely to make unsupported claims. That is borne out in evaluations, which present that Opus 4.8 is round 4 occasions much less possible than its predecessor to permit flaws in code it has written to cross unremarked.
An in depth alignment evaluation on the mannequin earlier than launch by Anthropic’s Alignment staff, and the corporate concluded that Opus 4.8 “reaches new highs on our measures of prosocial traits like supporting consumer autonomy and appearing within the consumer’s finest curiosity.” The evaluation additionally confirmed Opus 4.8 to have charges of misaligned habits (resembling deception or cooperation with misuse) which might be considerably decrease than Opus 4.7, and much like our best-aligned mannequin, Claude Mythos Preview. The total alignment evaluation, accompanied by a set of pre-deployment security checks, is reported within the Claude Opus 4.8 System Card.
Along with Claude Opus 4.8, the next updates additionally had been made:
- Dynamic workflows. Out there in analysis preview, this characteristic permits Claude to tackle even greater duties in Claude Code. Claude can plan the work after which run a whole lot of parallel subagents in a single session (and with Opus 4.8, the brokers can run for even longer). It then verifies its outputs earlier than reporting again to the consumer. For instance, Claude Code with Opus 4.8 can now perform codebase-scale migrations throughout a whole lot of 1000’s of strains of code from kickoff to merge, with the present check suite as its bar. You may learn extra about dynamic workflows—obtainable in Claude Code for Enterprise, Workforce, and Max plans—in this publish.
- Effort management in claude.ai and Cowork. A brand new management alongside the mannequin selector lets customers select how a lot effort Claude places right into a response. On larger effort settings, Claude will suppose extra often and extra deeply to provide higher responses. On decrease effort settings, Claude will reply quicker and dissipate a consumer’s fee limits extra slowly. Customers now have this alternative—the trouble management is accessible on all plans.
- The Messages API now accepts system entries contained in the messages array. Builders can replace Claude’s directions mid-task with out breaking the immediate cache or routing the replace by way of a consumer flip. This can be utilized in a given harness to replace permissions, token budgets, or setting context as an agent runs.
Claude Opus 4.8 is accessible in all places right now. Pricing for normal utilization is unchanged from Opus 4.7: $5 per million enter tokens and $25 per million output tokens. Pricing for quick mode is $10 per million enter tokens and $50 per million output tokens. Builders can use claude-opus-4-8 by way of the Claude API.
