August 5, 2025: This article has been updated to reflect the availability of Claude Opus 4.1.
Key takeaways
- Claude 4 hybrid reasoning models let customers choose between near-instant responses and deeper reasoning.
- These models can transform how businesses can deploy AI for both complex tasks and everyday high-volume operations.
- Both models are designed to power more capable, autonomous AI agents for multi-step workflows across thousands of steps.
- Claude Opus 4.1 is Anthropic’s most powerful model yet and an industry-leader for coding.
Amazon Web Services (AWS) has announced the availability of Claude Opus 4.1 and Claude Sonnet 4—the latest generation of models from Anthropic—in Amazon Bedrock. These new, hybrid reasoning models (meaning they can toggle between near-instant responses and extended thinking) set new standards across coding, advanced reasoning, and multi-step workflows. They enable sustained performance on complex, long-running tasks and can power AI agents capable of doing hours of work in minutes.
The addition of Claude Opus 4.1 and Claude Sonnet 4 to Amazon Bedrock expands customers’ AI choices with Anthropic’s most advanced models, simplifying how customers build better, more transformative applications with enterprise-grade security and responsible AI controls.
Why you should care

Methodology
1. Opus 4.1, Opus 4, and Sonnet 4 were run using pass@1 with bash/editor tools (averaged over 10 trials, single-attempt patches, no test-time compute, using nucleus sampling with a top_p of 0.95).
2. All scores reported here use the default agent framework (“Terminus 1”) averaged over 5 trials.
3. Claude scores on MMMLU are the average over 14 non-English languages.
4. Opus 4.1, Opus 4, and Sonnet 4 were run on AIME using nucleus sampling with a top_p of 0.95.
1. Opus 4.1, Opus 4, and Sonnet 4 were run using pass@1 with bash/editor tools (averaged over 10 trials, single-attempt patches, no test-time compute, using nucleus sampling with a top_p of 0.95).
2. All scores reported here use the default agent framework (“Terminus 1”) averaged over 5 trials.
3. Claude scores on MMMLU are the average over 14 non-English languages.
4. Opus 4.1, Opus 4, and Sonnet 4 were run on AIME using nucleus sampling with a top_p of 0.95.
The new Claude 4 models fundamentally change how teams approach complex projects. This is especially true for large enterprises tackling work that requires sustained effort and deep expertise. Launched today, Claude Opus 4.1 is a drop-in replacement for Opus 4 that delivers improved performance and precision for real-world coding and agentic tasks. According to Anthropic, Claude Opus 4.1 is its most intelligent model to date and “an industry leader for coding and agents.” Its advanced coding capabilities include independently planning and executing complex end-to-end development tasks while adapting to the user’s style, while maintaining high quality. The model also offers improved frontend code generation, delivering strong visual output quality with a focus on effectively handling complex logic. Additionally, Opus 4.1’s long-horizon task handling and complex problem-solving abilities make it an ideal virtual collaborator for sustained reasoning and long chains of actions . It also enhances AI agent performance, enabling them to tackle complex, multi-step tasks with peak accuracy.
Claude Sonnet 4 surpasses its predecessor (Claude Sonnet 3.7) on both coding and reasoning, and offers a balance of performance and cost optimization for high-volume use cases, making it ideal for most production applications. Claude Sonnet 4 can power everything from real-time customer support agents to everyday development tasks like code reviews and bug fixes, and can also serve as a task-specific sub agent to handle multiple tasks at once, such as search, data analysis, or content synthesis. Customers in travel and hospitality can use Claude Sonnet 4 to run customer requests and deliver personalized responses in near real-time.
Both models include “extended thinking,” which allows Claude to switch between two modes: deep reasoning and action performance. Claude can run data analysis as needed, improving accuracy as it works, which helps it better anticipate and execute next steps.
Meet the AI
Claude Opus 4.1 operates like a brilliant detailed-oriented collaborator that aces in agentic search and research, content creation, and memory and context management, allowing for comprehensive insight synthesis, high-quality content production, and effective summarization. Meanwhile, Claude Sonnet 4 is efficient, creating a perfect blend of quick thinking and practical intelligence for every project. With a balance of speed and performance, Claude Sonnet 4 can seamlessly switch between tasks—all while maintaining a pragmatic approach and unwavering commitment to getting things done right the first time.
Straight from the source, Anthropic on Claude
"Claude Opus 4 and Claude Sonnet 4 transform AI from a tool into a true collaborator for every person and every team. Our customers will see project timelines shrink—in many cases from weeks to hours," said Kate Jensen, head of Growth and Revenue, Anthropic. "The Claude 4 models set new standards in coding, advanced reasoning, and multi-step workflows while understanding full business contexts and delivering precise results. The real breakthrough is freeing your talent for strategic work while Claude handles the heavy lifting."
Crunching the numbers
- Both models feature a 200K token context window, enabling customers to process and generate long bodies of content—like document analysis and research—with consistent quality and coherence. A token is the smallest unit of text data a model can process (e.g., a word, phrase, or an individual character). Longer responses are particularly effective for rich code and content generation.
- According to Anthropic, Claude Opus 4.1 advances its state-of-the-art* coding performance to 74.5% on SWE-bench, delivering steady, deliberate progress that keeps developers and their applications at the fore-front. It navigates large codebases with more focus and accuracy than its predecessor and excels at long-running tasks with improved planning and orchestration for coding agents. Beyond coding, Opus 4.1 improves Claude’s in-depth research and data analysis skills, especially around detail tracking and agentic search.
- The models can switch between providing a quick, direct answer and step-by-step thinking—improving performance for multi-step workflows by substantial margins on key industry benchmarks.
The bigger story
This next generation of Claude models represents a significant leap forward in agentic AI capabilities, transforming how businesses can deploy AI for both specialized complex tasks and everyday high-volume operations. Rather than simply generating content, Claude Opus 4.1 and Claude Sonnet 4 function more like expert virtual collaborators—maintaining focus across complex tasks, preserving relevant context, and delivering complete solutions without constant guidance. This capability transforms how organizations can tackle challenges from developing software systems to creating comprehensive marketing strategies. For everyday users, it means working with AI that better understands their needs and can take on more significant portions of projects independently.
What's around the corner?
According to Anthropic, Claude Opus 4.1 and Claude Sonnet 4 point toward a future where AI systems become increasingly capable partners in both creative and knowledge work. For example: taking on more specialized roles in organizations like handling routine analysis, coordinating across departments, and even managing complete workflows with minimal oversight.
Dive deeper
Visit the Anthropic’s Claude in Amazon Bedrock product page. for more detailed information on Claude Opus 4.1 and Claude Sonnet 4.