Claude 4 Opus and Sonnet have been released and appear to be the latest state of the art..
Anthropic just launched Claude 4, featuring major upgrades across coding, reasoning, and AI agents:
🧠 New Models
- Claude Opus 4: Best-in-class coding model; leads SWE-bench (72.5%) & Terminal-bench (43.2%)
- Claude Sonnet 4: Big leap over 3.7; excels at instruction following, reasoning, and efficiency
🛠️ Tool Use & Reasoning
- Extended thinking with integrated tool use (e.g. web search)
- Parallel tool execution for more complex workflows
- Improved memory via local file access; stores & recalls key info over long tasks
💻 Claude Code (General Availability)
- IDE integrations: VS Code & JetBrains (inline edits)
- GitHub Actions & Claude Code SDK for custom dev agents
- Responds to PR feedback, CI errors, and more
🧰 New API Features
- Code execution tool
- Files API (upload, manage, access project data)
- MCP connector (structured environments)
- Prompt caching (retain prompts up to 1 hour)
📊 Performance & Safety
- Hybrid mode: instant responses or extended thinking
- 65% fewer shortcut behaviors vs. Sonnet 3.7 in agent tasks
- Developer Mode for full access to Claude’s thought chains
- ASL-3 safety protections now in effect
💲Availability & Pricing
- Opus 4: $15 (input) / $75 (output) per million tokens
- Sonnet 4: $3 / $15; available to free-tier users
- Models on Anthropic API, Amazon Bedrock, Google Cloud Vertex AI
Claude 4 pushes the boundary for long-term reasoning, agent performance, and code quality. Ideal for serious dev workflows.