Hot off the presses....
Codex Updates Summary
Code Quality & Style
- Models were trained to maintain high code quality and style.
- Improved at avoiding unnecessary or unrequested code generation.
- Useful features include the ability to quickly verify agent activity summaries (e.g., preview of test results).
Performance Highlights
- Codex-1 performs strongly even without .md agents or custom scaffolding.
- Consistently outperforms o3-high on software engineering tasks.
Alignment Improvements
- Significant alignment work went into Codex.
- Results in cleaner patches and code that better aligns with developer preferences, standards, and instructions.
Internal Usage at OpenAI
OpenAI engineers use Codex for:
- Refactoring
- Renaming
- Writing tests
- Scaffolding new features
- Wiring components
- Fixing bugs
- Drafting documentation
New habits are forming as background tasks are increasingly offloaded to agents.
Codex-Mini-Latest (Responses API)
looks expensive though.. charging on token usage model.
I think claude code is a much better deal, you can get pretty good limits for $100 / mo
but not sure how it compares.
let us know if you try codex and how it is working for you