On March 14, 2026, OpenAI shipped Codex subagents to general availability — a production-ready system where one manager agent coordinates multiple specialized coding agents, each running in its own isolated cloud sandbox. It's not a research preview or a beta flag. It's the real thing: autonomous coding teams that execute complex, multi-file tasks while you sleep.
The architecture is fundamentally different from local-first tools like Claude Code. Every Codex subagent runs in a containerized environment with no access to your local machine. You describe the work, the manager decomposes it, subagents execute in parallel, and you get a pull request when it's done. Powered by GPT-5.4 with a 1M token context window, native computer use, and tool search, Codex represents OpenAI's bet that the future of AI coding is fully autonomous, cloud-native agents.
In this guide, we cover the full Codex subagent ecosystem: the manager-worker architecture, the Codex desktop app, Codex Security, the Symphony orchestration framework, API pricing, how it compares to Claude Code and Cursor, real-world workflows, and what to watch out for. Whether you're evaluating Codex for your team or already using it, this is the comprehensive reference.
📋 Table of Contents
- 1.What Are Codex Subagents?
- 2.Architecture: Manager Agent + Specialized Workers
- 3.Codex Desktop App: The Command Center
- 4.Codex Security: Automated Vulnerability Detection
- 5.Symphony Framework: Elixir-Based Agent Orchestration
- 6.API Integration & Pricing
- 7.Codex vs Claude Code vs Cursor
- 8.Real-World Workflows
- 9.Limitations & Known Issues
- 10.Why Lushbinary for AI Development
1What Are Codex Subagents?
Codex subagents are OpenAI's production multi-agent coding system, shipped to GA on March 14, 2026. The core concept: instead of one AI agent working through your codebase sequentially, a manager agent decomposes your task into subtasks and spawns specialized worker agents that execute in parallel — each in its own isolated cloud sandbox.
Each subagent gets a full containerized environment: its own filesystem snapshot, installed dependencies, and tool access. They can read files, write code, run tests, execute shell commands, and search documentation. When they finish, results flow back to the manager agent, which validates the work, resolves conflicts, and assembles the final output — typically a pull request or a set of file changes.
The critical distinction from local-first tools: Codex subagents never touch your local machine. Everything runs in OpenAI's cloud infrastructure. This makes them ideal for fire-and-forget workflows — kick off a task, close your laptop, and review the PR in the morning. It also means your local development environment stays clean and unaffected.
Key distinction: Codex subagents run in isolated cloud sandboxes with no local machine access. This is fundamentally different from Claude Code (which runs locally) and Cursor (which operates as an IDE extension). The tradeoff: you get full autonomy and safety isolation, but lose real-time interactive feedback.
2Architecture: Manager Agent + Specialized Workers
The Codex subagent architecture follows a manager-worker pattern. The manager agent is the orchestrator — it receives your task, creates an execution plan, spawns subagents, monitors progress, and assembles the final result. Worker subagents are specialized executors that handle individual subtasks within their sandboxed environments.
🎯 Manager Agent
Receives the user's task description, analyzes the codebase structure, decomposes the work into parallelizable subtasks, and spawns worker subagents. The manager monitors each worker's progress, handles failures and retries, resolves merge conflicts between workers, and produces the final output (PR, diff, or file changes).
👷 Worker Subagents
Each worker runs in its own isolated sandbox with a snapshot of the repository. Workers have full tool access: file read/write, shell execution, test running, and documentation search. They operate autonomously on their assigned subtask and report results back to the manager. Workers cannot communicate directly with each other — all coordination flows through the manager.
🔒 Cloud Sandboxes
Every subagent runs in a containerized environment with network isolation, resource limits, and no access to the user's local machine. Sandboxes are ephemeral — created for the task and destroyed after completion. This provides security isolation and ensures subagents can't interfere with each other or your development environment.
// Codex subagent architecture flow
User Task: "Add user authentication with OAuth,
JWT tokens, and role-based access control"
Manager Agent
├── Analyzes codebase structure
├── Creates execution plan (4 subtasks)
│
├── Spawns: Auth Worker [Sandbox A]
│ └── OAuth provider integration,
│ callback handlers, token exchange
│
├── Spawns: JWT Worker [Sandbox B]
│ └── Token generation, validation
│ middleware, refresh logic
│
├── Spawns: RBAC Worker [Sandbox C]
│ └── Role definitions, permission
│ guards, admin middleware
│
└── Spawns: Test Worker [Sandbox D]
└── Unit tests, integration tests,
E2E auth flow tests
Manager Agent
├── Collects results from all workers
├── Resolves file conflicts (3 workers
│ touched middleware/auth.ts)
└── Opens PR with consolidated changes3Codex Desktop App: The Command Center
In February 2026, OpenAI released the Codex desktop app for macOS — positioned as the "command center for agents." It's a native application that provides a visual interface for managing Codex subagent workflows, monitoring task progress, and reviewing results without switching between browser tabs and terminal windows.
The desktop app isn't just a wrapper around the web interface. It integrates directly with your local Git repositories, provides real-time progress visualization for running subagents, and offers a unified view of all active and completed tasks. Think of it as a project management dashboard specifically designed for AI agent workflows.
- Repository integration — Connect local Git repos and trigger Codex tasks directly from the app. The app handles repo snapshotting and upload to cloud sandboxes automatically.
- Live agent monitoring — Watch subagents work in real time with streaming logs, file change previews, and test execution results. Each subagent gets its own panel showing current activity.
- Task queue management — Queue multiple tasks, prioritize them, and let Codex work through your backlog autonomously. Review completed tasks and approve or reject changes at your own pace.
- PR review interface — Built-in diff viewer for reviewing subagent output before merging. Includes inline commenting and the ability to request revisions from the manager agent.
- Cost tracking — Real-time token usage and cost estimates for each task and subagent. Helps teams stay within budget on multi-agent workflows.
Platform note: The Codex desktop app is currently macOS-only (released February 2026). Windows and Linux versions are on the roadmap but no release dates have been announced. The web interface at codex.openai.com remains available on all platforms.
4Codex Security: Automated Vulnerability Detection
Codex Security launched in March 2026 as a dedicated application security product built on the subagent architecture. Instead of general-purpose coding tasks, Codex Security deploys specialized subagents that scan your codebase for vulnerabilities — each agent focusing on a different category of security concern.
The approach goes beyond traditional static analysis tools like Snyk or SonarQube. Because each subagent has full code comprehension through GPT-5.4, it can understand business logic context, trace data flows across files, and identify vulnerabilities that pattern-matching tools miss. The subagents don't just flag issues — they generate fix suggestions with full code diffs.
| Security Agent | Focus Area | Example Findings |
|---|---|---|
| Injection Scanner | SQL, XSS, Command Injection | Unsanitized user input in SQL queries, reflected XSS in templates |
| Auth Auditor | Authentication & Authorization | Missing JWT validation, broken access control, session fixation |
| Dependency Checker | Supply Chain Security | Known CVEs in dependencies, outdated packages, typosquatting risks |
| Config Reviewer | Infrastructure & Config | Exposed secrets in env files, permissive CORS, debug mode in production |
| Crypto Analyzer | Cryptographic Issues | Weak hashing algorithms, hardcoded keys, insecure random generation |
Key advantage: Unlike static analysis tools that rely on pattern matching, Codex Security subagents understand code semantics. They can trace a user input from an API endpoint through middleware, service layers, and database queries to identify injection vulnerabilities that span multiple files and abstraction layers.
5Symphony Framework: Elixir-Based Agent Orchestration
In March 2026, OpenAI published the Symphony framework on GitHub — an Elixir-based agent orchestration system that provides the building blocks for creating custom multi-agent workflows. Symphony is the same framework that powers Codex subagents internally, now available for developers to build their own agent systems.
The choice of Elixir is deliberate. The BEAM virtual machine (Erlang's runtime) excels at managing thousands of lightweight concurrent processes with built-in fault tolerance — exactly what you need for orchestrating multiple AI agents. Each agent runs as an Elixir process with its own state, message queue, and supervision tree. If an agent crashes, the supervisor restarts it without affecting other agents.
# Symphony framework — basic multi-agent setup (Elixir)
defmodule MyApp.CodingTeam do
use Symphony.Orchestrator
def run(task_description, repo_path) do
# Define specialized agents
agents = [
%Symphony.Agent{
role: :frontend,
model: "gpt-5.4",
system_prompt: "You are a React/TypeScript specialist.",
tools: [:file_read, :file_write, :shell, :test_run]
},
%Symphony.Agent{
role: :backend,
model: "gpt-5.4",
system_prompt: "You are a Node.js/Express API specialist.",
tools: [:file_read, :file_write, :shell, :db_query]
},
%Symphony.Agent{
role: :testing,
model: "gpt-5.4",
system_prompt: "You are a QA engineer writing tests.",
tools: [:file_read, :file_write, :test_run, :coverage]
}
]
# Manager decomposes task and assigns to agents
Symphony.Orchestrator.execute(
task: task_description,
repo: repo_path,
agents: agents,
strategy: :parallel,
conflict_resolution: :manager_merge
)
end
endSymphony provides several key primitives for building multi-agent systems:
- Agent definitions — Typed structs for defining agent roles, model selection, system prompts, and available tools. Agents are composable and reusable across different orchestration workflows.
- Orchestration strategies — Built-in patterns for parallel execution, sequential pipelines, fan-out/fan-in, and hierarchical delegation. Custom strategies can be defined as Elixir behaviours.
- Conflict resolution — Automatic handling of file conflicts when multiple agents modify the same files. Supports manager-merge (manager resolves), last-write-wins, and custom resolution functions.
- Supervision trees — Leverages OTP supervisors for fault tolerance. If an agent process crashes, it gets restarted with its last known state. The orchestrator can reassign failed tasks to new agents.
- Telemetry & observability — Built-in telemetry events for monitoring agent activity, token usage, task progress, and error rates. Integrates with standard Elixir observability tools.
Why Elixir? The BEAM VM can handle millions of lightweight processes with preemptive scheduling and per-process garbage collection. This makes it ideal for agent orchestration where each agent is a long-running process with its own state. The "let it crash" philosophy maps perfectly to agent fault tolerance — if an agent fails, restart it and retry.
6API Integration & Pricing
Codex subagents are powered by GPT-5.4, which brings a 1M token context window, native computer use capabilities, and tool search to every agent. The API is available through the standard OpenAI endpoints, making integration straightforward for teams already using the OpenAI platform.
| Model | Input Tokens | Output Tokens | Context Window |
|---|---|---|---|
| GPT-5.4 | $2.50/M | $15.00/M | 1M tokens |
| GPT-5.4-mini | $0.40/M | $1.60/M | 512K tokens |
| GPT-4o (legacy) | $2.50/M | $10.00/M | 128K tokens |
// Codex API — triggering a subagent task via the API
import OpenAI from "openai";
const openai = new OpenAI({ apiKey: process.env.OPENAI_API_KEY });
// Create a Codex task with subagents
const task = await openai.codex.tasks.create({
repo: "github.com/myorg/myapp",
branch: "main",
description: `
Add a user notification system:
- WebSocket real-time notifications
- Email digest (daily summary)
- In-app notification center component
- Database schema for notification storage
- Full test coverage
`,
model: "gpt-5.4",
subagents: {
strategy: "parallel",
max_workers: 4,
sandbox_config: {
runtime: "node:20",
install_command: "npm ci",
test_command: "npm test"
}
}
});
// Poll for completion or use webhooks
console.log(`Task ${task.id} started with ${task.subagent_count} subagents`);
console.log(`Estimated completion: ${task.estimated_minutes} minutes`);A typical multi-agent session with 3–4 subagents costs between $0.50 and $3.00 depending on task complexity and codebase size. The cost scales with the number of subagents and the amount of code each one needs to read and generate. For teams running Codex on a daily basis, expect monthly costs in the $50–$500 range depending on usage intensity.
Cost tip: Use GPT-5.4-mini for simpler subtasks like test writing and documentation updates. Reserve GPT-5.4 for complex architectural work and security-sensitive code. The Symphony framework supports per-agent model selection, so you can mix models within a single orchestration run.
7Codex vs Claude Code vs Cursor
The AI coding landscape in 2026 has three distinct paradigms: Codex (autonomous cloud agents), Claude Code (local-first interactive agents), and Cursor (AI-enhanced IDE). Each serves a different workflow and developer preference. Here's how they compare:
| Feature | Codex | Claude Code | Cursor |
|---|---|---|---|
| Execution Model | Cloud sandboxes | Local machine | IDE extension |
| Multi-Agent | ✅ Manager + workers | ✅ Agent Teams (experimental) | ❌ Single agent |
| Autonomy Level | Fully autonomous | Semi-autonomous | Interactive |
| Context Window | 1M tokens (GPT-5.4) | 200K tokens (Opus 4.6) | Varies by model |
| Real-Time Feedback | ❌ Async results | ✅ Live streaming | ✅ Inline suggestions |
| Local File Access | ❌ Cloud only | ✅ Direct access | ✅ Direct access |
| Agent Communication | Manager-mediated | Peer-to-peer mailbox | N/A |
| Security Isolation | ✅ Full sandbox | ⚠️ Local permissions | ⚠️ IDE permissions |
| Best For | Autonomous batch work | Complex interactive dev | Real-time pair programming |
| Pricing | $2.50/$15 per M tokens | $20–$200/mo subscription | $20–$40/mo subscription |
The decision framework comes down to workflow preference. Codex is for teams that want to describe work and walk away — it's the autonomous agent that works while you sleep. Claude Code is for developers who want an AI collaborator that works alongside them in real time with deep codebase understanding. Cursor is for developers who want AI assistance integrated directly into their editor for real-time pair programming.
Many teams use all three. Codex handles overnight batch tasks and large refactors. Claude Code handles complex, interactive development sessions where real-time feedback matters. Cursor handles day-to-day coding with inline completions and quick edits. They're complementary tools, not competitors.
8Real-World Workflows
Here are concrete scenarios where Codex subagents deliver measurable value over single-agent or interactive workflows:
🌙 Overnight Feature Development
Queue up feature tasks before leaving for the day. Codex subagents work through the night — one building the React components, another writing the API endpoints, a third handling database migrations, and a fourth writing tests. Review the PRs over morning coffee.
🔄 Large-Scale Migrations
Migrating from JavaScript to TypeScript across a 500-file monorepo. Codex spawns subagents for each module, converting files in parallel while maintaining type consistency across module boundaries. The manager agent handles cross-module type dependencies and resolves conflicts.
🛡️ Security Audit Pipeline
Run Codex Security as part of your CI/CD pipeline. On every release branch, subagents scan for injection vulnerabilities, authentication issues, dependency CVEs, and configuration problems. Results feed into your security dashboard with prioritized fix suggestions.
📚 API Documentation Generation
Point Codex at your API codebase and let subagents generate comprehensive documentation: one agent extracts endpoint definitions, another generates request/response examples, a third writes integration guides, and a fourth validates all code samples actually compile and run.
🧪 Test Coverage Expansion
Identify modules with low test coverage and deploy subagents to write tests in parallel. Each subagent owns a module, understands its business logic through the 1M token context, and generates unit tests, integration tests, and edge case coverage. The manager validates that all tests pass before opening the PR.
# Example: overnight workflow with Codex CLI
# Queue multiple tasks before leaving for the day codex task create \ --repo ./my-app \ --description "Convert all API routes from Express to Hono" \ --subagents 4 \ --notify slack:#dev-channel codex task create \ --repo ./my-app \ --description "Add comprehensive error handling to all service files" \ --subagents 3 \ --notify slack:#dev-channel codex task create \ --repo ./my-app \ --description "Write missing unit tests for utils/ directory" \ --subagents 2 \ --notify slack:#dev-channel # Check status in the morning codex task list --status completed # → 3/3 tasks completed, 3 PRs ready for review
9Limitations & Known Issues
Codex subagents are production-ready, but they're not magic. Here are the current limitations and known issues to factor into your evaluation:
- No local machine access — Subagents run entirely in cloud sandboxes. They can't access local services, databases, or APIs that aren't publicly reachable. If your development workflow depends on local Docker containers or VPN-only services, Codex can't reach them.
- Async-only workflow — There's no real-time interactive mode. You describe the task, Codex executes, and you review the result. If the output isn't right, you iterate with a new task. This is slower for exploratory work where you need rapid back-and-forth.
- Merge conflict complexity — When multiple subagents modify the same files, the manager agent resolves conflicts. For simple conflicts this works well, but complex semantic conflicts (like two agents implementing different approaches to the same problem) may produce suboptimal merges.
- Repository size limits — Very large monorepos (10GB+) can be slow to snapshot and upload to cloud sandboxes. OpenAI recommends using sparse checkouts or submodule-based repos for large codebases.
- No persistent state between tasks — Each task starts fresh. Subagents don't remember previous tasks or learn from past interactions with your codebase. Every task re-reads and re-analyzes the relevant code from scratch.
- Desktop app is macOS-only — The Codex desktop app released in February 2026 is currently limited to macOS. Windows and Linux users must use the web interface or CLI.
- Cost unpredictability — Multi-agent tasks can consume significantly more tokens than expected, especially when subagents encounter errors and retry. Setting token budgets per task is recommended but not yet granular at the per-subagent level.
The sweet spot for Codex subagents is well-defined, parallelizable tasks on codebases with good test coverage (so subagents can validate their own work). Tasks that require deep contextual understanding of undocumented business logic or real-time human feedback are better suited to interactive tools like Claude Code or Cursor.
10Why Lushbinary for AI Development
At Lushbinary, we help teams integrate autonomous AI coding agents into their development workflows — from Codex subagents to Claude Code to custom orchestration systems. We've been building with these tools since their early previews, and we know where they excel and where they fall short.
Multi-Agent Architecture
We design Codex subagent configurations and Symphony orchestration workflows tailored to your codebase, team size, and development process.
AI Pipeline Integration
From Codex to Claude Code to Cursor, we set up AI-assisted development pipelines that integrate with your existing CI/CD, code review, and deployment systems.
Cost Optimization
Multi-agent workflows can get expensive fast. We audit your usage patterns, optimize model selection per agent, and implement token budgets to keep costs predictable.
Security-First Development
We integrate Codex Security into your pipeline and build custom security scanning workflows that catch vulnerabilities before they reach production.
🚀 Ready to deploy autonomous coding agents for your team? Get a free 30-minute consultation. We'll assess your current workflow and recommend the right Codex subagent strategy for your codebase.
❓ Frequently Asked Questions
What are OpenAI Codex subagents?
Codex subagents are a production-ready multi-agent system shipped to GA on March 14, 2026. A single manager agent coordinates multiple specialized coding agents that run in isolated cloud sandboxes. Each subagent handles a specific task like frontend development, backend logic, or test writing, while the manager orchestrates the workflow and merges results.
How does Codex differ from Claude Code for multi-agent workflows?
Codex runs entirely in isolated cloud sandboxes — each subagent gets its own containerized environment with no access to your local machine. Claude Code runs locally on your machine with direct filesystem access. Codex is designed for autonomous, fire-and-forget workflows while Claude Code offers more interactive, real-time collaboration with peer-to-peer agent communication.
What is the OpenAI Symphony framework?
Symphony is an Elixir-based agent orchestration framework published by OpenAI on GitHub in March 2026. It provides primitives for building multi-agent systems including agent spawning, message passing, task coordination, and result aggregation. Symphony leverages Elixir's BEAM VM for lightweight process management and fault tolerance.
How much does GPT-5.4 cost for Codex subagents?
GPT-5.4 is priced at $2.50 per million input tokens and $15.00 per million output tokens. With its 1M token context window, a typical multi-agent session with 3–4 subagents costs between $0.50 and $3.00 depending on task complexity and codebase size.
What is Codex Security and how does it work?
Codex Security launched in March 2026 as a specialized application security product. It deploys subagents to scan codebases for vulnerabilities including injection flaws, authentication bypasses, insecure dependencies, and configuration issues. Each subagent specializes in a different vulnerability category and results are aggregated into a prioritized report with suggested fixes.
📚 Sources
- OpenAI — Codex Subagents: Multi-Agent Coding at Scale (March 14, 2026)
- OpenAI — Codex Desktop App for macOS (February 2026)
- OpenAI — Codex Security: AI-Powered Vulnerability Detection (March 2026)
- OpenAI — Symphony: Elixir-Based Agent Orchestration Framework (GitHub)
- OpenAI — GPT-5.4: 1M Context, Native Computer Use, Tool Search
Content was rephrased for compliance with licensing restrictions.
Deploy Autonomous Coding Agents for Your Team
Let Lushbinary help you integrate Codex subagents, Symphony orchestration, and multi-agent workflows into your development process. From strategy to production deployment, we'll get your autonomous coding teams shipping code while you sleep.
Build Smarter, Launch Faster.
Book a free strategy call and explore how LushBinary can turn your vision into reality.
