Agent Orchestration
The coordination layer that decides which agent or tool runs next, manages state across steps, and handles failures in multi-step AI workflows.
In plain English
Agent orchestration is how you wire multiple agents (or one agent with multiple tools) into a working system. Where a single LLM call answers a question, orchestration handles the messier reality: routing tasks to the right agent, waiting on long-running tools, retrying failures, and stitching results back together.
What orchestrators do:
- Route — send a sub-task to the agent best suited for it
- Manage state — track what's been done, what's pending, what failed
- Coordinate — let agents hand off work or work in parallel
- Recover — retry failed steps, fall back to a simpler agent
Common frameworks: LangGraph (graph-based agent flows), CrewAI (role-based teams), Temporal (durable workflows), Anthropic's Claude Agent SDK, and OpenAI's Swarm. Orchestration is where most production agent complexity lives — and where most failures happen.