Agent Harness
The scaffolding around an LLM — tools, memory, loops, and orchestration — that turns a model into an agent.
In plain English
An agent harness is the software layer that wraps a language model and gives it the abilities a true agent needs: a tool-calling loop, working memory, error handling, and a way to stop when the goal is met. The harness is what separates a one-shot chatbot from something that can actually do work.
What a typical harness provides:
- Tool-use loop — call → run → feed result back → call again, until done
- Memory — short-term context and longer-term storage of facts or files
- Planning — break a goal into steps, often with a separate planner model
- Guardrails — limits on tool calls, spending, or unsafe actions
- Observability — logs, traces, and replay for debugging
Examples: Claude Code, Cursor, OpenAI's Agent SDK, LangGraph, CrewAI, and Anthropic's Claude Agent SDK are all harnesses. The underlying model (Claude, GPT, Gemini) is interchangeable; the harness is what creates the user-facing product.