Why Your Agent Harness Should Be Stateless: Decoupling Brain from Hands in Production
Most teams building AI agents treat the harness — the scaffolding that handles tool routing, context management, and the inference loop — as a long-lived, stateful process tied to a single container. When the container fails, the session dies. When you need to swap in a better model, you have to restart everything. When you want to scale horizontally, you hit a wall: each harness instance knows too much about its own state to be interchangeable.
The fix isn't a smarter harness. It's a stateless one.
