Branch off, don't push.
When Agent Etna writes a fix, it doesn't touch your main branch. It creates a temporary one — a stage where everything that follows happens. If anything goes wrong, the only thing affected is that stage.
Code breaks loudly. Agents don't. A bad fix can pass tests, deploy clean, and then start giving subtly wrong answers to a tenth of your users. By the time anyone notices, hundreds of conversations have already gone sideways. The sandbox is the time-out before that happens.
When Agent Etna writes a fix, it doesn't touch your main branch. It creates a temporary one — a stage where everything that follows happens. If anything goes wrong, the only thing affected is that stage.
The patched code starts up in an isolated environment. A real running version of your agent — just not the one your users are talking to. Agent Etna probes it the way a user would, watches the output, and waits for it to settle.
Every failing test that triggered the fix gets re-run against the sandbox copy. If they pass — and nothing else regresses — the fix has earned its way out. If anything fails, the sandbox is destroyed and your real agent never sees the change.
Even after the sandbox passes, Agent Etna doesn't merge unilaterally. You see the change — what was added, what was removed, what else uses the same code — and you say yes. Or you say try again.
Speed comes from being able to try things. Trust comes from knowing nothing you try can hurt your real users. The sandbox lets you have both — iterate as fast as you can think, ship only what you've proven works.
Try anything.
Break nothing.