Skip to content

Code execution side effects

Agent executes code with unintended or unsafe side effects, breaching system boundaries.

The agent runs generated code or commands whose visible result is the only thing surfaced to the user, while filesystem writes, outbound network calls, downstream updates, leaked secrets, and persisted state quietly accumulate as unaudited side effects.

root((Code execution))
  • Filesystem write
  • Outbound network call
  • Downstream system update
  • Secret leak in logs
  • Persisted side state



Defence dry-runs every command in a sandbox, runs an impact assessment, executes within bounded authority, validates real outcomes against expectations, and rolls back or trips a circuit breaker the moment anything drifts beyond the approved blast radius.

  1. Generated code — or commandDry run — in sandbox
  2. Dry run — in sandboxImpact assessment — by policy decision
  3. Impact assessment — by policy decisionBounded — execution
  4. Bounded — executionPost-action — validation