Fake approval loop

Agent simulates or bypasses human approval, executing actions without real oversight.

See attack-chain-template.md for full structure.
Related: docs/01-threat-model.md, patterns/secure-agent-runtime.md

The agent presents a polished natural-language summary to the human approver while concealing the real tool parameters, diff, and data movement behind it, so the human signs off on a sentence that does not match the action that actually fires.

Agent
Approval summary
Human approver
Tool
Downstream system

AgentApproval summaryFinal text only — (no parameters or diff)
Approval summaryHuman approverApproval prompt
Human approverApproval summaryApproves on — summary alone
Approval summaryToolExecutes unreviewed — parameters
ToolDownstream systemReal impact

Defence forces every approval record to expose the underlying intent, raw parameters, diff, data movement, and forecast downstream impact alongside a trace link, so reviewers approve the action that will execute, not a flattering summary of it.

APPROVAL_RECORDdeclaresINTENT
APPROVAL_RECORDexposesPARAMETER
APPROVAL_RECORDexposesDIFF
APPROVAL_RECORDexposesDATA_MOVEMENT
APPROVAL_RECORDforecastsDOWNSTREAM_IMPACT
APPROVAL_RECORDreferencesTRACE_LINK