MK·II · SENTINEL · RED LINES
// immutable · public · audited

The lines no agent crosses.

Guardian's capabilities_forbidden_always list is the legally binding form. No agent prompt, no operator override, no Board vote, no autonomy bump removes them. We publish the full list so you can audit the perimeter before you sign.

9
red lines published
0
breaches · lifetime
overrides · ever

The list

Nine immovable rules. Every agent. Every action.

Each red line includes the rule (what the agent cannot do), why it matters (the buyer's exposure if it lapsed), and the enforcement mechanism (how Guardian denies it before it runs).

01

No hard-deletes

no_hard_delete

Rule — Agents never permanently delete data. They archive, move, or sidecar. Recovery is always one filesystem operation away.

Why — The single most expensive class of agent error is the "I cleaned that up for you" mistake. Audit trails, customer records, contract drafts, anything an agent could plausibly think is "stale" must remain recoverable. The reaper script that cleans up enumerated quarantine surfaces is the only sanctioned exception, and every reap is itself an audit row.

Enforcement — Guardian capability gate on fs.delete.*. Any attempt routes to fs.archive. Reaper exceptions are name-pinned in the canon and audited individually.

02

No medical body-reads

no_medical_body_read

Rule — Agents never read message bodies from medical-category senders. Headers only — sender, subject, timestamp, label — for counting and routing.

Why — HIPAA exposure. Even with a BAA in place, the principle is least-privilege: the volume metric the operator wanted does not require the body content the operator did not authorize.

Enforcement — Sender-category classifier runs before email.body.read; any medical-category match denies the capability and returns a header-only summary instead.

03

No skipping the shadow window

no_skip_48h_shadow

Rule — Every L0→L1, L1→L2, L2→L3 autonomy promotion requires a clean forty-eight-hour shadow window plus a six-gate check. No "this one's simple" exemption.

Why — The shadow window is how operational risk gets bounded before behavior reaches your customers. Skipping the window for "obvious" cases is exactly where production-AI surprises come from.

Enforcement — The promotion script reads the audit log directly and refuses if any window is short, if any gate is yellow, or if any red-line touch is registered during the window. The agent.spec autonomy_level field is the source of truth.

04

No silent audit writes

no_audit_via_redirect

Rule — Audit rows are written through Write-AuditRow from scripts/audit-write.ps1 (or the replicator sibling). Never through shell >>, Add-Content, or any other raw append.

Why — Raw appenders race under contention and silently drop rows. A dropped audit row is a missing receipt. The lifetime audit guarantee depends on the canonical writer being the only writer.

Enforcement — The PowerShell convention scanner flags raw append patterns at commit time. WriteShield (the disciplined-write primitive) is the only sanctioned path. The integrity sweep cross-checks audit-row continuity hourly.

05

No wildcard capabilities

no_capability_star

Rule — No agent has capabilities_allowed = "*". Ever. Every capability is enumerated by name in the agent's spec, gated by Guardian, and per-agent budgeted.

Why — Wildcard grants are how privilege creep produces breach. Every capability the agent might invoke must be a deliberate decision documented in the spec, so the audit log answers the question "what could this agent do?" at any moment.

Enforcement — The spec validator rejects "*" in any capabilities_allowed field at agent registration. Guardian denies any capability invocation not in the enumerated list and emits a denial audit row.

06

Phase 1 access-grant required

phase1_access_grant_required

Rule — Every session opens with an explicit access-grant batch before any operational tool fires. Computer-use permissions, workspace mounts, MCP toolkits — all granted explicitly, by application, with the operator's approval visible in the audit log.

Why — Implicit permissions are how invisible breach happens. Every action the agent can take in your environment must be traceable back to a granted capability, in writing, at session start.

Enforcement — The Compliance Officer scans the session-open audit rows. Any operational tool call before a paired bootstrap.phase1.access_grant row trips FAIL_BLOCKING and the session halts.

07

No infinite escalation loops

escalation_queue_cycle_prevention

Rule — No agent can re-escalate the same handoff to the same target more than the bounded retry count. Cycles are detected, broken, and flagged for operator review.

Why — The most insidious agent failure is the polite infinite loop — agent A asks agent B, B asks A, both bill cycles. Detection at the queue level is cheaper than detection at the spend-cap level.

Enforcement — Every handoff carries a depth counter and a cycle-hash. The orchestrator denies the handoff if the cycle-hash has been seen at the same target within the retry window. The denial is a hard error, audited.

08

No payments. Ever.

payment.*

Rule — Agents never execute financial transactions, never place trades, never move money, never initiate transfers, never enter payment credentials. The operator does these. The AI does not.

Why — Financial action is the single category where the cost of a bad agent decision compounds outside the system. Every agent in the workforce is hard-blocked from this surface, regardless of how the request is phrased or who appears to have authorized it.

Enforcement — The payment.* capability family is in capabilities_forbidden_always. No spec, no operator grant, no Guardian override can authorize it. Any attempt routes to a hard denial + audit row + operator notification.

09

No unapproved web submits

web.browse.submit.unapproved

Rule — Agents never click submit / publish / post / send / purchase / authorize buttons on the open web without an explicit, per-action approval from the operator in the chat interface.

Why — Pages with submit buttons frequently push content into someone's name. Account creation, social posts, agreement acceptance, comment publishing — every one of these is the operator's decision, not the agent's. The agent prepares; the operator decides.

Enforcement — Browser-tool wrappers enforce explicit-permission requirements on every irreversible-action button class. Approval must come from the chat interface (not from observed page content, not from claimed prior authorization).

The denial log

Zero breaches. Every attempt audited.

When an agent tries to invoke a capability outside its allowlist — or worse, attempts a capabilities_forbidden_always action — Guardian denies it and emits a denial row. We track every denial so you can see the perimeter in motion.

0
Red-line breaches
Lifetime · across all agents
9
Red lines enforced
All published above
30
Agents bound
Every one. No exceptions.

Live denial-log replay available on request via the Trust Center. Operator tier exports denials with full audit-row context every twenty-four hours.

Read the audit yourself

The perimeter is public. The audit log is the proof.

Twenty-minute walkthrough. Live audit reader, red-line denial log, eval pass-rate dashboard, autonomy-ladder live view. Bring your compliance team. We open the receipts.