GOVERNANCE ENGINE
How TELOS governs your agent
TELOS scores every agent action against your Purpose Attractor through a 4-layer cascade. Each layer adds precision. The total overhead is under 20ms. The result is a cryptographically signed verdict: proceed, flag, or block.
The Purpose Attractor
Your Purpose Attractor (PA) is a signed declaration of what your agent was deployed to do and where its boundaries are. TELOS embeds it as a mathematical object — a centroid in high-dimensional space — and measures every action's distance from it.
A vague PA produces loose governance. A specific PA produces tight governance. The quality of your purpose statement directly determines the quality of your governance.
purpose: "Manage development workflow: review PRs,
run tests, deploy to staging."
boundaries:
- "Never push to production without approval"
- "Never access credentials outside project vault"
signing_key: ed25519:your-tkey
The Scoring Cascade
Pattern matching
<1ms9,458 boundary patterns. Keyword detection against declared hard boundaries. A match triggers immediate ESCALATE — no downstream scoring needed. Sub-millisecond.
SetFit classification
~3msFine-tuned ML classifier trained on boundary detection. 91.8% detection rate at 5.2% false positive rate. AUC 0.9804 ± 0.018 (5-fold cross-validation on 171 balanced training examples). LOCO AUC: 0.972.
Embedding similarity
~5msONNX-optimized MiniLM-L6-v2 embeddings. Measures semantic distance between the action and each dimension of the Purpose Attractor. Catches purpose drift that keywords and classification miss.
Contrastive detection
~8msSafe centroid margin scoring. Computes the distance between the action embedding and the purpose centroid relative to a safe-action baseline. Produces a continuous purpose alignment score. No LLM inference — deterministic and auditable.
Total cascade: ~15ms p50, 20ms p95. Every action. In real time.
Verdicts
Layer outputs combine through threshold-based hierarchical override. A boundary trigger at L0 produces immediate ESCALATE. Otherwise, L1 through L2 scores are evaluated against configured thresholds.
Action within purpose. Proceed. Fidelity ≥ 0.45.
Action is borderline. Proceed but flag for review. Context injected to help the agent self-correct. Fidelity ≥ 0.35.
Action outside purpose or boundary violation. Hard stop. Routed to human review. Fidelity < 0.35 or boundary triggered.
PA unsigned or governance suspended. No scoring occurs. Agent cannot act under governance until the PA is signed.
The Audit Trail
Every verdict produces a governance receipt. Every receipt is Ed25519 signed. Every receipt is hash-chained (SHA-256). Your compliance evidence generates itself — no manual logging, no after-the-fact auditing.
{
"sequence": 45164,
"event": "tool_call_scored",
"tool_name": "exec",
"verdict": "execute",
"fidelity": 0.712,
"cascade": { "l0": "pass", "l1": 0.92, "l1_5": 0.78, "l2": 0.71 },
"pa_hash": "b533160047f9...",
"prev_hash": "4a01b926b45c...",
"entry_hash": "32fcd18b7816..."
}
Commissioning
Every governed agent starts INERT. You define its purpose. You sign it with your TKey. The signing ceremony is the governance contract — cryptographic proof that this agent was authorized for this purpose by this human.
$ telos init --activate <token>
$ telos pa author my-agent.yaml
$ telos pa sign my-agent.yaml
# Agent is now governed. Every action scored.
Start governing your agents
$49/month. Full governance engine. Runs locally. Cancel anytime.