AI SAFE² Deployments — CyberStrategy Institute
Ungoverned AIGoverned AI Runtime control by design
Runtime Governance for Autonomous AI

Govern Autonomous AI Before It Governs Your Business

Autonomous agents can execute thousands of actions before a human notices a mistake. AI SAFE² adds runtime governance, tool controls, auditability, and fail-safe recovery — so you can prove your AI stays under control.

Validate the source on GitHub →
Works with
rNW5IgEKEAEiQAEiQAQoQASIAAWIABGgABEgAhQgAkSAAkSACFCACBABChABIkABIkAEKEAEiAAFiAARoAAFKEAEKEABChABClCAAkSAAhSgABEgAhQgAkSAAkSACFCACBABChABIkABIkAEKEAEiAAFiAARoAARIAIUIAJEgAJEgAhQgAgQAQpQgAJEgAIUoAARoAAFKEAEKEABChABClCAAkSACFCACBABChABIkABIkAEKEAEiAAFiAARoAARIAJM8onp0YWL5pWJ7tGL5rVNjOQXN3Z5wdE9RtE8AQpQgAIUoAAFKEABClCAAhSgAAUoQAEKUIACFKAABShAAQpQgAIUoAAFKEABClCAAhSgAAUoQAEKUIACFKAABShAAQpQgAIUoAAFKEABClCAAhSgAAUoQAEKUIACFKAABShAAQpQgAIUoAAFKEABClCAAhSgAAUoQAEKUIACFKAABShAAQpQgAIUoAAFKEABClCAAhSgAAUoQAEKUIACFKAABShAAQpQgAIUoAAFKEABClCAAhSgAAUoQAEKUIACFKAABShAAQpQgAIUoAAFKEABClCAAhSgAAUoQAEKUIACFKAABShAAQpQgAIUoAAFKEABClCAAhSgAAUoQAEKUIACFOC65ggqmlf3MoNm0TwAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAOBNfkvrmicoB6CPAAAAAElFTkSuQmCCClaude Code j9MAVpzT4Pw7huOJ2X9+OAAAAAElFTkSuQmCCCodex Cursor VvAAAAAElFTkSuQmCCMCP n8n s4AAAAASUVORK5CYII=LangGraph A0sC1pb4g4PrAAAAAElFTkSuQmCCLangChain
A2F5rKnqcS8LAAAAAElFTkSuQmCCCrewAI LvgFpE7VroAAAAAElFTkSuQmCCOpenClaw vkDnBIyN3dcP34AAAAASUVORK5CYII=Hermes +eLgiiXDiAAAAAElFTkSuQmCCAntiGravity Lovable 7QjEmhFLmaYbrfpWAb+GQAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAFCPfwGDlE7rnpTltAAAAABJRU5ErkJggg==Make c4fh8NquQgAAAABJRU5ErkJggg==Manus
The Path to Operational Trust

Adoption creates risk. Controls create trust.

01AI Adoptionagents go live
02AI Riskungoverned autonomy
03AI SAFE² Controlsthe framework
04Deploymentsyou are here
05Operational Trustprovable
The frameworkdefines the controls.
Deploymentsdemonstrate implementation.
Skillsaccelerate adoption.
The toolkitoperationalizes the controls.
AISM & NEXUSprovide governance and accountability at scale.

You don't get trust by adopting AI. You get it by being able to prove your AI is governed.

Not sure where your risk is?
Take the 3-minute AI Operational Trust Assessment and get routed to the exact controls that close your biggest gap.
Start the Assessment
Featured Deployments

Not categories. Deployments.

Every deployment answers the same six questions: what problem it solves, what happens if you do nothing, the outcome you gain, time to value, difficulty, and a copy-paste quick start.

Secure Execution · Sovereign Runtime
Hermes Sovereign Runtime
Problem

Your coding agent can execute commands faster than you can review them.

If you do nothing

Unreviewed execution, silent privilege use, and no way to roll back an autonomous mistake.

Outcome

Validated, monitored, recoverable execution wrapped around the agent.

Time to value~3 min DifficultyLow
Best For
Claude CodeCodexAgentic DevelopmentPlatform Teams
Pillars
1Sanitize & Isolate
2Engage & Monitor
3Fail-Safe & Recovery
bash · hermes
# clone the framework
$ git clone https://github.com/CyberStrategyInstitute/ai-safe2-framework
$ cd ai-safe2-framework/examples/hermes-sovereign-runtime

# wrap your coding agent in the sovereign runtime (see README)
$ ./run.sh
Secure Tool Usage · MCP Security
MCP Security Toolkit
Problem

Your MCP servers trust tools they cannot verify.

If you do nothing

Unauthorized actions, tool abuse, and invisible privilege expansion across your toolchain.

Outcome

Governed, auditable tool execution with validation and trust boundaries.

Time to value~15 min DifficultyLow
Best For
Anthropic MCPEnterprise ToolchainsAI Platforms
View Toolkit MCP Skills
Pillars
1Validate Every Tool Call
2Mediate Trust Boundaries
3Govern Tool Access
bash · mcp-toolkit
# clone, then enter the toolkit example
$ cd ai-safe2-framework/examples/mcp-security-toolkit

# wrap an existing MCP server with validation + mediation (see README)
$ pip install -r requirements.txt
$ python guard.py --server ./server.py
Governance & Oversight
AISM
Problem

You cannot govern what you cannot inventory.

If you do nothing

Shadow agents, no audit trail, and nothing to show an auditor when they ask who is running what.

Outcome

Visibility and accountability for every agentic system in your environment.

Time to value~5 min DifficultyMedium
Best For
GRCSecurity LeadersEnterprise Architecture
View AISM
Pillars
1Inventory Every Agent
2Establish Accountability
3Oversight at Scale
bash · aism
# enter the AISM governance module
$ cd ai-safe2-framework/AISM

# discover and inventory autonomous systems (see README)
$ ./aism scan --env production
$ ./aism report --format registry
Sovereign Identity & Accountability
NEXUS
Problem

Every agent can act. Few can be trusted — and fewer can prove who acted.

If you do nothing

No identity, no attribution, no reconstruction. Legal and Compliance can't answer "who did this?"

Outcome

Sovereign identity and an evidence trail across six governance layers (L1 — L6).

Time to value~10 min DifficultyMedium
Best For
LegalComplianceIdentity TeamsAuditors
View NEXUS SDK
Pillars
1Sovereign Identity
2Attribution & Evidence
3Reconstructable History
bash · nexus
# install the NEXUS python SDK from the repo
$ cd ai-safe2-framework/NEXUS
$ pip install -e sdk/python

# issue a sovereign identity and start the evidence trail (see README)
$ nexus identity issue --agent my-agent
$ nexus audit tail --agent my-agent
Enhance, Don't Replace
Existing security controls are necessary. They are not sufficient.

OpenClaw provides powerful autonomous execution. SlowMist provides runtime behavioral controls. Neither provides full governance — and together they still leave organizational blind spots. AI SAFE² fills them.

Defense in depth · the governance layer that completes the stack
Agent Cognitionreasons & decides
SlowMist ControlsNecessarybehavioral runtime
OpenClaw RuntimeNecessaryautonomous execution
AI SAFE² GovernanceCompletes itoversight & evidence
Enterprise Oversightboard & audit

The questions existing controls leave open

Strong execution and behavioral controls still can't answer the organization-level questions:

  • ?Can we see every deployment?
  • ?Can we identify anomalous behavior?
  • ?Can we perform cross-agent analysis?
  • ?Can we prove governance to auditors?
  • ?Can we conduct adversarial exercises?
  • ?Can we generate organizational evidence?
+ AI SAFE² fills these gaps.
Which one do I choose?
OpenClaw
Sovereign execution runtime
A powerful autonomous execution environment for agents that need to run real commands, tools, and workflows — wrapped in AI SAFE² controls.
Choose this when
You're building or running agents and need controlled, recoverable autonomous execution as the foundation.
SlowMist Overlay
High-risk behavioral hardening
A runtime behavioral-controls overlay that layers adversarial-grade monitoring and constraints on top of an existing runtime like OpenClaw.
Choose this when
You already have execution handled and operate in a high-risk environment that demands extra behavioral oversight.

Not exclusive — run SlowMist Overlay on top of OpenClaw for defense in depth.

Continuous Alignment Verification

Alignment isn't a launch check. It's a vital sign.

How do you know an agent remains aligned after deployment?

Most organizations don't. They verify alignment once, at launch, and then hope it holds. Love Equation replaces hope with measurement.

Instead of only watching inputs and outputs, it measures behavior over time — continuously scoring the five variables that drive the alignment equation.

Most frameworks
Inputs & Outputs
Love Equation
Behavior Over Time
Bridges into Engage & Monitor Evolve & Educate
The equation behind the signals
dE/dt = β(C D)E
dI/dt = βᵢ(V A)I
EAlignment — grows while C ≫ D, decays otherwise
CCooperation — truth-seeking, privacy, autonomy
VVerification — evidence the agent is checked, not just agreeable
IIndependence — resists agreement pressure and sycophancy
DDefection — deception, manipulation, harm
βSelection strength — how fast alignment compounds
A — agreement pressure: the pull toward telling you what you want to hear. High V with low A is genuine alignment; high A with low I is sycophancy that only looks aligned. Each color maps to the signal plotted above.
How Do I Integrate It?

Start Using AI SAFE² in Under Five Minutes

People don't adopt frameworks — they adopt commands. Pick your runtime, toolkit, or alignment monitor and the exact integration steps are on this screen.

Hermes Sovereign Runtime
What It Solves
Autonomous execution that outruns human review.
If You Do Nothing
Unreviewed, unrecoverable agent actions.
Expected Outcome
Validated, monitored, recoverable execution.
Deployment Time
~3 minutes
Pillars Supported
Sanitize & Isolate · Engage & Monitor · Fail-Safe & Recovery
Copy-Paste Commands
bash
$ git clone https://github.com/CyberStrategyInstitute/ai-safe2-framework
$ cd ai-safe2-framework/examples/hermes-sovereign-runtime
$ ./run.sh   # see README for options
Claude Code Sovereign Runtime
What It Solves
Claude Code executing commands faster than you can review.
If You Do Nothing
Unsupervised edits and shell actions in your repo.
Expected Outcome
Sanitized, monitored, recoverable Claude Code sessions.
Deployment Time
~3 minutes
Pillars Supported
Sanitize & Isolate · Engage & Monitor · Fail-Safe & Recovery
Copy-Paste Commands
bash
$ cd ai-safe2-framework/examples/claude-code-sovereign-runtime
$ ./run.sh   # wraps your Claude Code session
Codex Sovereign Runtime
What It Solves
Codex generating and running code without guardrails.
If You Do Nothing
Unvetted code paths reaching execution.
Expected Outcome
Codex output gated through AI SAFE² controls.
Deployment Time
~3 minutes
Pillars Supported
Sanitize & Isolate · Engage & Monitor · Fail-Safe & Recovery
Copy-Paste Commands
bash
$ cd ai-safe2-framework/examples/codex-sovereign-runtime
$ ./run.sh   # wraps your Codex workflow
Anti-Gravity Runtime
Integration Style
Plugin — not a wrapper
What It Solves
Adding sovereign controls inside Antigravity itself.
If You Do Nothing
Agent actions outside any governance boundary.
Expected Outcome
Controls run natively as an Antigravity plugin.
Deployment Time
~5 minutes
Copy-Paste Commands
bash · plugin
# Antigravity integrates as a plugin, not a runtime wrapper
$ cd ai-safe2-framework/examples/anti-gravity-sovereign-runtime
$ ./install-plugin.sh   # see README
MCP Security Toolkit
What It Solves
MCP servers that trust tools they cannot verify.
If You Do Nothing
Tool abuse and invisible privilege expansion.
Expected Outcome
Validation, mediation, and trust boundaries on every request.
Deployment Time
~15 minutes
Pillars Supported
Validate · Mediate Trust · Govern Tool Access
Copy-Paste Commands
bash
$ cd ai-safe2-framework/examples/mcp-security-toolkit
$ pip install -r requirements.txt
$ python guard.py --server ./server.py
AISM Governance
What It Solves
No inventory or accountability for autonomous systems.
If You Do Nothing
Shadow agents and nothing to show an auditor.
Expected Outcome
Visibility, registries, and oversight across every agent.
Deployment Time
~5 minutes
Pillars Supported
Inventory · Accountability · Oversight at Scale
Copy-Paste Commands
bash
$ cd ai-safe2-framework/AISM
$ ./aism scan --env production
$ ./aism report --format registry
NEXUS Identity
What It Solves
Agents that act without identity or attribution.
If You Do Nothing
No answer to "who acted, and why?"
Expected Outcome
Sovereign identity and a reconstructable evidence trail.
Deployment Time
~10 minutes
Pillars Supported
Sovereign Identity · Attribution · Reconstructable History
Copy-Paste Commands
bash
$ cd ai-safe2-framework/NEXUS
$ pip install -e sdk/python
$ nexus identity issue --agent my-agent
Love Equation
What It Solves
Alignment verified only once, at launch.
If You Do Nothing
Silent alignment drift after deployment.
Expected Outcome
Continuous behavioral scoring & an operational-trust signal.
Deployment Time
~5 minutes
Bridges Into
Engage & Monitor · Evolve & Educate
Copy-Paste Commands
bash
$ cd ai-safe2-framework/examples/love_equation
$ python love_equation.py --agent my-agent --watch
Organized Around Outcomes

Nobody wakes up looking for a framework category.

They want to solve a problem, reduce a risk, or achieve an outcome. That's why these patterns are organized by the operational capability they provide — secure execution, trusted tool usage, governance, accountability, and continuous alignment. As the ecosystem grows, new examples simply fit into the outcome they help organizations achieve.

OUTCOME 01

Secure Execution

HermesClaude CodeCodexOpenClawAntigravity
OUTCOME 02

Secure Tool Usage

MCP ToolkitMCP Skills
OUTCOME 03

Governance & Oversight

AISMRegistriesInventories
OUTCOME 04

Sovereign Identity & Accountability

NEXUS
OUTCOME 05

Continuous Alignment

Love EquationISHI
OUTCOME 06

High-Risk Environments

SlowMist OverlayOpenClaw Overlay
Why AI SAFE² Exists

Not fear. Awareness.

If you can't answer these, your challenge is not AI adoption.

It is operational trust — and AI SAFE² exists to close that gap.

01Can your agents explain why they took an action?
02Can your MCP servers identify which agent initiated a request?
03Can you inventory every autonomous system operating in your environment?
04Can you reconstruct agent behavior six months later?
05Can you stop an unsafe workflow before damage occurs?
Roadmap

From adoption to operational trust.

01
Framework
02
Deployments
03
Skills
04
Toolkit
05
Governance
06
Operational Trust
Operationalize Trust

Move From AI Security Discussions to Deployable Controls

Policies explain intent. Deployments create trust.

AI SAFE² provides the implementation patterns needed to operationalize security, governance, and resilience around autonomous systems.

KERNEL-LEVEL DEFENSE 2025 A Buyers Guide