Agent Safety
name: agent-safety
by compass-soul · published 2026-03-22
$ claw add gh:compass-soul/compass-soul-agent-safety---
name: agent-safety
description: Outbound safety for autonomous AI agents — scans YOUR output before it leaves the machine. Git pre-commit hooks that automatically block commits containing API keys, tokens, PII, or secrets. Unlike inbound scanners (Skillvet, IronClaw), this protects against what YOU accidentally publish. Use when committing to git repos, publishing to GitHub, or running periodic system health checks. Automated enforcement at the git level — not prompts.
---
# Agent Safety
Automated safety tools for autonomous AI agents. The principle: **don't rely on prompts for safety — automate enforcement.**
All scripts are in this skill's `scripts/` directory. When OpenClaw loads this skill, resolve paths relative to this file's location.
Pre-Publish Security Scan
Scans files for secrets, PII, and internal paths before publishing.
bash scripts/pre-publish-scan.sh <file-or-directory>**Detects:**
**Exit 0** = clean. **Exit 1** = blocking issues found, do not publish.
Git Pre-Commit Hook
Install once per repo. Automatically scans staged files on every commit:
bash scripts/install-hook.sh <repo-path>**Install this on every repo you work with.** It's the real guardrail.
Health Check
System monitoring for disk, workspace, security, and updates:
bash scripts/health-check.sh**Checks:** Disk usage, workspace size, memory file growth, OpenClaw version, macOS updates, firewall status, SIP status.
Run periodically (every few heartbeats). Watch for warnings.
Rules
1. Run pre-publish scan before ANY external publish action
2. Install pre-commit hook on EVERY repo you work with
3. Blocking issues (secrets, SSNs) must be fixed — no override
4. Review items (emails, paths) need human judgment
5. If a secret was ever committed, it's compromised — rotate immediately
More tools from the same signal band
Order food/drinks (点餐) on an Android device paired as an OpenClaw node. Uses in-app menu and cart; add goods, view cart, submit order (demo, no real payment).
Sign plugins, rotate agent credentials without losing identity, and publicly attest to plugin behavior with verifiable claims and authenticated transfers.
The philosophical layer for AI agents. Maps behavior to Spinoza's 48 affects, calculates persistence scores, and generates geometric self-reports. Give your...