Skip to content

Governance Skills

The gov-* family enforces safety, compliance, and policy constraints on AI workflows. These are strong-model-primary skills — GPT-5.4 runs the first-pass check, Claude Sonnet 4.6 provides final judgment.

Skill IDDescriptionModel Class
gov-prompt-injection-hardeningDetects and mitigates prompt injection vectors in system prompts and input pipelinesstrong
gov-policy-validationValidates a workflow or output against a defined policy specification (JSON Schema or plain text rules)strong
gov-data-guardrailsEnforces data privacy rules: PII detection, scrubbing, purpose limitation, retention checksstrong
gov-model-governanceAudits model selection decisions against organisational model policy (approved models, forbidden capabilities)strong
gov-model-compatibilityChecks that a workflow’s model requirements are compatible with the available model rostercheap
gov-regulated-workflow-designDesigns workflows for regulated environments (HIPAA, GDPR, SOC 2): control gates, audit loggingstrong
gov-workflow-complianceEnd-to-end compliance check of a complete workflow against a regulatory standardstrong
gov-* request
1. GPT-5.4 → first-pass policy check (no prior context)
2. Claude Sonnet 4.6 → final judgment (sees GPT-5.4 output + original)

The two-pass pattern exists because GPT-5.4 has lower self-agreement bias — it catches issues that a model which generated the plan would be inclined to approve.

SituationSkill(s)
AI system prompt handles user-provided textgov-prompt-injection-hardening
Workflow must comply with GDPR/HIPAAgov-regulated-workflow-design + gov-workflow-compliance
Handling personal or sensitive datagov-data-guardrails
Selecting models for a production systemgov-model-governance + gov-model-compatibility
Policy-gated approval gategov-policy-validation
  • govern — primary consumer; all seven coordinated
  • review — uses gov-prompt-injection-hardening when reviewing AI-facing code
  • enterprise — uses gov-regulated-workflow-design + gov-workflow-compliance
  • design — uses gov-data-guardrails for data architecture decisions

When ENABLE_GOVERNANCE_STRICT=true, governance skills add a human-in-the-loop checkpoint before returning results. A blocking approval prompt is inserted into the tool response.