prompt-engineering
Mission
Section titled “Mission”Structure → test → refine → version. Every prompt is a versioned, tested artifact.
When to Use
Section titled “When to Use”Use when writing a new system prompt, building a prompt template, improving an existing prompt that is failing or hallucinating, versioning prompts, chaining prompts into pipelines, calibrating agent autonomy levels, or evaluating prompt quality.
Triggers: “write a system prompt”, “improve this prompt”, “prompt is hallucinating”, “prompt template”, “chain these prompts”, “calibrate autonomy”, “prompt version”
Skills Invoked
Section titled “Skills Invoked”prompt-engineering— design from scratchprompt-chaining— multi-step prompt chainsprompt-refinement— iterative improvement from eval feedbackprompt-hierarchy— system → instruction → user layer designeval-prompt— score generated prompts
Chain-To
Section titled “Chain-To”quality-evaluate— benchmark the new prompt against baselinepolicy-govern— enforce governance layers in the prompt hierarchy
Refinement Loop
Section titled “Refinement Loop”prompt → eval-prompt score ↓ (below threshold)prompt-refinement ↓modified prompt → re-score ↓ (repeat)Example
Section titled “Example”{ "request": "The code generation prompt keeps producing code with `any` types. Fix it."}Output: Diagnosis of failure mode, refined system message with explicit TypeScript constraints, version tag, and eval score comparison.