Quick frame I've been using for AI-assisted work that actually scales.
When the task is new → prompt. You're discovering what the work looks like. The model is your sparring partner.
When the task repeats → make it a skill. Package the context, scripts, criteria, fallback path. You stop explaining everything from scratch.
When the skill is stable → move the deterministic parts to gates. Formatter for code. Linter for forbidden phrases. Schema for output shape. The model can still draft; the gate decides whether it passes.
When the gates are stable → reduce the LLM's responsibility. Often down to 20% of the workflow. The system handles the rest.
The point isn't "use AI less" — it's that the model should handle the part where ambiguity is genuinely useful, not the part that's already measurable.
Self-check I run on any workflow that feels janky:
- What do I keep explaining to the model? → that's a skill
- What does the model keep judging by itself? → that's a gate
- If I removed the LLM, which parts of the workflow would still be clear? → those are real process