Back to all

evaluating-skills-with-models

by Taisuke Oe

20Feb 6, 2026Visit Source
Evaluate skills by executing them across sonnet, opus, and haiku models using sub-agents. Use when testing if a skill works correctly, comparing model performance, or finding the cheapest compatible model. Returns numeric scores (0-100) to differentiate model capabilities.