evaluating-skills-with-models

Name: evaluating-skills-with-models
Brand: MuleRun
Author: Taisuke Oe

by Taisuke Oe

20Feb 6, 2026Visit Source

Evaluate skills by executing them across sonnet, opus, and haiku models using sub-agents. Use when testing if a skill works correctly, comparing model performance, or finding the cheapest compatible model. Returns numeric scores (0-100) to differentiate model capabilities.