MuleRun
evaluation-design
by MB9012
Feb 7, 2026
Use this skill when the user needs to define evaluation metrics, select datasets, or design grading/annotation strategies for agent optimization. Provides a structured, decision-driven workflow and reusable templates.