Back to all

evaluation-design

by MB9012

00Feb 7, 2026Visit Source
Use this skill when the user needs to define evaluation metrics, select datasets, or design grading/annotation strategies for agent optimization. Provides a structured, decision-driven workflow and reusable templates.