Back to all

evals

by Dazza Greenwood

00Feb 10, 2026Visit Source
Run the OTEL eval workflow end-to-end (export trace, run eval packs via run-skill-eval.sh, collect JSON/Markdown report locations, and summarize results). Use when asked to execute or verify evals, generate LLM-as-judge reports, or audit eval pipeline outputs.