Run the OTEL eval workflow end-to-end (export trace, run eval packs via run-skill-eval.sh, collect JSON/Markdown report locations, and summarize results). Use when asked to execute or verify evals, generate LLM-as-judge reports, or audit eval pipeline outputs.