run-benchmark

by ZaikoXeas

Feb 6, 2026
Run an MCP evaluation using mcpbr on SWE-bench or other datasets.