# mineru-pdf > Parse PDFs locally (CPU) into Markdown/JSON using MinerU. Assumes MinerU creates per‑doc output folders; supports table/image extraction. - Author: 0xturboblitz - Repository: sundial-org/awesome-openclaw-skills - Version: 20260201003509 - Stars: 38 - Forks: 0 - Last Updated: 2026-02-06 - Source: https://github.com/sundial-org/awesome-openclaw-skills - Web: https://mule.run/skillshub/@@sundial-org/awesome-openclaw-skills~mineru-pdf:20260201003509 --- --- name: mineru-pdf description: Parse PDFs locally (CPU) into Markdown/JSON using MinerU. Assumes MinerU creates per‑doc output folders; supports table/image extraction. --- # MinerU PDF ## Overview Parse a PDF locally with MinerU (CPU). Default output is Markdown + JSON. Use tables/images only when requested. ## Quick start (single PDF) ```bash # Run from the skill directory ./scripts/mineru_parse.sh /path/to/file.pdf ``` Optional examples: ```bash ./scripts/mineru_parse.sh /path/to/file.pdf --format json ./scripts/mineru_parse.sh /path/to/file.pdf --tables --images ``` ## When to read references If flags differ from your wrapper or you need advanced defaults (backend/method/device/threads/format mapping), read: - `references/mineru-cli.md` ## Output conventions - Output root defaults to `./mineru-output/`. - MinerU creates the per-document subfolder under the output root (e.g., `./mineru-output//...`). ## Batching Default is single-PDF parsing. Only implement batch folder parsing if explicitly requested.