Back to all

skill-ocr

by lumen183

00Feb 6, 2026Visit Source
Converts complex PDF documents into structured Markdown with semantic image extraction and layout analysis using PaddleOCR (PP-StructureV3). Use when you need to digitize PDFs while preserving: 1. Document hierarchy (headings, numbering, and sections). 2. Tables (automatically converted to clean Markdown tables). 3. Images (extracted, semantically renamed based on nearby titles/text, and referenced). 4. Reading order recovery (fixing multi-column or complex layouts). **CRITICAL**: This skill MUST be executed using its own internal virtual environment.