skills/pdf-to-markdown/SKILL.md
Extracting text and tables, merging/splitting documents. Use when needing to convert PDFs while preserving structure.
npx skillsauth add cardoso-neto/personal-ai-infra pdf-to-markdownInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
# pip install marker-pdf # python==3.12
marker_single input.pdf --output_dir ./marker-output
marker_single input.pdf --output_dir ./out --page_range "0,5-10" # specific pages
marker_single input.pdf --output_dir ./out --force_ocr # for scanned PDFs
OUTPUT_IMAGE_FORMAT=PNG marker_single input.pdf --output_dir ./out # change image format to PNG
marker_single input.pdf --output_dir ./out --use_llm \
--llm_service marker.services.openai.OpenAIService \
--openai_api_key "$OPENAI_API_KEY" \
--openai_model gpt-5.2
Output:
marker_output/<filename>/<filename>.mdmarker_output/<filename>/_page_<N>_Figure_<M>.jpegmarker_single --help for all available options.python convert.py input.pdf [output.md]ConfigParser.development
Always use this skill when writing or editing webscrapers!
testing
Use this skill when working with software installed or run from local source checkouts that use git worktree hubs.
testing
Always use this skill when writing or editing software tests!
development
Always use this skill when writing or editing python code!