codex/skills/pdf-to-markdown/SKILL.md
Convert local PDF files or folders of PDFs into Markdown files using the bundled converter in this skill. Use this when the task is PDF-to-Markdown conversion inside the current workspace. Do not use it for OCR-heavy scanned PDFs, image extraction, or unrelated PDF summarization.
Install this skill globally with one command: `npx skillsauth add tkersey/dotfiles pdf-to-markdown`. Works with Claude Code, Cursor, and Windsurf.
Use this skill when the user wants one or more local PDF files converted into Markdown files in the current workspace.
This skill is fully self-contained. The conversion pipeline lives inside scripts/convert_pdf_to_markdown.py, and the PDF parser dependency is vendored under scripts/vendor/python/pypdf. The skill does not install packages, call npm, or fetch code at runtime.
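As a rough illustration of how a script can use a vendored dependency without installing anything, the sketch below prepends a vendor directory to `sys.path` before importing from it. The helper name `add_vendor_dir` is hypothetical, and this is only one common way to do it; the actual script may wire up `scripts/vendor/python` differently.

```python
import sys
from pathlib import Path


def add_vendor_dir(script_file: str, vendor_rel: str = "vendor/python") -> Path:
    """Put <script directory>/<vendor_rel> at the front of sys.path.

    Hypothetical helper: the real converter may locate its vendored
    packages in another way.
    """
    vendor_dir = Path(script_file).resolve().parent / vendor_rel
    vendor_str = str(vendor_dir)
    # Drop any existing entry so the vendored copy is always searched first.
    if vendor_str in sys.path:
        sys.path.remove(vendor_str)
    sys.path.insert(0, vendor_str)
    return vendor_dir


if __name__ == "__main__":
    # Inside a script this would typically be called with __file__.
    print(add_vendor_dir(__file__))
```

Placing the vendored directory first on the search path means a bundled package is preferred over any copy installed in the environment, which fits a skill that is meant to run without installing packages.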
The implementation is intentionally modeled on the structure-detection pipeline from jzillmann/pdf-to-markdown, but packaged as a repo-local Codex skill rather than a browser app. See references/upstream-notes.md for the design lineage and the main behavioral differences.
`python3 scripts/convert_pdf_to_markdown.py ...`

The script accepts either a single PDF file or a directory of PDFs.

Flags:

- `--input <path>` or `-i <path>`: required. Accepts a PDF file or directory.
- `--output <path>` or `-o <path>`: optional.
- `--recursive` or `-r`: recurse through nested folders when the input is a directory.
- `--overwrite` or `-f`: overwrite existing Markdown output files.
- `--help`

For a single file input:

- If `--output` is omitted, write a sibling Markdown file next to the PDF using the same base name.
- If `--output` points to a directory, write `<basename>.md` inside that directory.
- If `--output` looks like a `.md` file path, write exactly there.

For a directory input:

- If `--output` is omitted, write to a sibling directory named `<input>_markdown`.

`python3 scripts/convert_pdf_to_markdown.py --input <...> [--output <...>] [--recursive] [--overwrite]`

The bundled converter aims to recover document structure rather than only dumping plain text. It relies on heuristics to detect that structure.
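The output-path rules for file and directory inputs can be sketched in a few lines of Python. This is a minimal illustration, not the script's actual code: the function name `resolve_output` is hypothetical, and two behaviors are assumptions because the text does not state them (an explicit `--output` for a directory input is treated as the target directory, and a non-existent `--output` without a `.md` suffix is treated as a directory).

```python
from pathlib import Path
from typing import Optional


def resolve_output(input_path: Path, output: Optional[Path] = None) -> Path:
    """Return the Markdown file path (file input) or output directory (directory input)."""
    if input_path.is_dir():
        if output is None:
            # Sibling directory named <input>_markdown.
            return input_path.with_name(input_path.name + "_markdown")
        # Assumption: an explicit --output names the target directory.
        return output

    if output is None:
        # Sibling Markdown file with the same base name as the PDF.
        return input_path.with_suffix(".md")
    if output.is_dir():
        # Existing directory: write <basename>.md inside it.
        return output / (input_path.stem + ".md")
    if output.suffix.lower() == ".md":
        # Looks like a Markdown file path: write exactly there.
        return output
    # Assumption: anything else is a directory that does not exist yet.
    return output / (input_path.stem + ".md")
```

For example, `resolve_output(Path("report.pdf"))` yields `report.md` next to the PDF, while passing an existing directory as the second argument yields `<that directory>/report.md`.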
Read references/upstream-notes.md for the implementation rationale and references/THIRD_PARTY_NOTICES.md for vendored dependency notices.
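For readers who want to see how the documented flags fit together, here is a minimal `argparse` sketch that accepts the same options. It is an assumption about how such a command line could be declared, not a copy of the bundled script; `build_parser` and `find_pdfs` are hypothetical names, and the case-sensitive `*.pdf` match is a simplification.

```python
import argparse
from pathlib import Path
from typing import List


def build_parser() -> argparse.ArgumentParser:
    """Declare the flags described in this document (argparse adds --help itself)."""
    parser = argparse.ArgumentParser(
        prog="convert_pdf_to_markdown.py",
        description="Convert a PDF file or a directory of PDFs to Markdown.",
    )
    parser.add_argument("-i", "--input", type=Path, required=True,
                        help="PDF file or directory")
    parser.add_argument("-o", "--output", type=Path, default=None,
                        help="output file or directory (optional)")
    parser.add_argument("-r", "--recursive", action="store_true",
                        help="recurse through nested folders for directory input")
    parser.add_argument("-f", "--overwrite", action="store_true",
                        help="overwrite existing Markdown output files")
    return parser


def find_pdfs(root: Path, recursive: bool) -> List[Path]:
    """List PDFs directly under root, or in all nested folders when recursive."""
    pattern = "*.pdf"  # Simplification: does not match upper-case extensions.
    matches = root.rglob(pattern) if recursive else root.glob(pattern)
    return sorted(p for p in matches if p.is_file())


if __name__ == "__main__":
    args = build_parser().parse_args()
    if args.input.is_dir():
        for pdf in find_pdfs(args.input, args.recursive):
            print(pdf)
    else:
        print(args.input)
```

Using `store_true` for `--recursive` and `--overwrite` keeps both behaviors off unless explicitly requested, which matches the way the flags are described.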