skills/codex/office-to-md/SKILL.md
Convert Office documents (Word, Excel, PowerPoint, PDF) to Markdown format. ONLY use this skill when the user explicitly requests to CONVERT, TRANSFORM or PARSE a specific office file into Markdown. Do NOT trigger for general questions, documentation reading, or discussions about files.
Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
npx skillsauth add shuyu-labs/webcode office-to-md
Convert various Office document formats to structured Markdown with text, table, and image extraction.
- enhanced_parser.py - Core document parser
- doc_converter.py - DOC to DOCX converter (requires LibreOffice)
- requirements.txt - Python dependencies

pip install -r requirements.txt
.doc format requires LibreOffice:
# Windows: Install LibreOffice from official website
# https://www.libreoffice.org/download/
# Linux
sudo apt install libreoffice
# Mac
brew install --cask libreoffice
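After installing, it can help to confirm that a LibreOffice launcher is actually reachable before attempting a .doc conversion. The following is a minimal sketch, not part of this skill: the helper name `find_libreoffice` is hypothetical, and the launcher names `soffice` and `libreoffice` are assumptions about a typical install. On Windows and macOS the launcher may exist without being on PATH.

```python
import shutil
from typing import Optional


def find_libreoffice() -> Optional[str]:
    """Return the path of the first LibreOffice launcher found on PATH, or None."""
    # "soffice" and "libreoffice" are the usual launcher names. This is an
    # assumption about the local install, not a check doc_converter.py is known to do.
    for name in ("soffice", "libreoffice"):
        path = shutil.which(name)
        if path:
            return path
    return None


if __name__ == "__main__":
    location = find_libreoffice()
    if location:
        print(f"LibreOffice found at {location}")
    else:
        print("LibreOffice not found on PATH; .doc conversion will likely fail")
```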
from enhanced_parser import EnhancedDocumentParser
# Initialize parser
parser = EnhancedDocumentParser(
    image_base_url="http://localhost:5000",
    image_save_dir="./static/images",
    filter_headers_footers=True,  # Filter headers and footers
)
# Parse document
result = parser.parse_document("document.docx")
if result["success"]:
    print(result["markdown"])
    print(f"Extracted {result['images_count']} images")
# Start service using app.py from project root
python app.py
# Visit http://localhost:5000/analyzer to upload files
| Format | Extensions | Notes |
|--------|-----------|-------|
| Word | .docx, .doc | .doc requires LibreOffice |
| Excel | .xlsx, .xls | Supports multiple worksheets and date formats |
| PowerPoint | .pptx | Extracts slide text and images |
| PDF | .pdf | Auto-detects tables and images |
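The supported-formats table can be expressed as a small lookup for checking a file before handing it to the parser. This is an illustrative sketch only: `SUPPORTED` and `describe_file` are hypothetical names, and the parser itself may detect formats differently.

```python
from pathlib import Path
from typing import Optional, Tuple

# Extension -> (format, needs LibreOffice), copied from the supported-formats table.
SUPPORTED = {
    ".docx": ("Word", False),
    ".doc": ("Word", True),
    ".xlsx": ("Excel", False),
    ".xls": ("Excel", False),
    ".pptx": ("PowerPoint", False),
    ".pdf": ("PDF", False),
}


def describe_file(path: str) -> Optional[Tuple[str, bool]]:
    """Return (format, needs_libreoffice) for a supported file, else None."""
    # Compare on the lowercased suffix so "REPORT.DOC" matches ".doc".
    return SUPPORTED.get(Path(path).suffix.lower())


if __name__ == "__main__":
    for name in ("report.DOC", "slides.pptx", "notes.txt"):
        print(name, "->", describe_file(name))
```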
# Test LibreOffice configuration
python doc_converter.py
parser = EnhancedDocumentParser(
    pdf_table_strategy="lines_strict"  # Default: strict line detection, fastest
    # "lines": Normal line detection
    # "text": Based on text position, more accurate but slower
)
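When the default strategy finds no tables in a PDF, one option is to retry with the other strategies in the order listed. The sketch below is an assumption-laden illustration: `first_strategy_with_tables` is a hypothetical helper, and it assumes the parse result reports a table count under `file_info["tables"]`, as in the sample response in this document. That field may not be populated for every file type.

```python
from typing import Callable, Dict, Optional

# Strategy names in the order they are listed for pdf_table_strategy.
STRATEGIES = ("lines_strict", "lines", "text")


def first_strategy_with_tables(parse: Callable[[str], Dict]) -> Optional[str]:
    """Try each strategy in order; return the first whose result reports tables."""
    for strategy in STRATEGIES:
        result = parse(strategy)
        # Assumes a "success" flag and file_info["tables"], as in the sample response.
        tables = result.get("file_info", {}).get("tables", 0)
        if result.get("success") and tables > 0:
            return strategy
    return None


if __name__ == "__main__":
    # Stand-in for a real parse call: pretends only "text" detects tables.
    def fake_parse(strategy: str) -> Dict:
        found = 2 if strategy == "text" else 0
        return {"success": True, "file_info": {"tables": found}}

    print(first_strategy_with_tables(fake_parse))
```

In real use, the `parse` callable would construct an `EnhancedDocumentParser` with the given `pdf_table_strategy` and call `parse_document` on the PDF.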
parser = EnhancedDocumentParser(
    image_base_url="https://your-domain.com",  # Image access URL
    image_save_dir="./static/images",  # Image save directory
)
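Judging by the sample response in this document, an image URL appears to be the base URL followed by the save directory and the generated filename. The sketch below reproduces that composition under that assumption. `build_image_url` is a hypothetical helper, not a function of `enhanced_parser.py`, and the parser's actual logic may differ.

```python
import posixpath


def build_image_url(base_url: str, save_dir: str, filename: str) -> str:
    """Join base URL, normalized save directory, and filename into one URL."""
    # Normalize "./static/images" to "static/images" and force forward slashes.
    rel_dir = posixpath.normpath(save_dir.replace("\\", "/")).lstrip("/")
    return f"{base_url.rstrip('/')}/{rel_dir}/{filename}"


if __name__ == "__main__":
    print(build_image_url("http://localhost:5000", "./static/images", "uuid.png"))
```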
{
  "success": true,
  "markdown": "# Document Title\n\nContent...",
  "images_count": 2,
  "images": [
    {
      "filename": "uuid.png",
      "url": "http://localhost:5000/static/images/uuid.png",
      "size": 12345
    }
  ],
  "file_type": "docx",
  "file_info": {
    "name": "document.docx",
    "size": 45678,
    "paragraphs": 50,
    "tables": 3
  }
}
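A caller will usually want to check `success` and then walk the `images` list. The following sketch shows one way to turn a result of the shape above into a short report. `summarize_result` is a hypothetical helper, and the shape of a failed result is not documented here, so the sketch only assumes that `success` is falsy in that case.

```python
def summarize_result(result: dict) -> str:
    """Build a short text report from a parse result shaped like the sample above."""
    # Only the "success" flag is assumed for failures; no error field is documented.
    if not result.get("success"):
        return "Conversion failed"
    info = result.get("file_info", {})
    lines = [
        f"{info.get('name', '?')} ({result.get('file_type', '?')}): "
        f"{result.get('images_count', 0)} image(s)"
    ]
    for image in result.get("images", []):
        lines.append(f"  {image['filename']} -> {image['url']} ({image['size']} bytes)")
    return "\n".join(lines)


if __name__ == "__main__":
    sample = {
        "success": True,
        "images_count": 2,
        "images": [
            {
                "filename": "uuid.png",
                "url": "http://localhost:5000/static/images/uuid.png",
                "size": 12345,
            }
        ],
        "file_type": "docx",
        "file_info": {"name": "document.docx", "size": 45678},
    }
    print(summarize_result(sample))
```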
Run python doc_converter.py to test the configuration.