
steelmorgan/docx-convert

Name: docx-convert
Author: steelmorgan

framework_eng/skills/tool-usage/content-generation/docx-convert/SKILL.md

npx skillsauth add steelmorgan/1c-agent-based-dev-framework docx-convert


Word → Markdown Conversion

A thin wrapper around pandoc that converts .docx to GitHub Flavored Markdown (GFM) and extracts embedded images. It also post-processes the result: it fixes image paths and converts HTML tables (which pandoc emits as raw HTML when a table is too complex for pipe syntax) into Markdown pipe tables.
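The table post-processing step can be pictured with a minimal sketch. It is not the skill's html_tables_to_md.py: the names `TableParser` and `html_table_to_pipe` are hypothetical, and the sketch assumes a simple table with no merged cells or nested blocks, using only the standard-library `html.parser`.

```python
from html.parser import HTMLParser


class TableParser(HTMLParser):
    """Hypothetical helper: collect the cell text of one simple HTML table."""

    def __init__(self):
        super().__init__()
        self.rows = []     # finished rows, each a list of cell strings
        self._row = None   # row being built, or None outside <tr>
        self._cell = None  # text chunks of the current cell, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            # Collapse whitespace and escape pipes so the cell stays on one line
            text = " ".join("".join(self._cell).split()).replace("|", "\\|")
            self._row.append(text)
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None


def html_table_to_pipe(html: str) -> str:
    """Render a simple HTML table as a GFM pipe table (first row = header)."""
    parser = TableParser()
    parser.feed(html)
    parser.close()
    if not parser.rows:
        return ""
    width = max(len(row) for row in parser.rows)
    # Pad short rows so every line has the same number of columns
    rows = [row + [""] * (width - len(row)) for row in parser.rows]
    lines = ["| " + " | ".join(rows[0]) + " |",
             "|" + "|".join([" --- "] * width) + "|"]
    lines += ["| " + " | ".join(row) + " |" for row in rows[1:]]
    return "\n".join(lines)
```

Tables with rowspan/colspan or block content inside cells have no pipe-table equivalent, so a real post-processor would probably leave those as HTML.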

When to Use

| Situation | Action |
|-----------|--------|
| The customer sent a spec in .docx, and it needs to be put into the repository as md | docx2md.sh input.docx |
| Vendor documentation is in Word, and it needs to be fed to the agent | docx2md.sh input.docx output_dir |
| A document with complex tables and styles - pandoc skips them | mammoth (see below) |
| Only the text portion is needed, without images | pandoc input.docx --to=gfm -o out.md directly |

When NOT to Use

HTML is needed - pandoc --to=html, the script is not required.
PDF is needed - pandoc --to=pdf (LaTeX is required), the script is not required.
The document was created with WordArt/SmartArt/shapes - they are lost during conversion; this is a pandoc limitation.

Dependencies

pandoc ≥ 3.x - the main converter
python3 - post-processing (html_tables_to_md.py)
mammoth (python) - an optional alternative for complex tables
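A small pre-flight check for these dependencies might look like the sketch below. The function names are hypothetical; the only assumption about pandoc is that the first line of `pandoc --version` has the form `pandoc 3.1.9` (or `pandoc.exe 3.1.9` on Windows).

```python
import re
import shutil
import subprocess


def parse_pandoc_version(version_output: str) -> tuple:
    """Extract (major, minor, ...) from the first line of `pandoc --version`."""
    first_line = version_output.splitlines()[0] if version_output else ""
    match = re.match(r"pandoc(?:\.exe)?\s+(\d+(?:\.\d+)*)", first_line)
    if not match:
        raise ValueError(f"unrecognised version line: {first_line!r}")
    return tuple(int(part) for part in match.group(1).split("."))


def check_dependencies(min_major: int = 3) -> list:
    """Return a list of human-readable problems; an empty list means all good."""
    problems = []
    pandoc = shutil.which("pandoc")
    if pandoc is None:
        problems.append("pandoc not found on PATH")
    else:
        out = subprocess.run([pandoc, "--version"], capture_output=True,
                             text=True, check=True).stdout
        if parse_pandoc_version(out)[0] < min_major:
            problems.append(f"pandoc older than {min_major}.x")
    try:
        import mammoth  # noqa: F401  (optional dependency)
    except ImportError:
        problems.append("mammoth not installed (optional)")
    return problems
```

Calling `check_dependencies()` before a batch conversion gives one readable list of problems instead of a failure halfway through.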

Quick Start

# Output goes next to the file, in a directory with the same name minus the extension
bash framework/skills/tool-usage/content-generation/docx-convert/docx2md.sh "/path/to/file.docx"

# With an explicit output directory
bash framework/skills/tool-usage/content-generation/docx-convert/docx2md.sh "/path/to/file.docx" "/path/to/output"

Result:

output/document.md - text in GFM with pipe tables
output/images/ - all images from the document (png/jpeg/emf/wmf)
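The default output location described above (a directory next to the input, named after it without the extension) can be expressed with `pathlib`. This is a sketch of the naming rule only, not of docx2md.sh itself; `default_output_dir` and `expected_outputs` are hypothetical names.

```python
from pathlib import Path


def default_output_dir(docx_path: str) -> Path:
    """Directory next to the input file, named after it without the extension."""
    return Path(docx_path).with_suffix("")


def expected_outputs(docx_path: str, output_dir: str = None) -> dict:
    """Where the output directory and its images/ subdirectory should appear."""
    out = Path(output_dir) if output_dir else default_output_dir(docx_path)
    return {"dir": out, "images": out / "images"}
```

For `/path/to/file.docx` with no second argument this yields `/path/to/file` and `/path/to/file/images`.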

When used from a project where the framework is installed via symlinks, the script path is: .claude/skills/docx-convert/docx2md.sh.

Manual Commands (without the script)

Text only

pandoc input.docx --from=docx --to=gfm --wrap=none -o output.md

Text + images

pandoc input.docx --from=docx --to=gfm --wrap=none \
    --extract-media=./images \
    -o output.md
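When the same pandoc invocation has to be driven from Python (for example, over a directory of files), the two commands above can be built as argument lists. The helper names are hypothetical; the flags are exactly the ones shown above.

```python
import subprocess


def build_pandoc_cmd(docx, markdown_out, media_dir=None):
    """Build the pandoc argument list; media_dir=None means text only."""
    cmd = ["pandoc", str(docx), "--from=docx", "--to=gfm", "--wrap=none"]
    if media_dir is not None:
        cmd.append(f"--extract-media={media_dir}")
    cmd += ["-o", str(markdown_out)]
    return cmd


def convert(docx, markdown_out, media_dir=None):
    """Run pandoc and raise CalledProcessError if it exits non-zero."""
    subprocess.run(build_pandoc_cmd(docx, markdown_out, media_dir), check=True)
```

Passing a list rather than a shell string avoids quoting problems with paths that contain spaces.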

Via mammoth (for complex tables/styles)

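python3 -c "
import mammoth, pathlib
# Open in binary mode and close the file once conversion is done
with open('input.docx', 'rb') as docx_file:
    result = mammoth.convert_to_markdown(docx_file)
# result.value holds the Markdown; result.messages holds conversion warnings
pathlib.Path('output.md').write_text(result.value, encoding='utf-8')
"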

Anti-Patterns

Converting .doc (old format) - pandoc reads .docx but not the legacy binary .doc format. Resave the file as .docx in LibreOffice or Word first.
Expecting formulas to be preserved - OMML is converted to LaTeX only partially; complex formulas are better rebuilt manually.
Applying it to scans/PDFs - this is not a docx-convert task; use other tools for OCR.
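These cases can be caught before pandoc is called. The sketch below is an assumption-laden illustration, not part of the skill: `preflight` is a hypothetical name, and it relies on the fact that a .docx is a ZIP archive that normally contains `word/document.xml`.

```python
import zipfile
from pathlib import Path


def preflight(path: str) -> str:
    """Return 'ok' or a short reason the file should not be passed to pandoc."""
    source = Path(path)
    suffix = source.suffix.lower()
    if suffix == ".doc":
        return "legacy .doc: resave as .docx first"
    if suffix == ".pdf":
        return "pdf: not a docx-convert task"
    if suffix != ".docx":
        return f"unsupported extension: {suffix or '(none)'}"
    if not source.is_file():
        return "file not found"
    # A .docx is a ZIP archive; a renamed .doc fails this check
    if not zipfile.is_zipfile(source):
        return "not a ZIP container: probably a renamed .doc or a corrupt file"
    with zipfile.ZipFile(source) as archive:
        if "word/document.xml" not in archive.namelist():
            return "ZIP without word/document.xml: not a Word document"
    return "ok"
```

The check cannot detect scanned pages embedded as images inside a valid .docx; those still need OCR.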

Notes

Complex Word elements (WordArt, SmartArt, shapes) are lost - this is normal for pandoc conversion.
Embedded images are extracted correctly in png, jpeg, emf, wmf formats.
Post-processing (html_tables_to_md.py) handles only HTML tables and <img> tags left by pandoc; the rest of the HTML is kept as-is.
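The `<img>` handling mentioned in the last note could work roughly as follows. This is a sketch under stated assumptions, not the actual html_tables_to_md.py: the function name is hypothetical, attributes are assumed to be double-quoted, and images are assumed to end up flat in an `images/` directory next to the Markdown file.

```python
import re

IMG_TAG = re.compile(r"<img\b[^>]*?>", re.IGNORECASE)
ATTR = re.compile(r'(\w+)\s*=\s*"([^"]*)"')


def img_tags_to_markdown(text: str, media_prefix: str = "images/") -> str:
    """Replace <img> tags with Markdown images; other HTML is left as-is."""
    def replace(match):
        attrs = {k.lower(): v for k, v in ATTR.findall(match.group(0))}
        src = attrs.get("src")
        if src is None:
            return match.group(0)  # leave tags we cannot interpret untouched
        # Keep only the file name and point it at the media directory
        filename = src.replace("\\", "/").rsplit("/", 1)[-1]
        return f"![{attrs.get('alt', '')}]({media_prefix}{filename})"
    return IMG_TAG.sub(replace, text)
```

Size attributes such as `style="width:2in"` are dropped in this sketch, since plain GFM image syntax has no place for them.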


For converting DOCX to Markdown with images

83 stars

documentation

Updated Jul 4, 2026


npx skillsauth add steelmorgan/1c-agent-based-dev-framework docx-convert

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

| Scanner | Description | Status | Reported % |
|---------|-------------|--------|------------|
| Trivy | Container and dependency vulnerability scanner | Clean | 95% |
| Semgrep | Static code analysis for vulnerabilities | Clean | 95% |
| mcp-scan (Snyk) | Model Context Protocol security validation | Clean | 95% |
| Snyk (dep) | Open source security scanning | Skipped | 50% |
| Socket.dev | Supply chain security analysis | Skipped | 50% |
| VirusTotal | Multi-engine malware detection | Skipped | 50% |
| CrowdStrike | Advanced threat intelligence | Skipped | 50% |
| OSV-Scanner | Open Source Vulnerability database check | Skipped | 50% |
| OWASP Dep-Check | | Skipped | 50% |

Last scanned: Jul 4, 2026, 5:02 AM (130.4s, 3 files scanned)

SKILL.md

name: docx-convert
description: For converting DOCX to Markdown with images
capabilities: content-generation, document-conversion




Install manually


# Clone the repo
git clone https://github.com/steelmorgan/1c-agent-based-dev-framework.git

# Copy into the Claude Code skills folder (global), creating it if it does not exist
mkdir -p ~/.claude/skills
cp -r 1c-agent-based-dev-framework/framework_eng/skills/tool-usage/content-generation/docx-convert ~/.claude/skills/


Repository

steelmorgan/1c-agent-based-dev-framework


Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT