Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

edwinhu/research

Name: research
Author: edwinhu

skills/research/SKILL.md

npx skillsauth add edwinhu/workflows research

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Academic Literature Search

Multi-source academic search with deduplication, DOI resolution, and journal filtering.

Always read ${CLAUDE_SKILL_DIR}/../google-scholar/domain-knowledge.local.md before presenting results.

IRON LAW: Always Use the Script

NEVER run the sources manually in sequence. ALWAYS use the research script. This is not negotiable.

uv run python3 "${CLAUDE_SKILL_DIR}/scripts/research.py" "<query>" [--n 50] [--min-citations N]

The script parallelizes all sources and DOI resolution automatically. Doing it manually serializes everything and triples wall time.

Sources

| Source | Tool | Strength | Default | |--------|------|----------|---------| | scholar lookup | Keyword/citation-ranked | Finance classics, foundational papers | ✅ | | consensus CLI | Empirical corpus, sorted by citations | Accounting/finance empirical literature | ✅ | | Paperpile bib | Personal library (My Library.bib) | Papers already in your collection | ✅ | | scholar search | NL semantic | Law reviews, conceptual literature | opt-in (--scholar-search) |

scholar search is opt-in because it shares rate limits with scholar lookup and 429s when run in parallel. Add --scholar-search when you specifically want semantic/NL results.

Output Schema

The script outputs a JSON array. Each paper has:

{
  "title": "...",
  "authors": ["..."],
  "year": 2023,
  "journal": "...",           // original journal label (may be SSRN)
  "journal_resolved": "...",  // CrossRef-resolved journal (present if SSRN label was resolved)
  "doi": "...",
  "citations": 150,
  "takeaway": "...",
  "url": "...",
  "sources": ["lookup", "consensus"]  // all sources that returned this paper
}

LLM Review Step (After Script)

After running the script, read ${CLAUDE_SKILL_DIR}/../google-scholar/domain-knowledge.local.md and cross-reference each paper's effective journal (use journal_resolved if present, else journal) against the trusted list:

★ = journal matches trusted list
Papers in sources: ["lookup", "consensus"] (multiple sources) = higher confidence
Papers from bib source = already in user's library (flag with 📚)

Presentation Format

★ [Title](url) — Authors (Year), *Journal*, N citations  [sources]
  > Takeaway: ...

📚 ★ [Title](url) — Authors (Year), *Journal*  [in your library]
  > Takeaway: ...

Trusted papers first (sorted by citations desc), then non-trusted in a collapsed table.

Red Flags

About to run the sources manually in sequence → STOP. That serializes the work and triples wall time; run uv run python3 research.py "<query>".
About to call mcp__consensus__search → STOP. It is rate-limited to 3 results; the script uses the CLI binary automatically.
About to present results before reading domain-knowledge.local.md → STOP. The ★ trusted-journal signals come from that file; read it first, always.
About to use the journal field when journal_resolved is present → STOP. The SSRN label hides the real venue; always prefer journal_resolved.

Common Patterns

# Standard search
uv run python3 "${CLAUDE_SKILL_DIR}/scripts/research.py" "mandatory disclosure"

# With citation floor
uv run python3 "${CLAUDE_SKILL_DIR}/scripts/research.py" "poison pill" --min-citations 50

# More results from Consensus
uv run python3 "${CLAUDE_SKILL_DIR}/scripts/research.py" "corporate governance" --n 100

# Disable streaming (wait for all sources, output pretty-printed JSON)
uv run python3 "${CLAUDE_SKILL_DIR}/scripts/research.py" "mandatory disclosure" --no-stream

Streaming Mode (default)

Without --stream, the script waits for all four sources before emitting anything — Consensus takes ~60s, so fast sources (bib <1s, Scholar ~10s) sit idle.

With --stream, the script emits one NDJSON line per event as it happens:

{"event": "source", "source": "bib", "papers": [...]}
{"event": "source", "source": "scholar-lookup", "papers": [...]}
{"event": "source", "source": "scholar-search", "papers": [...]}
{"event": "source", "source": "consensus", "papers": [...]}
{"event": "final", "papers": [...]}

source events: raw papers from each source as it completes (may have duplicates across sources)
final event: deduplicated + CrossRef-resolved unified set

Process source events as they arrive to present early results; use final for the complete deduplicated list. Pass --no-stream for batch mode (pretty-printed JSON after all sources complete).

edwinhu/research

skills/research/SKILL.md

This skill should be used when the user asks to "find papers", "search academic literature", "find citations", "literature search", "find research on", "what does the literature say about", or any request to search for academic papers across multiple sources.

16 stars

testing

Updated Jun 10, 2026

$ install --global

skillsauth

npx skillsauth add edwinhu/workflows research

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Jun 10, 2026, 3:25 AM20.2s2 files scanned

SKILL.md

name:: research
description:: This skill should be used when the user asks to "find papers", "search academic literature", "find citations", "literature search", "find research on", "what does the literature say about", or any request to search for academic papers across multiple sources.
version:: 0.2.0
user-invocable:: false

Academic Literature Search

Multi-source academic search with deduplication, DOI resolution, and journal filtering.

Always read ${CLAUDE_SKILL_DIR}/../google-scholar/domain-knowledge.local.md before presenting results.

IRON LAW: Always Use the Script

NEVER run the sources manually in sequence. ALWAYS use the research script. This is not negotiable.

uv run python3 "${CLAUDE_SKILL_DIR}/scripts/research.py" "<query>" [--n 50] [--min-citations N]

The script parallelizes all sources and DOI resolution automatically. Doing it manually serializes everything and triples wall time.

Sources

scholar search is opt-in because it shares rate limits with scholar lookup and 429s when run in parallel. Add --scholar-search when you specifically want semantic/NL results.

Output Schema

The script outputs a JSON array. Each paper has:

{
  "title": "...",
  "authors": ["..."],
  "year": 2023,
  "journal": "...",           // original journal label (may be SSRN)
  "journal_resolved": "...",  // CrossRef-resolved journal (present if SSRN label was resolved)
  "doi": "...",
  "citations": 150,
  "takeaway": "...",
  "url": "...",
  "sources": ["lookup", "consensus"]  // all sources that returned this paper
}

LLM Review Step (After Script)

★ = journal matches trusted list
Papers in sources: ["lookup", "consensus"] (multiple sources) = higher confidence
Papers from bib source = already in user's library (flag with 📚)

Presentation Format

★ [Title](url) — Authors (Year), *Journal*, N citations  [sources]
  > Takeaway: ...

📚 ★ [Title](url) — Authors (Year), *Journal*  [in your library]
  > Takeaway: ...

Trusted papers first (sorted by citations desc), then non-trusted in a collapsed table.

Red Flags

About to run the sources manually in sequence → STOP. That serializes the work and triples wall time; run uv run python3 research.py "<query>".
About to call mcp__consensus__search → STOP. It is rate-limited to 3 results; the script uses the CLI binary automatically.
About to present results before reading domain-knowledge.local.md → STOP. The ★ trusted-journal signals come from that file; read it first, always.
About to use the journal field when journal_resolved is present → STOP. The SSRN label hides the real venue; always prefer journal_resolved.

Common Patterns

# Standard search
uv run python3 "${CLAUDE_SKILL_DIR}/scripts/research.py" "mandatory disclosure"

# With citation floor
uv run python3 "${CLAUDE_SKILL_DIR}/scripts/research.py" "poison pill" --min-citations 50

# More results from Consensus
uv run python3 "${CLAUDE_SKILL_DIR}/scripts/research.py" "corporate governance" --n 100

# Disable streaming (wait for all sources, output pretty-printed JSON)
uv run python3 "${CLAUDE_SKILL_DIR}/scripts/research.py" "mandatory disclosure" --no-stream

Streaming Mode (default)

Without --stream, the script waits for all four sources before emitting anything — Consensus takes ~60s, so fast sources (bib <1s, Scholar ~10s) sit idle.

With --stream, the script emits one NDJSON line per event as it happens:

{"event": "source", "source": "bib", "papers": [...]}
{"event": "source", "source": "scholar-lookup", "papers": [...]}
{"event": "source", "source": "scholar-search", "papers": [...]}
{"event": "source", "source": "consensus", "papers": [...]}
{"event": "final", "papers": [...]}

source events: raw papers from each source as it completes (may have duplicates across sources)
final event: deduplicated + CrossRef-resolved unified set

Process source events as they arrive to present early results; use final for the complete deduplicated list. Pass --no-stream for batch mode (pretty-printed JSON after all sources complete).

Related Skills

edwinhu/npx-ownership-panel

development

VerifiedTrustedCommunity

Build the meeting-level proxy-voting × ownership panel on the WRDS SGE grid — ISS N-PX fund votes reduced to (item × block) direction cells, joined to institutional and mutual-fund ownership. Use when working with risk.voteanalysis_npx, N-PX fund-level votes, ISS→CRSP fund linking, index/passive/active voting blocks, or a proxy-voting panel that needs ownership attached.

17SKILL.mdUpdated Jul 28, 2026

edwinhu/npx-ownership-panel

edwinhu/crsp-v2

development

VerifiedTrustedCommunity

Use when "CRSP CIZ", "CRSP v2", "CRSP flat file format 2.0", "crsp.dsf_v2 / msf_v2", "StkDlySecurityData", "StkMthSecurityData", "StkSecurityInfoHist", "stocknames_v2", "DlyRet / MthRet / DlyPrc / MthPrc", "SHRCD or EXCHCD equivalent in new CRSP", "SIZ to CIZ migration", "CRSP data after 2024", "CRSP delisting returns", "CRSP cumulative adjustment factors", "CRSP index INDNO / INDFAM", or any CRSP stock/index query where the legacy SIZ column names no longer exist.

17SKILL.mdUpdated Jul 28, 2026

edwinhu/fuzzy-name-matching

development

VerifiedTrustedCommunity

Use when linking or deduping datasets by entity name rather than a shared key — 'fuzzy match', 'fuzzy name matching', 'entity resolution', 'record linkage', 'match company/person names', 'dedupe entity names', 'name-based join', 'bridge identifiers' (CIK ↔ permno ↔ gvkey ↔ wficn ↔ EIN ↔ personid), or any use of char n-gram TF-IDF, cosine similarity on names, `sparse_dot_topn`, or RapidFuzz at scale.

17SKILL.mdUpdated Jul 23, 2026

edwinhu/fuzzy-name-matching

edwinhu/ds-tables

development

VerifiedTrustedCommunity

Use when building a publication-quality table in Python — 'regression table', 'results table', 'summary statistics table', 'etable', 'coefplot', 'great_tables', 'GT', 'gt table', 'format a table for the paper', 'export table to LaTeX/HTML', significance stars, spanners, or column formatting for a table headed into a paper, slide deck, or notebook.

17SKILL.mdUpdated Jul 23, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/edwinhu/workflows.git

# Copy into Claude Code skills folder (global)
cp -r workflows/skills/research ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

edwinhu/workflows

16 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT