Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

qzzqzzb/interproscan-domain-analysis

Name: interproscan-domain-analysis
Author: qzzqzzb

drclaw/agent_hub/templates/biochemistry/skills/interproscan-domain-analysis/SKILL.md

npx skillsauth add qzzqzzb/drclaw interproscan-domain-analysis

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

InterProScan Protein Domain Analysis

Usage

1. MCP Server Definition

Use the same BioInfoToolsClient class as defined in the protein-blast-search skill.

2. InterProScan Domain Analysis Workflow

This workflow analyzes protein sequences using InterProScan to identify functional domains, protein families, binding sites, and associated Gene Ontology annotations.

Workflow Steps:

Validate Sequence - Check protein sequence format and length
Run InterProScan - Identify domains using multiple signature databases
Extract Annotations - Parse domain locations, families, and GO terms

Implementation:

from datetime import timedelta

## Initialize client
client = BioInfoToolsClient(
    "https://scp.intern-ai.org.cn/api/v1/mcp/17/BioInfo-Tools",
    "<your-api-key>"
)

if not await client.connect():
    print("connection failed")
    exit()

## Input: Protein sequence to analyze
protein_sequence = """
MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFGKEFTPPVQAAYQKVVAGVANALAHKYH
"""

## Step 1 & 2: Run InterProScan analysis
result = await client.session.call_tool(
    "interproscan_analyze",
    arguments={
        "sequence": protein_sequence.strip(),
        "sequence_id": "HBB_HUMAN",        # Optional identifier
        "databases": ["Pfam"],              # Signature databases to use
        "goterms": True                     # Include GO term annotations
    },
    read_timeout_seconds=timedelta(seconds=900)  # Allow up to 15 minutes
)

## Step 3: Parse and display results
result_data = client.parse_result(result)

if result_data.get("success"):
    results = result_data.get("results", {})
    domains = results.get("domains", [])
    go_terms = results.get("go_terms", [])

    print(f"✅ InterProScan analysis completed successfully")
    print(f"Execution time: {result_data.get('time_seconds', '?')} seconds")
    print(f"Domains found: {len(domains)}")
    print(f"GO annotations: {len(go_terms)}\n")

    # Display domain information
    if domains:
        print("=== Functional Domains ===\n")
        for i, domain in enumerate(domains, 1):
            print(f"{i}. {domain.get('name', 'N/A')}")
            print(f"   Accession: {domain.get('accession', 'N/A')}")
            print(f"   Database: {domain.get('database', 'N/A')}")
            if domain.get('description'):
                print(f"   Description: {domain.get('description')}")

            # Display domain locations
            locations = domain.get('locations', [])
            if locations:
                print(f"   Locations:")
                for loc in locations:
                    print(f"     - Position {loc.get('start')}-{loc.get('end')} aa")
                    if loc.get('score'):
                        print(f"       Score: {loc.get('score')}")
            print()

    # Display GO annotations
    if go_terms:
        print("=== Gene Ontology Annotations ===\n")

        # Group by category
        by_category = {}
        for go in go_terms:
            category = go.get('category', 'UNKNOWN')
            if category not in by_category:
                by_category[category] = []
            by_category[category].append(go)

        for category, terms in by_category.items():
            print(f"{category}:")
            for go in terms:
                print(f"  - {go.get('id', 'N/A')}: {go.get('name', 'N/A')}")
            print()
else:
    print(f"❌ InterProScan analysis failed: {result_data.get('error', 'Unknown error')}")

await client.disconnect()

Tool Descriptions

BioInfo-Tools Server:

interproscan_analyze: Analyze protein sequence using InterProScan
- Args:
  - sequence (str): Protein sequence in amino acid single-letter code
  - sequence_id (str, optional): Identifier for the query sequence
  - databases (list, optional): Signature databases to query (default: ["Pfam"])
  - goterms (bool, optional): Include GO term annotations (default: True)
- Returns:
  - success (bool): Whether analysis completed successfully
  - results (dict): Analysis results containing domains and GO terms
  - time_seconds (float): Execution time

Input/Output

Input:

sequence: Protein sequence (amino acid single-letter code)
sequence_id: Optional identifier for the query
databases: List of signature databases (e.g., ["Pfam", "SMART", "PRINTS"])
goterms: Whether to include Gene Ontology annotations

Output:

domains: List of identified protein domains, each containing:
- name: Domain or family name
- accession: Database accession number
- database: Source database (e.g., "PFAM", "SMART")
- description: Functional description
- locations: List of domain positions in the sequence
  - start: Start position (amino acid number)
  - end: End position (amino acid number)
  - score: Match score (if available)
go_terms: List of GO annotations, each containing:
- id: GO identifier (e.g., "GO:0020037")
- name: GO term name
- category: GO category (MOLECULAR_FUNCTION, BIOLOGICAL_PROCESS, or CELLULAR_COMPONENT)

Available Signature Databases

InterProScan integrates multiple signature databases:

Pfam: Protein families based on HMMs
SMART: Simple Modular Architecture Research Tool
PRINTS: Protein fingerprints
ProSite: Protein domains, families, and functional sites
SUPERFAMILY: Structural and functional annotation
And more...

Default: ["Pfam"] for fastest results

Performance Notes

Typical execution time:
- Short sequences (~150 aa): 30-60 seconds
- Medium sequences (~400 aa): 2-4 minutes
- Long sequences (~800+ aa): 5-15 minutes
Timeout recommendation: Set to at least 900 seconds (15 minutes)
Multiple databases: Using more databases increases execution time but provides comprehensive annotation

Use Cases

Identify functional domains in novel protein sequences
Predict protein function from domain composition
Locate active sites and binding regions
Annotate protein families and superfamilies
Obtain GO term annotations for functional analysis
Compare domain architecture across homologous proteins

GO Term Categories

MOLECULAR_FUNCTION: Molecular-level activities (e.g., "heme binding", "catalytic activity")
BIOLOGICAL_PROCESS: Biological pathways and processes (e.g., "oxygen transport", "signal transduction")
CELLULAR_COMPONENT: Cellular locations (e.g., "cytoplasm", "membrane")

qzzqzzb/interproscan-domain-analysis

drclaw/agent_hub/templates/biochemistry/skills/interproscan-domain-analysis/SKILL.md

Analyze protein sequences using InterProScan to identify functional domains, protein families, and Gene Ontology (GO) annotations.

157 stars

development

Updated Apr 11, 2026

$ install --global

skillsauth

npx skillsauth add qzzqzzb/drclaw interproscan-domain-analysis

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 11, 2026, 8:44 PM11.8s1 file scanned

SKILL.md

name:: interproscan-domain-analysis
description:: Analyze protein sequences using InterProScan to identify functional domains, protein families, and Gene Ontology (GO) annotations.
license:: MIT license
skill-author:: PJLab

InterProScan Protein Domain Analysis

Usage

1. MCP Server Definition

Use the same BioInfoToolsClient class as defined in the protein-blast-search skill.

2. InterProScan Domain Analysis Workflow

This workflow analyzes protein sequences using InterProScan to identify functional domains, protein families, binding sites, and associated Gene Ontology annotations.

Workflow Steps:

Validate Sequence - Check protein sequence format and length
Run InterProScan - Identify domains using multiple signature databases
Extract Annotations - Parse domain locations, families, and GO terms

Implementation:

from datetime import timedelta

## Initialize client
client = BioInfoToolsClient(
    "https://scp.intern-ai.org.cn/api/v1/mcp/17/BioInfo-Tools",
    "<your-api-key>"
)

if not await client.connect():
    print("connection failed")
    exit()

## Input: Protein sequence to analyze
protein_sequence = """
MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFGKEFTPPVQAAYQKVVAGVANALAHKYH
"""

## Step 1 & 2: Run InterProScan analysis
result = await client.session.call_tool(
    "interproscan_analyze",
    arguments={
        "sequence": protein_sequence.strip(),
        "sequence_id": "HBB_HUMAN",        # Optional identifier
        "databases": ["Pfam"],              # Signature databases to use
        "goterms": True                     # Include GO term annotations
    },
    read_timeout_seconds=timedelta(seconds=900)  # Allow up to 15 minutes
)

## Step 3: Parse and display results
result_data = client.parse_result(result)

if result_data.get("success"):
    results = result_data.get("results", {})
    domains = results.get("domains", [])
    go_terms = results.get("go_terms", [])

    print(f"✅ InterProScan analysis completed successfully")
    print(f"Execution time: {result_data.get('time_seconds', '?')} seconds")
    print(f"Domains found: {len(domains)}")
    print(f"GO annotations: {len(go_terms)}\n")

    # Display domain information
    if domains:
        print("=== Functional Domains ===\n")
        for i, domain in enumerate(domains, 1):
            print(f"{i}. {domain.get('name', 'N/A')}")
            print(f"   Accession: {domain.get('accession', 'N/A')}")
            print(f"   Database: {domain.get('database', 'N/A')}")
            if domain.get('description'):
                print(f"   Description: {domain.get('description')}")

            # Display domain locations
            locations = domain.get('locations', [])
            if locations:
                print(f"   Locations:")
                for loc in locations:
                    print(f"     - Position {loc.get('start')}-{loc.get('end')} aa")
                    if loc.get('score'):
                        print(f"       Score: {loc.get('score')}")
            print()

    # Display GO annotations
    if go_terms:
        print("=== Gene Ontology Annotations ===\n")

        # Group by category
        by_category = {}
        for go in go_terms:
            category = go.get('category', 'UNKNOWN')
            if category not in by_category:
                by_category[category] = []
            by_category[category].append(go)

        for category, terms in by_category.items():
            print(f"{category}:")
            for go in terms:
                print(f"  - {go.get('id', 'N/A')}: {go.get('name', 'N/A')}")
            print()
else:
    print(f"❌ InterProScan analysis failed: {result_data.get('error', 'Unknown error')}")

await client.disconnect()

Tool Descriptions

BioInfo-Tools Server:

interproscan_analyze: Analyze protein sequence using InterProScan
- Args:
  - sequence (str): Protein sequence in amino acid single-letter code
  - sequence_id (str, optional): Identifier for the query sequence
  - databases (list, optional): Signature databases to query (default: ["Pfam"])
  - goterms (bool, optional): Include GO term annotations (default: True)
- Returns:
  - success (bool): Whether analysis completed successfully
  - results (dict): Analysis results containing domains and GO terms
  - time_seconds (float): Execution time

Input/Output

Input:

sequence: Protein sequence (amino acid single-letter code)
sequence_id: Optional identifier for the query
databases: List of signature databases (e.g., ["Pfam", "SMART", "PRINTS"])
goterms: Whether to include Gene Ontology annotations

Output:

domains: List of identified protein domains, each containing:
- name: Domain or family name
- accession: Database accession number
- database: Source database (e.g., "PFAM", "SMART")
- description: Functional description
- locations: List of domain positions in the sequence
  - start: Start position (amino acid number)
  - end: End position (amino acid number)
  - score: Match score (if available)
go_terms: List of GO annotations, each containing:
- id: GO identifier (e.g., "GO:0020037")
- name: GO term name
- category: GO category (MOLECULAR_FUNCTION, BIOLOGICAL_PROCESS, or CELLULAR_COMPONENT)

Available Signature Databases

InterProScan integrates multiple signature databases:

Pfam: Protein families based on HMMs
SMART: Simple Modular Architecture Research Tool
PRINTS: Protein fingerprints
ProSite: Protein domains, families, and functional sites
SUPERFAMILY: Structural and functional annotation
And more...

Default: ["Pfam"] for fastest results

Performance Notes

Typical execution time:
- Short sequences (~150 aa): 30-60 seconds
- Medium sequences (~400 aa): 2-4 minutes
- Long sequences (~800+ aa): 5-15 minutes
Timeout recommendation: Set to at least 900 seconds (15 minutes)
Multiple databases: Using more databases increases execution time but provides comprehensive annotation

Use Cases

Identify functional domains in novel protein sequences
Predict protein function from domain composition
Locate active sites and binding regions
Annotate protein families and superfamilies
Obtain GO term annotations for functional analysis
Compare domain architecture across homologous proteins

GO Term Categories

MOLECULAR_FUNCTION: Molecular-level activities (e.g., "heme binding", "catalytic activity")
BIOLOGICAL_PROCESS: Biological pathways and processes (e.g., "oxygen transport", "signal transduction")
CELLULAR_COMPONENT: Cellular locations (e.g., "cytoplasm", "membrane")

Related Skills

qzzqzzb/nsfc-budget

content-media

VerifiedTrustedCommunity

当用户明确要求“写/生成 NSFC 预算说明书”“写预算说明”“生成 budget.tex / budget.pdf”“写国自然预算 justification”时使用。基于用户标书正文或补充材料，输出一份可提交的预算说明书 LaTeX 项目并渲染 `budget.pdf`。若用户未指定工作目录，必须暂停并先要求其指定。⚠️ 不适用：用户只是想了解预算原则；用户仅要预算表数字而不写说明书；或用户是 2026 青年 A/B/C 默认包干制且无需预算说明书的场景。

157SKILL.mdUpdated Apr 11, 2026

qzzqzzb/nsfc-abstract

tools

VerifiedTrustedCommunity

当用户明确要求"写/润色 NSFC 标书摘要""生成中文摘要和英文摘要""把中文摘要翻译成英文摘要"时使用。输出中文、英文两个版本（英文必须是中文的忠实翻译版），同时输出标题建议（1个推荐标题+5个候选标题及理由）。中文摘要默认≤400字符，英文摘要默认≤4000字符。输出方式：将结果写入工作目录下的 `NSFC-ABSTRACTS.md`。⚠️ 不适用：用户只想翻译一段与标书无关的通用文本（应直接翻译）；用户只想写立项依据/研究内容/研究基础正文（应使用对应 nsfc 系列 skill）。

157SKILL.mdUpdated Apr 11, 2026

qzzqzzb/nsfc-abstract

qzzqzzb/guide-updater

documentation

VerifiedTrustedCommunity

当用户明确要求"更新项目指南""同步指南""沉淀洞见到指南"时使用。将对话中新产生的可复用写作洞见实时沉淀到项目指南文件，保持术语口径一致、结构稳定、可检验与可复现。调用时必须指定指南文件路径。

157SKILL.mdUpdated Apr 11, 2026

qzzqzzb/guide-updater

qzzqzzb/get-review-theme

content-media

VerifiedTrustedCommunity

当用户明确要求"从文件/图片/网页/描述中提取综述主题"或"生成主题+关键词+核心问题结构化输出"时使用。支持文件（PDF/Word/Markdown/Tex）、文件夹、图片、自然语言描述、网页 URL 等多种输入源，自动识别输入类型并提取内容，生成可直接用于 systematic-literature-review 及其他文献综述技能的结构化输出。

157SKILL.mdUpdated Apr 11, 2026

qzzqzzb/get-review-theme

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/qzzqzzb/drclaw.git

# Copy into Claude Code skills folder (global)
cp -r drclaw/drclaw/agent_hub/templates/biochemistry/skills/interproscan-domain-analysis ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

qzzqzzb/drclaw

157 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT