Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

qzzqzzb/comprehensive-protein-analysis

Name: comprehensive-protein-analysis
Author: qzzqzzb

drclaw/agent_hub/templates/biochemistry/skills/comprehensive-protein-analysis/SKILL.md

npx skillsauth add qzzqzzb/drclaw comprehensive-protein-analysis

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Comprehensive Protein Analysis

Usage

1. MCP Server Definition

Use the same BioInfoToolsClient class as defined in the protein-blast-search skill.

2. Comprehensive Protein Analysis Workflow

This workflow combines InterProScan domain analysis with BLAST similarity search to provide a complete functional and evolutionary annotation of a protein sequence.

Workflow Steps:

Validate Input - Check protein sequence format
Run InterProScan - Identify functional domains and GO terms
Run BLAST Search - Find similar sequences and homologs
Integrate Results - Combine domain and homology information for comprehensive annotation

Implementation:

from datetime import timedelta

## Initialize client
client = BioInfoToolsClient(
    "https://scp.intern-ai.org.cn/api/v1/mcp/17/BioInfo-Tools",
    "<your-api-key>"
)

if not await client.connect():
    print("connection failed")
    exit()

## Input: Protein sequence to analyze
protein_sequence = """
MALWMRLLPLLALLALWGPDPAAAFVNQHLCGSHLVEALYLVCGERGFFYTPKTRREAEDLQVGQVELGGGPGAGSLQPLALEGSLQKRGIVEQCCTSICSLYQLENYCN
"""

sequence_id = "INS_HUMAN"

## Step 1, 2 & 3: Run comprehensive analysis (InterProScan + BLAST)
result = await client.session.call_tool(
    "analyze_protein",
    arguments={
        "sequence": protein_sequence.strip(),
        "sequence_id": sequence_id,
        "databases": ["Pfam"],     # InterProScan databases
        "evalue": 1e-5,             # BLAST E-value threshold (more stringent)
        "max_hits": 10              # BLAST max hits
    },
    read_timeout_seconds=timedelta(seconds=1200)  # Allow up to 20 minutes
)

## Step 4: Parse and display comprehensive results
result_data = client.parse_result(result)

print(f"{'='*80}")
print(f"Comprehensive Protein Analysis: {sequence_id}")
print(f"{'='*80}\n")

# InterProScan Results
ips_result = result_data.get("interproscan", {})
if ips_result.get("success"):
    ips_data = ips_result.get("results", {})
    domains = ips_data.get('domains', [])
    go_terms = ips_data.get('go_terms', [])

    print("=== DOMAIN ANALYSIS (InterProScan) ===")
    print(f"Execution time: {ips_result.get('time_seconds', '?')} seconds")
    print(f"Domains found: {len(domains)}")
    print(f"GO annotations: {len(go_terms)}\n")

    if domains:
        print("Functional Domains:")
        for domain in domains:
            print(f"  • {domain.get('name', 'N/A')} ({domain.get('database', 'N/A')})")
            if domain.get('description'):
                print(f"    Description: {domain.get('description')}")
            locations = domain.get('locations', [])
            if locations:
                loc = locations[0]
                print(f"    Position: {loc.get('start')}-{loc.get('end')} aa")
        print()

    if go_terms:
        print("Gene Ontology Annotations:")
        for go in go_terms[:5]:  # Show top 5
            print(f"  • {go.get('id', 'N/A')}: {go.get('name', 'N/A')}")
            print(f"    Category: {go.get('category', 'N/A')}")
        if len(go_terms) > 5:
            print(f"  ... and {len(go_terms) - 5} more")
        print()
else:
    print(f"❌ InterProScan failed: {ips_result.get('error', 'Unknown')}\n")

# BLAST Results
blast_result = result_data.get("blast", {})
if blast_result.get("success"):
    hits = blast_result.get('hits', [])

    print("=== HOMOLOGY SEARCH (BLAST) ===")
    print(f"Execution time: {blast_result.get('time_seconds', '?')} seconds")
    print(f"Similar sequences found: {blast_result.get('total_hits', 0)}")
    print(f"E-value threshold: {1e-5}\n")

    if hits:
        print("Top Homologous Proteins:")
        for i, hit in enumerate(hits[:5], 1):
            print(f"  {i}. {hit['uniprot_id']} - {hit.get('organism', 'N/A')}")
            print(f"     Description: {hit['description']}")
            print(f"     Identity: {hit['identity_percent']:.1f}%, E-value: {hit['evalue']:.2e}")
        if len(hits) > 5:
            print(f"  ... and {len(hits) - 5} more matches")
        print()
    else:
        print("No significant homologs found (E-value threshold may be too stringent)\n")
else:
    print(f"❌ BLAST failed: {blast_result.get('error', 'Unknown')}\n")

# Summary
print("=== FUNCTIONAL SUMMARY ===")
if domains:
    print(f"Protein Family: {domains[0].get('name', 'Unknown')}")
if hits:
    most_similar = hits[0]
    print(f"Most Similar Protein: {most_similar['uniprot_id']} ({most_similar['identity_percent']:.1f}% identity)")
    print(f"Organism: {most_similar.get('organism', 'Unknown')}")
print(f"{'='*80}")

await client.disconnect()

Tool Descriptions

BioInfo-Tools Server:

analyze_protein: Comprehensive protein analysis combining InterProScan and BLAST
- Args:
  - sequence (str): Protein sequence in amino acid single-letter code
  - sequence_id (str, optional): Identifier for the query sequence
  - databases (list, optional): InterProScan databases (default: ["Pfam"])
  - evalue (float, optional): BLAST E-value threshold (default: 0.01)
  - max_hits (int, optional): Maximum BLAST hits (default: 10)
- Returns:
  - interproscan (dict): InterProScan analysis results
    - success (bool): Whether InterProScan completed
    - results (dict): Domains and GO terms
    - time_seconds (float): Execution time
  - blast (dict): BLAST search results
    - success (bool): Whether BLAST completed
    - hits (list): Similar proteins
    - total_hits (int): Number of matches
    - time_seconds (float): Execution time

Input/Output

Input:

sequence: Protein sequence (amino acid single-letter code)
sequence_id: Optional identifier for the query
databases: List of InterProScan databases to query
evalue: BLAST E-value threshold (lower = more stringent)
max_hits: Maximum number of BLAST hits to return

Output:

InterProScan Results:
- Functional domains with positions
- Protein family classifications
- Gene Ontology annotations
BLAST Results:
- Homologous proteins across species
- Sequence identity and alignment statistics
- Evolutionary relationships

Analysis Strategy

This comprehensive approach provides:

Structural Information (InterProScan):
- Domain architecture and organization
- Functional motifs and active sites
- Protein family membership
Evolutionary Context (BLAST):
- Homologs in other species
- Sequence conservation patterns
- Potential orthologs and paralogs
Functional Prediction:
- Combining domain and homology information
- GO term annotations for molecular function
- Biological process involvement

Performance Notes

Total execution time: 2-20 minutes depending on sequence length
- InterProScan: 30 seconds to 15 minutes
- BLAST: 10-90 seconds
- Both run sequentially in this workflow
Timeout recommendation: Set to at least 1200 seconds (20 minutes)
E-value tuning: Use lower E-values (e.g., 1e-10) for highly conserved proteins, higher (e.g., 0.01) for divergent families

Use Cases

Complete functional annotation of unknown proteins
Validate predicted protein functions
Study protein evolution and conservation
Identify potential drug targets
Annotate proteomes and genome sequences
Compare protein function across species

Interpretation Tips

High domain coverage + high homology: Well-characterized protein with known function
Domains but no homologs: Novel protein with conserved domains, function can be inferred from domains
Homologs but no domains: May need more sensitive domain detection or represents a novel fold
Neither domains nor homologs: Potentially novel protein, may require experimental characterization

qzzqzzb/comprehensive-protein-analysis

drclaw/agent_hub/templates/biochemistry/skills/comprehensive-protein-analysis/SKILL.md

Comprehensive protein analysis combining InterProScan domain identification with BLAST similarity search to provide complete functional and evolutionary annotation.

157 stars

data-ai

Updated Apr 11, 2026

$ install --global

skillsauth

npx skillsauth add qzzqzzb/drclaw comprehensive-protein-analysis

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 11, 2026, 8:44 PM12.3s1 file scanned

SKILL.md

name:: comprehensive-protein-analysis
description:: Comprehensive protein analysis combining InterProScan domain identification with BLAST similarity search to provide complete functional and evolutionary annotation.
license:: MIT license
skill-author:: PJLab

Comprehensive Protein Analysis

Usage

1. MCP Server Definition

Use the same BioInfoToolsClient class as defined in the protein-blast-search skill.

2. Comprehensive Protein Analysis Workflow

This workflow combines InterProScan domain analysis with BLAST similarity search to provide a complete functional and evolutionary annotation of a protein sequence.

Workflow Steps:

Validate Input - Check protein sequence format
Run InterProScan - Identify functional domains and GO terms
Run BLAST Search - Find similar sequences and homologs
Integrate Results - Combine domain and homology information for comprehensive annotation

Implementation:

from datetime import timedelta

## Initialize client
client = BioInfoToolsClient(
    "https://scp.intern-ai.org.cn/api/v1/mcp/17/BioInfo-Tools",
    "<your-api-key>"
)

if not await client.connect():
    print("connection failed")
    exit()

## Input: Protein sequence to analyze
protein_sequence = """
MALWMRLLPLLALLALWGPDPAAAFVNQHLCGSHLVEALYLVCGERGFFYTPKTRREAEDLQVGQVELGGGPGAGSLQPLALEGSLQKRGIVEQCCTSICSLYQLENYCN
"""

sequence_id = "INS_HUMAN"

## Step 1, 2 & 3: Run comprehensive analysis (InterProScan + BLAST)
result = await client.session.call_tool(
    "analyze_protein",
    arguments={
        "sequence": protein_sequence.strip(),
        "sequence_id": sequence_id,
        "databases": ["Pfam"],     # InterProScan databases
        "evalue": 1e-5,             # BLAST E-value threshold (more stringent)
        "max_hits": 10              # BLAST max hits
    },
    read_timeout_seconds=timedelta(seconds=1200)  # Allow up to 20 minutes
)

## Step 4: Parse and display comprehensive results
result_data = client.parse_result(result)

print(f"{'='*80}")
print(f"Comprehensive Protein Analysis: {sequence_id}")
print(f"{'='*80}\n")

# InterProScan Results
ips_result = result_data.get("interproscan", {})
if ips_result.get("success"):
    ips_data = ips_result.get("results", {})
    domains = ips_data.get('domains', [])
    go_terms = ips_data.get('go_terms', [])

    print("=== DOMAIN ANALYSIS (InterProScan) ===")
    print(f"Execution time: {ips_result.get('time_seconds', '?')} seconds")
    print(f"Domains found: {len(domains)}")
    print(f"GO annotations: {len(go_terms)}\n")

    if domains:
        print("Functional Domains:")
        for domain in domains:
            print(f"  • {domain.get('name', 'N/A')} ({domain.get('database', 'N/A')})")
            if domain.get('description'):
                print(f"    Description: {domain.get('description')}")
            locations = domain.get('locations', [])
            if locations:
                loc = locations[0]
                print(f"    Position: {loc.get('start')}-{loc.get('end')} aa")
        print()

    if go_terms:
        print("Gene Ontology Annotations:")
        for go in go_terms[:5]:  # Show top 5
            print(f"  • {go.get('id', 'N/A')}: {go.get('name', 'N/A')}")
            print(f"    Category: {go.get('category', 'N/A')}")
        if len(go_terms) > 5:
            print(f"  ... and {len(go_terms) - 5} more")
        print()
else:
    print(f"❌ InterProScan failed: {ips_result.get('error', 'Unknown')}\n")

# BLAST Results
blast_result = result_data.get("blast", {})
if blast_result.get("success"):
    hits = blast_result.get('hits', [])

    print("=== HOMOLOGY SEARCH (BLAST) ===")
    print(f"Execution time: {blast_result.get('time_seconds', '?')} seconds")
    print(f"Similar sequences found: {blast_result.get('total_hits', 0)}")
    print(f"E-value threshold: {1e-5}\n")

    if hits:
        print("Top Homologous Proteins:")
        for i, hit in enumerate(hits[:5], 1):
            print(f"  {i}. {hit['uniprot_id']} - {hit.get('organism', 'N/A')}")
            print(f"     Description: {hit['description']}")
            print(f"     Identity: {hit['identity_percent']:.1f}%, E-value: {hit['evalue']:.2e}")
        if len(hits) > 5:
            print(f"  ... and {len(hits) - 5} more matches")
        print()
    else:
        print("No significant homologs found (E-value threshold may be too stringent)\n")
else:
    print(f"❌ BLAST failed: {blast_result.get('error', 'Unknown')}\n")

# Summary
print("=== FUNCTIONAL SUMMARY ===")
if domains:
    print(f"Protein Family: {domains[0].get('name', 'Unknown')}")
if hits:
    most_similar = hits[0]
    print(f"Most Similar Protein: {most_similar['uniprot_id']} ({most_similar['identity_percent']:.1f}% identity)")
    print(f"Organism: {most_similar.get('organism', 'Unknown')}")
print(f"{'='*80}")

await client.disconnect()

Tool Descriptions

BioInfo-Tools Server:

analyze_protein: Comprehensive protein analysis combining InterProScan and BLAST
- Args:
  - sequence (str): Protein sequence in amino acid single-letter code
  - sequence_id (str, optional): Identifier for the query sequence
  - databases (list, optional): InterProScan databases (default: ["Pfam"])
  - evalue (float, optional): BLAST E-value threshold (default: 0.01)
  - max_hits (int, optional): Maximum BLAST hits (default: 10)
- Returns:
  - interproscan (dict): InterProScan analysis results
    - success (bool): Whether InterProScan completed
    - results (dict): Domains and GO terms
    - time_seconds (float): Execution time
  - blast (dict): BLAST search results
    - success (bool): Whether BLAST completed
    - hits (list): Similar proteins
    - total_hits (int): Number of matches
    - time_seconds (float): Execution time

Input/Output

Input:

sequence: Protein sequence (amino acid single-letter code)
sequence_id: Optional identifier for the query
databases: List of InterProScan databases to query
evalue: BLAST E-value threshold (lower = more stringent)
max_hits: Maximum number of BLAST hits to return

Output:

InterProScan Results:
- Functional domains with positions
- Protein family classifications
- Gene Ontology annotations
BLAST Results:
- Homologous proteins across species
- Sequence identity and alignment statistics
- Evolutionary relationships

Analysis Strategy

This comprehensive approach provides:

Structural Information (InterProScan):
- Domain architecture and organization
- Functional motifs and active sites
- Protein family membership
Evolutionary Context (BLAST):
- Homologs in other species
- Sequence conservation patterns
- Potential orthologs and paralogs
Functional Prediction:
- Combining domain and homology information
- GO term annotations for molecular function
- Biological process involvement

Performance Notes

Total execution time: 2-20 minutes depending on sequence length
- InterProScan: 30 seconds to 15 minutes
- BLAST: 10-90 seconds
- Both run sequentially in this workflow
Timeout recommendation: Set to at least 1200 seconds (20 minutes)
E-value tuning: Use lower E-values (e.g., 1e-10) for highly conserved proteins, higher (e.g., 0.01) for divergent families

Use Cases

Complete functional annotation of unknown proteins
Validate predicted protein functions
Study protein evolution and conservation
Identify potential drug targets
Annotate proteomes and genome sequences
Compare protein function across species

Interpretation Tips

High domain coverage + high homology: Well-characterized protein with known function
Domains but no homologs: Novel protein with conserved domains, function can be inferred from domains
Homologs but no domains: May need more sensitive domain detection or represents a novel fold
Neither domains nor homologs: Potentially novel protein, may require experimental characterization

Related Skills

qzzqzzb/nsfc-budget

content-media

VerifiedTrustedCommunity

当用户明确要求“写/生成 NSFC 预算说明书”“写预算说明”“生成 budget.tex / budget.pdf”“写国自然预算 justification”时使用。基于用户标书正文或补充材料，输出一份可提交的预算说明书 LaTeX 项目并渲染 `budget.pdf`。若用户未指定工作目录，必须暂停并先要求其指定。⚠️ 不适用：用户只是想了解预算原则；用户仅要预算表数字而不写说明书；或用户是 2026 青年 A/B/C 默认包干制且无需预算说明书的场景。

157SKILL.mdUpdated Apr 11, 2026

qzzqzzb/nsfc-abstract

tools

VerifiedTrustedCommunity

当用户明确要求"写/润色 NSFC 标书摘要""生成中文摘要和英文摘要""把中文摘要翻译成英文摘要"时使用。输出中文、英文两个版本（英文必须是中文的忠实翻译版），同时输出标题建议（1个推荐标题+5个候选标题及理由）。中文摘要默认≤400字符，英文摘要默认≤4000字符。输出方式：将结果写入工作目录下的 `NSFC-ABSTRACTS.md`。⚠️ 不适用：用户只想翻译一段与标书无关的通用文本（应直接翻译）；用户只想写立项依据/研究内容/研究基础正文（应使用对应 nsfc 系列 skill）。

157SKILL.mdUpdated Apr 11, 2026

qzzqzzb/nsfc-abstract

qzzqzzb/guide-updater

documentation

VerifiedTrustedCommunity

当用户明确要求"更新项目指南""同步指南""沉淀洞见到指南"时使用。将对话中新产生的可复用写作洞见实时沉淀到项目指南文件，保持术语口径一致、结构稳定、可检验与可复现。调用时必须指定指南文件路径。

157SKILL.mdUpdated Apr 11, 2026

qzzqzzb/guide-updater

qzzqzzb/get-review-theme

content-media

VerifiedTrustedCommunity

当用户明确要求"从文件/图片/网页/描述中提取综述主题"或"生成主题+关键词+核心问题结构化输出"时使用。支持文件（PDF/Word/Markdown/Tex）、文件夹、图片、自然语言描述、网页 URL 等多种输入源，自动识别输入类型并提取内容，生成可直接用于 systematic-literature-review 及其他文献综述技能的结构化输出。

157SKILL.mdUpdated Apr 11, 2026

qzzqzzb/get-review-theme

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/qzzqzzb/drclaw.git

# Copy into Claude Code skills folder (global)
cp -r drclaw/drclaw/agent_hub/templates/biochemistry/skills/comprehensive-protein-analysis ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

qzzqzzb/drclaw

157 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT