drclaw/agent_hub/templates/biochemistry/skills/protein-blast-search/SKILL.md
Search for similar protein sequences in UniProt Swiss-Prot database using BLAST to identify homologous proteins and functional relationships.
npx skillsauth add qzzqzzb/drclaw protein-blast-searchInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
import asyncio
import json
from mcp.client.streamable_http import streamablehttp_client
from mcp import ClientSession
class BioInfoToolsClient:
"""BioInfo-Tools MCP Client"""
def __init__(self, server_url: str, api_key: str):
self.server_url = server_url
self.api_key = api_key
self.session = None
async def connect(self):
"""Establish connection and initialize session"""
print(f"server url: {self.server_url}")
try:
self.transport = streamablehttp_client(
url=self.server_url,
headers={"SCP-HUB-API-KEY": self.api_key}
)
self.read, self.write, self.get_session_id = await self.transport.__aenter__()
self.session_ctx = ClientSession(self.read, self.write)
self.session = await self.session_ctx.__aenter__()
await self.session.initialize()
session_id = self.get_session_id()
print(f"✓ connect success")
return True
except Exception as e:
print(f"✗ connect failure: {e}")
import traceback
traceback.print_exc()
return False
async def disconnect(self):
"""Disconnect from server"""
try:
if self.session:
await self.session_ctx.__aexit__(None, None, None)
if hasattr(self, 'transport'):
await self.transport.__aexit__(None, None, None)
print("✓ already disconnect")
except Exception as e:
print(f"✗ disconnect error: {e}")
def parse_result(self, result):
"""Parse MCP tool call result"""
try:
if hasattr(result, 'content') and result.content:
content = result.content[0]
if hasattr(content, 'text'):
return json.loads(content.text)
return str(result)
except Exception as e:
return {"error": f"parse error: {e}", "raw": str(result)}
This workflow searches for similar protein sequences in the UniProt Swiss-Prot database using BLAST, identifying homologous proteins and their functional relationships.
Workflow Steps:
Implementation:
from datetime import timedelta
## Initialize client
client = BioInfoToolsClient(
"https://scp.intern-ai.org.cn/api/v1/mcp/17/BioInfo-Tools",
"<your-api-key>"
)
if not await client.connect():
print("connection failed")
exit()
## Input: Protein sequence to search
protein_sequence = """
MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFGKEFTPPVQAAYQKVVAGVANALAHKYH
"""
## Step 1 & 2: Execute BLAST search against UniProt Swiss-Prot
result = await client.session.call_tool(
"blast_search",
arguments={
"sequence": protein_sequence.strip(),
"sequence_id": "HBB_HUMAN", # Optional identifier
"evalue": 0.01, # E-value threshold (default: 0.01)
"max_hits": 50 # Maximum number of hits to return
},
read_timeout_seconds=timedelta(seconds=300) # Allow up to 5 minutes
)
## Step 3: Parse and display results
result_data = client.parse_result(result)
if result_data.get("success"):
print(f"✅ BLAST search completed successfully")
print(f"Execution time: {result_data.get('time_seconds', '?')} seconds")
print(f"Total hits found: {result_data.get('total_hits', 0)}\n")
hits = result_data.get("hits", [])
# Display top matches
for i, hit in enumerate(hits[:10], 1):
print(f"{i}. {hit['uniprot_id']} - {hit.get('organism', 'N/A')}")
print(f" Description: {hit['description']}")
print(f" Identity: {hit['identity_percent']:.1f}%")
print(f" E-value: {hit['evalue']:.2e}")
print(f" Alignment length: {hit['alignment_length']} aa\n")
else:
print(f"❌ BLAST search failed: {result_data.get('error', 'Unknown error')}")
await client.disconnect()
BioInfo-Tools Server:
blast_search: Search for similar protein sequences in UniProt Swiss-Prot database
sequence (str): Protein sequence in amino acid single-letter codesequence_id (str, optional): Identifier for the query sequenceevalue (float, optional): E-value threshold (default: 0.01)max_hits (int, optional): Maximum number of hits to return (default: 50)success (bool): Whether search completed successfullytotal_hits (int): Number of matching sequences foundhits (list): List of matching proteins with detailstime_seconds (float): Execution timeInput:
sequence: Protein sequence (amino acid single-letter code)sequence_id: Optional identifier for the queryevalue: E-value threshold (lower = more stringent, default: 0.01)max_hits: Maximum number of results to return (default: 50)Output:
uniprot_id: UniProt accession numberdescription: Protein description and nameorganism: Species/organism nameidentity_percent: Sequence identity percentage (0-100)evalue: E-value (statistical significance, lower is better)alignment_length: Length of sequence alignmentquery_coverage: Percentage of query sequence coveredcontent-media
当用户明确要求“写/生成 NSFC 预算说明书”“写预算说明”“生成 budget.tex / budget.pdf”“写国自然预算 justification”时使用。基于用户标书正文或补充材料,输出一份可提交的预算说明书 LaTeX 项目并渲染 `budget.pdf`。若用户未指定工作目录,必须暂停并先要求其指定。⚠️ 不适用:用户只是想了解预算原则;用户仅要预算表数字而不写说明书;或用户是 2026 青年 A/B/C 默认包干制且无需预算说明书的场景。
tools
当用户明确要求"写/润色 NSFC 标书摘要""生成中文摘要和英文摘要""把中文摘要翻译成英文摘要"时使用。输出中文、英文两个版本(英文必须是中文的忠实翻译版),同时输出标题建议(1个推荐标题+5个候选标题及理由)。中文摘要默认≤400字符,英文摘要默认≤4000字符。输出方式:将结果写入工作目录下的 `NSFC-ABSTRACTS.md`。⚠️ 不适用:用户只想翻译一段与标书无关的通用文本(应直接翻译);用户只想写立项依据/研究内容/研究基础正文(应使用对应 nsfc 系列 skill)。
documentation
当用户明确要求"更新项目指南""同步指南""沉淀洞见到指南"时使用。将对话中新产生的可复用写作洞见实时沉淀到项目指南文件,保持术语口径一致、结构稳定、可检验与可复现。调用时必须指定指南文件路径。
content-media
当用户明确要求"从文件/图片/网页/描述中提取综述主题"或"生成主题+关键词+核心问题结构化输出"时使用。支持文件(PDF/Word/Markdown/Tex)、文件夹、图片、自然语言描述、网页 URL 等多种输入源,自动识别输入类型并提取内容,生成可直接用于 systematic-literature-review 及其他文献综述技能的结构化输出。