10_toolBased.genomic-feature-annotation/SKILL.md
This skill is used to perform genomic feature annotation and visualization for any file containing genomic region information using Homer (Hypergeometric Optimization of Motif EnRichment). It annotates regions such as promoters, exons, introns, intergenic regions, and TSS proximity, and generates visual summaries of feature distributions. ChIPseeker mode is also supported according to requirements.
npx skillsauth add bisnake2001/chromskills genomic-feature-annotationInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
annotatePeaks.pl.Genomic region formats supported:
genomic_feature_annotation/
results/
${sample}.anno_genomic_features.txt
${sample}.anno_genomic_features_stats.txt
logs/
${sample}.anno_genomic_features.log
plots/
${sample}.anno_genomic_features.pdf
Before calling any tool, ask the user:
sample): used as prefix and for the output directory ${sample}_genomic_feature_annotation.genome): e.g. hg38, mm10, danRer11.
Call:
mcp__project-init-tools__project_initwith:
sample: the user-provided sample nametask: de_novo_motif_discoveryThe tool will:
${sample}_genomic_feature_annotation directory.${sample}_genomic_feature_annotation directory, which will be used as ${proj_dir}.This step is optional. Only perform this step if the input file is a BED file. If the input file is a gene list, skip this step.
From 1 format to chr1 format
From MT format to chrM format
Call:
mcp__file-format-tools__standardize_bed_chrom_nameswith:
input_bed: the user-provided BED fileoutput_bed: the path to save the standardized BED fileThe tool will:
mcp__homer-tools__annotate_genomic_featuresWith:
sample: the user-provided sample nameproj_dir: directory to save the genomic feature annotation results. In this skill, it is the full path of the ${sample}_genomic_feature_annotation directory returned by mcp__project-init-tools__project_initregions_bed: the user-provided regions file in BED format. May end with .bed, .narrowPeak, .broadPeak, etc.genome: the user-provided genome assembly, e.g. hg38, mm10, danRer11ann: "custom homer annotation file (created by assignGenomeAnnotation.pl), (default: None).size_given: keep original region sizes (default: True)cpg: include CpG information (default: False)The tool will:
Annotate the genomic regions using Homer's annotatePeaks.pl.
Return the path of the annotated regions file under ${proj_dir}/results/ directory, and the path to the log file under ${proj_dir}/logs/ directory.
${proj_dir}/results/${sample}.anno_genomic_features.txt${proj_dir}/results/${sample}.anno_genomic_features_stats.txt${proj_dir}/logs/${sample}.anno_genomic_features.log(Option 2) ChIPseeker mode
library(ChIPseeker)
library(TxDb.Mmusculus.UCSC.mm10.knownGene) # ajust this depend on species
library(org.Mm.eg.db) # ajust this depend on species
txdb <- TxDb.Mmusculus.UCSC.mm10.knownGene # ajust this depend on species
peak_file <- "$sample.narrowPeak"
peak_anno <- annotatePeak(
peak_file,
TxDb = txdb,
tssRegion = c(-3000, 3000), # define "promoter" window around TSS
annoDb = "org.Mm.eg.db" # adds SYMBOL, GENENAME, etc.
)
pdf("plots/${sample}_anno_ChIPseeker.pdf", width = 6, height = 5)
plotAnnoPie(peak_anno)
plotAnnoBar(peak_anno)
plotDistToTSS(peak_anno)
dev.off()
Call:
mcp__plot-anno-tools__visualize_annotation_resultsWith:
sample: the user-provided sample nameproj_dir: directory to save the annotation results. In this skill, it is the full path of the ${sample}_genomic_feature_annotation directory returned by mcp__project-init-tools__project_initchart_type: Type of plot: 'pie' for pie chart, 'bar' for barplot. Default: 'pie'.The tool will:
${proj_dir}/plots/ directory, and ends with .pdf.Typical annotation categories:
Quality indicators:
development
Align ChIP-seq or ATAC-seq FASTQ files to a reference genome using Bowtie2, with strict input validation, library layout detection, output organization and logging. Use it when raw sequencing reads must be converted into sorted/indexed BAM files before downstream QC, peak calling, or footprinting.
development
Align bisulfite sequencing DNA methylation reads using Bismark only, with explicit validation of reference preparation, library layout detection, output organization, logging, and alignment QC. Use it for WGBS, RRBS, or other bisulfite-converted DNA methylation sequencing data when raw FASTQ files must be aligned before methylation extraction and downstream analysis.
data-ai
Perform peak calling for ChIP-seq or ATAC-seq data using MACS3, with intelligent parameter detection from user feedback. Use it when you want to call peaks for ChIP-seq data or ATAC-seq data.
devops
The TF-differential-binding pipeline performs differential transcription factor (TF) binding analysis from ChIP-seq datasets (TF peaks) using the DiffBind package in R. It identifies genomic regions where TF binding intensity significantly differs between experimental conditions (e.g., treatment vs. control, mutant vs. wild-type). Use the TF-differential-binding pipeline when you need to analyze the different function of the same TF across two or more biological conditions, cell types, or treatments using ChIP-seq data or TF binding peaks. This pipeline is ideal for studying regulatory mechanisms that underlie transcriptional differences or epigenetic responses to perturbations.