15.chromatin-state-inference/SKILL.md
This skill should be used when users need to infer chromatin states from histone modification ChIP-seq data using chromHMM. It provides workflows for chromatin state segmentation, model training, state annotation.
npx skillsauth add bisnake2001/chromskills chromatin-state-inferenceInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
This skill enables comprehensive chromatin state analysis using chromHMM for histone modification ChIP-seq data. ChromHMM uses a multivariate Hidden Markov Model to segment the genome into discrete chromatin states based on combinatorial patterns of histone modifications.
Main steps include:
Use this skill when you need to infer chromatin states from histone modification ChIP-seq data using chromHMM.
(1) Option 1: BED files of aligned reads
<mark1>.bed
<mark2>.bed
... # Other marks
(1) Option 2: BAM files of aligned reads
<mark1>.bam
<mark2>.bam
... # Other marks
chromhmm_output/
binarized/
*.txt
model/
*.txt
... # other files output by the ChromHMM
Call:
mcp__project-init-tools__project_initwith:
sample: alltask: chromhmmcellmarkfile (skip this step if signal files are provided)Prepare a .txt file (without header) containing following three columns:
example of the cellmark.txt file
cell1 mark1 cell1_mark2.bam cell1_control.bam
cell1 mark2 cell1_mark2.bam cell1/control.bam
For BAM inputs:
Call:
mcp__chromhmm-tools__binarize_bam
with:path_chrom_sized: Provide by user or detect from the working directoryinput_dir: Directory containing BAM filescellmarkfile: Cell mark file defining histone modificationsoutput_dir: (e.g. binarized/)bin_size: Provided by userFor BED inputs:
Call mcp__chromhmm-tools__binarize_bed instead.
For Signal inputs:
Call: mcp__chromhmm-tools__binarize_signal
with:
input_dir: Directory of signalsoutput_dir: (e.g. binarized/)Call
mcp__chromhmm-tools__learn_modelwith:
binarized_dir: Directory binarized file located innum_states: Provide by user (e.g. 15)output_model_dir: (e.g. model_15_states/)genome: Provide by user (e.g. hg38)threads: Provide by user (e.g. 16)development
Align ChIP-seq or ATAC-seq FASTQ files to a reference genome using Bowtie2, with strict input validation, library layout detection, output organization and logging. Use it when raw sequencing reads must be converted into sorted/indexed BAM files before downstream QC, peak calling, or footprinting.
development
Align bisulfite sequencing DNA methylation reads using Bismark only, with explicit validation of reference preparation, library layout detection, output organization, logging, and alignment QC. Use it for WGBS, RRBS, or other bisulfite-converted DNA methylation sequencing data when raw FASTQ files must be aligned before methylation extraction and downstream analysis.
data-ai
Perform peak calling for ChIP-seq or ATAC-seq data using MACS3, with intelligent parameter detection from user feedback. Use it when you want to call peaks for ChIP-seq data or ATAC-seq data.
devops
The TF-differential-binding pipeline performs differential transcription factor (TF) binding analysis from ChIP-seq datasets (TF peaks) using the DiffBind package in R. It identifies genomic regions where TF binding intensity significantly differs between experimental conditions (e.g., treatment vs. control, mutant vs. wild-type). Use the TF-differential-binding pipeline when you need to analyze the different function of the same TF across two or more biological conditions, cell types, or treatments using ChIP-seq data or TF binding peaks. This pipeline is ideal for studying regulatory mechanisms that underlie transcriptional differences or epigenetic responses to perturbations.