Tag: bioinformatics

Type: All Skills Tools
skill ★ 34

Annotating Genetic Variants with ENCODE Data

Annotate genetic variants with ENCODE functional genomics data to identify causal regulatory elements and link them to target genes. The skill supports the full post-GWAS workflow, including tissue-specific mapping, enrichment testing, and …

ammawla/encode-toolkit genomics variant-annotation encode gwas
skill ★ 34

Manage and track ENCODE experiments locally

This skill enables local management of ENCODE experiments by tracking metadata, publications, and data provenance in a SQLite database. It supports experiment comparison, citation export in multiple formats, and the logging of derived files…

ammawla/encode-toolkit encode-project experiment-tracking data-provenance bioinformatics
skill ★ 34

Analyzing ENCODE single-cell genomics data

This skill guides the retrieval and analysis of single-cell genomics data (scRNA-seq and scATAC-seq) from the ENCODE project. It provides detailed protocols for data structure, quality assessment, and integrating single-cell profiles with b…

ammawla/encode-toolkit single-cell genomics encode scrna-seq
skill ★ 34

ENCODE Toolkit Setup and Configuration

Provides instructions for installing, configuring, and verifying the ENCODE Toolkit MCP server. It covers installation via uvx or pip, managing credentials, and testing connections using metadata and search queries.

ammawla/encode-toolkit mcp genomics bioinformatics server-setup
skill ★ 34

ENCODE Hi-C Data Processing Pipeline

This Nextflow pipeline processes Hi-C FASTQ files to generate multi-resolution contact matrices and chromatin loop calls. It integrates BWA, pairtools, Juicer, and cooler for end-to-end chromatin conformation analysis.

ammawla/encode-toolkit hi-c nextflow genomics bioinformatics
skill ★ 34

ENCODE pipeline workflow generation and management

This skill facilitates the generation and execution of standardized ENCODE bioinformatics pipelines, supporting custom Nextflow or WDL workflows. It manages compute resource requirements and deployment across local, HPC, and major cloud pla…

ammawla/encode-toolkit encode-pipelines bioinformatics nextflow workflow-generation
skill ★ 34

ENCODE Peak Annotation and Functional Enrichment

Annotate ENCODE genomic peaks with regulatory features and nearby genes using ChIPseeker and GREAT. The workflow enables genomic feature distribution analysis and functional enrichment via clusterProfiler.

ammawla/encode-toolkit genomics peak-annotation chip-seq bioinformatics
skill ★ 34

ENCODE Multi-Omics Data Integration

Integrates multiple ENCODE data types, including RNA-seq, ATAC-seq, and ChIP-seq, to construct a comprehensive regulatory landscape for specific tissues or cell types. It enables chromatin state annotation, enhancer-gene linkage, and the ch…

ammawla/encode-toolkit multi-omics encode epigenomics bioinformatics
skill ★ 34

Aggregate DNA Methylation Data Across Studies

Construct comprehensive tissue-level DNA methylation landscapes by aggregating WGBS data from multiple ENCODE experiments. The process includes quality-gating, coverage filtering, and the identification of hypomethylated regions and partial…

ammawla/encode-toolkit dna-methylation wgbs encode epigenomics
skill ★ 34

Genomic coordinate liftover and assembly conversion guide

This skill provides a comprehensive workflow for safely converting genomic coordinates between different assembly versions (e.g., hg19 to hg38). It guides the use of industry-standard tools like UCSC liftOver and CrossMap, ensuring full pro…

ammawla/encode-toolkit genomic-coordinates liftover assembly-conversion bioinformatics
skill ★ 34

ENCODE Multi-omic Data Integration

Provides a framework for planning and executing integrative analyses across multiple ENCODE experiments, including multi-omic and cross-sample workflows. It assists with compatibility verification, integration strategy selection, and the re…

ammawla/encode-toolkit encode multi-omics genomics bioinformatics
skill ★ 34

NCBI GEO and ENCODE Connector

Facilitates searching, querying, and cross-referencing NCBI GEO datasets with ENCODE experiments to identify complementary epigenomic data. It supports retrieving metadata, series matrices, and supplementary files via E-utilities and FTP.

ammawla/encode-toolkit ncbi-geo encode bioinformatics genomics
skill ★ 34

Comprehensive Epigenomic Profiling with ENCODE

Assemble comprehensive epigenomic profiles for specific tissues or cell types by systematically gathering histone modifications, chromatin accessibility, and DNA methylation data from ENCODE. It enables the characterisation of chromatin sta…

ammawla/encode-toolkit epigenomics encode chromatin-profiling bioinformatics
skill ★ 34

Downloading ENCODE Genomics Data Files

Facilitates the automated retrieval and organisation of ENCODE genomics files, such as BED, FASTQ, and BAM, to a local filesystem. Supports batch processing with MD5 verification and configurable directory structures based on experiment or …

ammawla/encode-toolkit genomics bioinformatics data-retrieval encode-project
skill ★ 34

Cross-reference ENCODE with scientific databases

Link ENCODE genomic data to external scientific databases such as PubMed, ClinicalTrials.gov, and Open Targets. This skill facilitates building translational pipelines by connecting regulatory elements to clinical research and variant annot…

ammawla/encode-toolkit encode genomics bioinformatics data-integration
skill ★ 34

Comprehensive Genomics Provenance Tracking

This skill establishes a rigorous audit trail for genomic analyses, logging every operation—including tool versions, parameters, and environment details—to ensure absolute reproducibility. It structures the data to auto-generate publication…

ammawla/encode-toolkit genomics provenance reproducibility data-logging
skill ★ 34

Annotating ENCODE Regulatory Variants with ClinVar

Cross-reference ENCODE functional genomic elements with ClinVar clinical variant classifications to identify pathogenic variants in regulatory regions. This skill enables both forward annotation of ENCODE peaks and reverse analysis of ClinV…

ammawla/encode-toolkit encode clinvar genomics variant-annotation
skill ★ 34

Setup reproducible bioinformatics environments for ENCODE

This skill provisions fully reproducible, version-pinned conda environments and associated scripts for comprehensive ENCODE data analysis. It manages dependencies across multiple modalities (RNA-seq, ChIP-seq, ATAC-seq) using tools like STA…

ammawla/encode-toolkit bioinformatics encode conda r-packages
skill ★ 34

ENCODE experiment tracking and provenance management

Track and manage local collections of ENCODE experiments, including metadata, publications, and data provenance. It supports experiment comparison, citation management, and exporting datasets to CSV, TSV, or JSON formats.

ammawla/encode-toolkit encode-experiments data-provenance metadata-management bioinformatics
skill ★ 34

Searching and Exploring ENCODE Genomics Data

Provides a structured strategy for searching and exploring ENCODE Project genomics data using facets and metadata. It facilitates the discovery of experiments, files, and specific biological parameters like assays, organs, and cell lines.

ammawla/encode-toolkit genomics bioinformatics encode-project data-retrieval
skill ★ 34

Characterise Regulatory Elements with ENCODE Data

Identify and characterise candidate cis-regulatory elements using ENCODE datasets and the cCRE catalog. The skill enables the discovery of active enhancers, promoter state mapping, and super-enhancer identification using ChromHMM and ROSE.

ammawla/encode-toolkit encode genomics regulatory-elements epigenomics
skill ★ 34

ENCODE Hi-C Processing Pipeline

Executes the ENCODE Hi-C pipeline using Nextflow to transform FASTQ files into multi-resolution contact matrices and loop calls. The pipeline supports local, SLURM, and cloud-based deployments via Docker.

ammawla/encode-toolkit hi-c nextflow bioinformatics genomics
tool ★ 34

ENCODE RNA-seq Quantification Pipeline

This tool executes a comprehensive, ENCODE-standard RNA-seq pipeline using Nextflow. It processes raw FASTQ data through STAR alignment, generating gene/transcript quantification (TPM/FPKM) and strand-specific bigWig signal tracks.

ammawla/encode-toolkit rna-seq gene-expression quantification star-alignment
skill ★ 34

ENCODE Pipeline Workflow Generation and Management

This skill facilitates the generation and execution of complex ENCODE bioinformatics workflows, supporting custom Nextflow and WDL pipelines. It manages compute resource requirements and deployment across local, HPC, and major cloud platfor…

ammawla/encode-toolkit encode nextflow workflow bioinformatics