0 job(s) in queue; 0 job(s) currently running
This tool allows comparison of motif enrichment results of 2 independent i-cisTarget analyses. In this analysis, it is possible to compare both results for the same species as well as for different species (if the same motif collection is used), e.g. enriched motifs found for active regions in mouse and drosophila heart.
The comparative analysis can be performed here.
See the Legend explaining the study types below.
Study type | Study | Species version | Input | Reference | Link |
---|---|---|---|---|---|
The top 1000 GATA1 ChIP-seq peaks in K562 cell line (Encode) | Human | Peaks | ENCODE Project Consortium, Nature (2012) | Report | |
FLI1 ChIP-seq peaks in Ewing sarcoma (provided in the Supplementary Table 1 of the corresponding paper) | Human | Peaks | Riggi et al., Cancer Cell (2014) | Report | |
Genes up-regulated after GATA1 activation in G1E-ER4 mouse cells. Human orthologs used as input (as downloaded from MSigDB). | Human | Genes | Welch et al., Blood (2004) | Report | |
Genes down-regulated upon EWS-FLI1 knockdown in Ewing sarcoma | Human | Genes | Riggi et al., Cancer Cell (2014) | Report | |
Differentially expressed genes after TGFbeta treatment in A549 | Human | Genes | Cieślik et al., Epigenetics Chromatin (2013) | Report | |
The top 1000 less active regions (H3K27ac) in Ewing sarcoma upon EWS-FLI1 knockdown | Human | CRRs | Riggi et al., Cancer Cell (2014) | Report | |
100 less active peaks (H3K27ac) in Ewing sarcoma upon EWS-FLI1 knockdown (provided in the Supplementary Table 3 of the corresponding paper) | Human | Peaks | Riggi et al., Cancer Cell (2014) | Report | |
100 more active peaks (H3K27ac) in Ewing sarcoma upon EWS-FLI1 knockdown (provided in the Supplementary Table 3 of the corresponding paper) | Human | Peaks | Riggi et al., Cancer Cell (2014) | Report | |
The top 1000 active regions (ATAC-seq) in Ewing sarcoma after EWS-FLI1 activation | Human | CRRs | Riggi et al., Cancer Cell (2014) | Report | |
Heart positive VISTA enhancers | Human | Peaks | Visel et al., Nucleic Acids Res (2007) | Report | |
The top 500 Gata1 ChIP-seq peaks in MEL mouse cell line (Mouse Encode) | Mouse | Peaks | Mouse Encode | Report | |
Genes up-regulated after GATA1 activation in G1E-ER4 mouse cells (erythroid precursors engineered to express GATA1 upon addition of estradiol. HGNC symbols as downloaded from MSigDB converted to MGI symbols. | Mouse | Genes | Welch et al., Blood (2004) | Report | |
P300 ChIP-seq peaks in mouse heart | Mouse | Peaks | downloaded from GEO | Report | |
Genes related to GO term 'heart process' [GO:0003015] (170 genes, 375 annotations) | Mouse | Genes | downloaded from MGI | Report | |
9433 Zelda ChIP-seq peaks in the early Drosophila melanogaster embryo | Drosophila | Peaks | Harrison et al., Plos Genetics 2011 | Report | |
Conserved eye disc-specific genes (versus wing disc) | Drosophila | Genes | Naval Sanchez et al., Genome Research 2013 | Report | |
Set of co-expressed genes downstream of Dorsal (down in knockout), in the fly embryo | Drosophila | Genes | Stathopoulos et al., Cell (2002) | Report | |
Daphnia pulex heat shock signature | Daphnia pulex | Genes | Becker et al. (unpublished data) | Report | |
Daphnia magna genes upregulated after chronic treatment with microcystin-free cyanobacteria | Daphnia magna | Genes | Schwarzenberger et al., BMC Genomics (2014) | Report |
Study type | Question | Input | Input type | Output |
---|---|---|---|---|
Are motifs and ChIP-seq tracks of my ChIP'ped factor enriched? What are the co-factors of the TF? Which DHS/Faire/Histone tracks are correlated with my TF ChIP-seq peaks? Can we discriminate direct from indirect targets? | TF ChIP-seq peaks | Peaks | The motifs and TF ChIP-seq tracks of the ChIP'ped factor and their direct target regions + motifs and ChIP-seq tracks of co-factors + correlated DHS/Faire/Histone marks in specific cell lines. | |
Which TFs bind to significantly more active/open/repressed regions between two conditions (normal vs cancer, non-treated vs treated, cancer subtype1 vs subtype2,...) based on e.g. H3K27ac/H3K27me3/Faire/ATAC differential analysis? | a set of differentially active regions | Peaks | The most correlated motifs and TF ChIP-seq tracks and their direct target regions, the most correlated DHS/Faire/Histone tracks. | |
Which TF regulates the expression of the specific gene signature/gene module/co-expressed genes? | a set of genes | Genes | The motifs and TF ChIP-seq tracks of the predicted upstream regulators, alongside with the non-TF regulatory tracks in specific cell types. |
The analysis is based on ranking conserved regions in the Human, Mouse or Drosophila genome and recovery curves (See Figure).
Some of the key features of i-cisTarget are:
A vast library of motifs and in vivo regulatory features was compiled for i-cisTarget. These features are:
The input for i-cisTarget can be:
If we start from peaks/loci, e.g.:
then the peaks are mapped to the overlapping candidate i-cisTarget regulatory regions (described below).
If we want to start from a set of genes such as a module, a gene signature, top mutated genes, a Gene Ontology category, etc. to find out for instance which upstream TF regulates this certain group of genes or which DHS track is correlated most, then the input genes will be linked to candidate regulatory regions. To do so, all non-coding regions located in the neighbourhood of a gene will be assigned to this gene. These regions include the promoter regions upstream and downstream to the transcription start site (TSS). The “space” around each gene is a parameter and can be selected. For human and mouse version the space 20 kb around TSS is used.
i-cisTarget analysis relies on candidate regulatory regions that we defined using publicly available regulatory data: General Binding Preference models, CpG islands, proximal promoters, conserved non-coding sequences, ultra-conserved elements, regulatory elements from OregAnno, VistaEnhancers, predicted cis-regulatory modules and DNAseI Hypersensitive (DHS) uniform clustered peaks across 125 cell lines from ENCODE.
Table 1. Publicly available regulatory datasets used to create human i-cisTarget candidate regulatory regions
GBP | CpG | Proximal promoters | CNS | UCR | Oreganno | Vista Enhancers | CRMs | DHS | |
---|---|---|---|---|---|---|---|---|---|
Number of regions | 61550 | 27718 | 34722 | 232101 | 15931 | 23112 | 1339 | 123500 | 1281988 |
% of the genome | 1.77 | 0.73 | 0.67 | 2.25 | 0.13 | 0.39 | 0.07 | 2.05 | 13.36 |
All these features were merged. In a first step, regions having an overlap of at least 20% or 80% with insulator elements in the genome or coding exons respectively were removed. Next, regions with an overlap smaller then 20% or 80% with insulators or exons are split and the regions containing the insulator or coding exons were removed. Remaining regions are then filtered based on size and regions < 30bp="" are="" removed.="" finally,="" any="" resulting="" regions="" shorter="" than="" 1000="" bp="" were="" extended="" if="" possible="" to="" 1000="" bp="" in="" a="" direction="" that="" prevents="" overlap="" with="" an="" insulator="" or="" exon.="" the="" complete="" procedure="" of="" creating="" candidate="" regulatory="" regions="" yielded="" 1.223.024="" regions="" (representing="" ~35%="" of="" the="" genome)="" with="" average="" size="" 818="">
Subsequently, all the candidate regulatory regions were scored and ranked for each feature (motifs and regulatory tracks) and ranking databases were created.
The ranking of the foreground set of the user input (input genes/regions mapped to the i-cisTarget regulatory regions) is considered per each feature (motif or regulatory track) and the Area Under the Curve (AUC) of these “foreground” regions is calculated. The areas for all features are normalized using a Normalized Enrichment Score (NES = (AUC-µ) / σ). Moreover, similar enriched motifs are clustered together using STAMP.
When the analysis is finished the results will appear on the webpage or the link to the results will be provided to the user's e-mail.
The report of results includes (see part A of the figure):
Compared to the old version of i-cisTarget (2012):
Table 2. Human regulatory tracks included in the databases
ENCODE | Epigenomics Roadmap Project | Taipale lab | Aerts lab | ∑ | |
---|---|---|---|---|---|
DHS | 467 | 390 | 0 | 0 | 857 |
FAIRE | 37 | 0 | 0 | 14 | 51 |
Histone ChIP-seq | 402 | 1572 | 3 | 26 | 2003 |
TF ChIP-seq | 1274 | 0 | 117 | 3 | 1394 |
∑ | 2180 | 1962 | 120 | 43 | 4305 |
Table 3. Mouse regulatory tracks included in the databases
ENCODE | |
---|---|
DHS | 150 |
Histone ChIP-seq | 209 |
TF ChIP-seq | 206 |
∑ | 565 |
Gene signatures must be supplied as a list of gene identifiers, separated by newline characters.
The following IDs are supported:
HGNC gene symbols for Human version | MGI gene symbols for Mouse version | FlyBase gene symbols for Drosophila version |
---|---|---|
KLF7 | Cdk6 | pros |
The GMT file format is a tab delimited file format that describes multiple gene sets. In the GMT format each row represents a gene set. Each gene set is described by a name, a description, and the genes in the gene set. These fields are separated by a TAB character; the gene identifiers need to be separated by a semicolon, colon or again a TAB character.
Signature1 Description BCL2;CDH1;ESRRA;GNAL;MITF;MYC;PTEN;VAT1;ZBTB10;TANGO2;POU3F2;HEY2;FAM210B;DCT |
ChIP peaks must be supplied as BED file entries, specifying the locations of these peaks in the Human, Mouse or Drosophila genome. FASTA file input is not supported.
An example of a BED file:
chr1 161871 162031 MACS_peak_1 |
If you have any question or problem related to i-cisTarget, please, inform us: lcbtools@kuleuven.be
If you use i-cisTarget, please cite:
Imrichová,H., Hulselmans,G., Kalender Atak,Z., Potier,D. and Aerts,S. (2015) i-cisTarget 2015 update: generalized cis-regulatory enrichment analysis in human, mouse and fly. Nucleic Acids Res. doi: 10.1093/nar/gkv395
Herrmann,C., Van de Sande,B., Potier,D. and Aerts,S. (2012) i-cisTarget: an integrative genomics method for the prediction of regulatory features and cis-regulatory modules. Nucleic Acids Res. doi: 10.1093/nar/gks543
联系客服