TFregulomeR reveals transcription factors’ context-specific features and functions

Quy Xiao Xuan Lin

National University of Singapore

Denis Thieffry

École Normale Supérieure

Sudhakar Jha

National University of Singapore

Touati Benoukraf

National University of Singapore & Memorial University of Newfoundland

Abstract

Transcription factors (TFs) are sequence-specific DNA-binding proteins that fine-tune spatiotemporal gene expression. Since the genomic occupancy of a TF is highly dynamic, it is crucial to study TF binding sites (TFBSs) in a cell-specific, in vivo context. Here, we introduce TFregulomeR, an R package linked to a large, regularly updated compendium of cistrome and methylome datasets, with functionalities that facilitate the manipulation and analysis of TFBS and methylome metadata. In particular, TFregulomeR permits the characterisation of TF binding partners and cell-specific TFBSs, along with the study of a TF’s functions in the context of different partner combinations and DNA methylation levels. TFregulomeR package version: 2.0.0

The servers linked by TFregulomeR
Browse TFregulomeR data compendium
Retrieve datasets from TFregulomeR data warehouse
Study TFBS propensity
Annotate TFBS locations
Annotate TFBS functions
Connect with TFBSTools

The servers linked by TFregulomeR

The TFregulomeR data compendium is hosted in Singapore and Canada. As an API, TFregulomeR (from v2.0.0) dynamically accesses and retrieves data from either the Singapore server (default) or the Canada server. The functions in TFregulomeR that link to the servers are dataBrowser(), loadPeaks(), searchMotif(), commonPeaks(), exclusivePeaks(), intersectPeakMatrix(), motifDistrib(), genomeAnnotate() and toTFBSTools(). By default, these functions connect to the Singapore server. To use the Canada server instead, pass the input parameter server='ca' when calling these functions.

Browse TFregulomeR data compendium

TFregulomeR allows users to browse the TFregulomeR compendium with a single function, dataBrowser (called TFBSBrowser before v1.2.0). Users can search the TFBSs by species, organ, sample type, cell/tissue name, TF name, disease state, and source. Each search returns a data.frame containing the TFregulomeR ID (important for downstream analysis), species, organ, sample type, cell/tissue name, description of cell/tissue, disease state, TF name, source, TF source ID, number of peaks and number of peaks with motif. If the function is called without any input, all records in the TFregulomeR compendium are returned.

In the TFregulomeR project, we used MACS2 (Zhang et al., 2008) to call peaks from ChIP-seq datasets and subsequently employed MEME-ChIP (Machanick et al., 2011) to perform de novo motif discovery in each peak set. MACS2 was shown to be one of the best peak callers in terms of sensitivity, precision and F-score in a study encompassing 300 simulated ChIP-seq datasets with different noise levels (Thomas et al., 2017), while MEME-ChIP is one of the most commonly used motif callers and was also utilised in both the JASPAR 2010 and 2014 versions (Portales-Casamar et al., 2010; Mathelier et al., 2014). Highly and centrally enriched motifs were selected and compared with existing TF-binding profile databases, such as HOCOMOCO (Kulakovskiy et al., 2018) and JASPAR. Of these, 91 highly enriched motifs were not consistent with the TF-binding profile databases. This is presumably because, in those cell types, the TFs are indirectly recruited to the genome, and/or because non-targeted motifs that are repeatedly observed across ChIP-seq datasets, known as zinger motifs, mask the motif enrichment of the ChIP’ed TF (Hunt et al., 2014). Furthermore, 136 motifs were not recorded for their corresponding TFs in the databases. To verify that divergent motifs do not derive from the use of a specific motif discovery algorithm, we used another tool, HOMER (Heinz et al., 2010), which is based on hypergeometric enrichment, to perform an additional de novo motif discovery. The HOMER motifs were compared with the MEME-ChIP motifs, and their similarity was measured by the normalised Pearson correlation coefficient using the compare-matrices function in RSAT (Nguyen et al., 2018) with the formula: Ncor = cor * w / w_smaller, where cor is the raw Pearson correlation coefficient, w is the alignment width of the two matrices from MEME-ChIP and HOMER (the minimum value of w was set to 5), and w_smaller is the width of the smaller of the two motifs. We found that the majority of the PWMs generated with MEME-ChIP were recapitulated with HOMER. This information has been added to the last two columns of the dataBrowser output (from v1.2.0).
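The normalised correlation above can be worked through with a short base-R sketch. The function name normalised_cor and the input values are hypothetical and serve only to illustrate the formula; the actual scores in the compendium come from RSAT compare-matrices.

```r
# Normalised Pearson correlation as given in the text:
#   Ncor = cor * w / w_smaller
# 'normalised_cor' is an illustrative helper, not a TFregulomeR or RSAT function.
normalised_cor <- function(cor, w, w_smaller) {
  # the text states that the minimum alignment width w was set to 5
  stopifnot(w >= 5, w_smaller > 0)
  cor * w / w_smaller
}

# Hypothetical example: raw correlation 0.9 over an 8-column alignment,
# where the smaller of the two motifs is 10 columns wide.
ncor_example <- normalised_cor(cor = 0.9, w = 8, w_smaller = 10)

# Compare against the 0.7 value marked by the dashed line in Figure 1.
above_threshold <- ncor_example >= 0.7
print(ncor_example)
print(above_threshold)
```

The normalisation penalises alignments that cover only part of the smaller motif, so a high raw correlation over a short overlap does not count as strong agreement.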

Figure 1. Similarity of de novo enriched motifs by MEME-ChIP and HOMER. The beeswarm and violin plots show the normalised Pearson correlation coefficient of de novo motifs called by MEME-ChIP and HOMER, and the red dash denotes normalised Pearson correlation coefficient value 0.7.

library(TFregulomeR)
# browse all records in TFregulomeR TFBS compendium
all_record <- dataBrowser() # or TFBSBrowser() before v1.2.0
#> 1468 record(s) found: ...
#> ... covering 415 TF(s)
#> ... from 1 species:
#> ... ...human
#> ... from 29 organ(s):
#> ... ... stem_cell, blood_and_lymph, connective_tissue, colorectum, brain, bone, stomach, prostate, breast, pancreas, skin, kidney, lung, eye, esophagus, heart, muscle, uterus, spleen, cervix, testis, liver, adrenal_gland, neck_and_mouth, pleura, ovary, thymus, fallopian, vagina
#> ... in 3 sample type(s):
#> ... ... primary_cells, cell_line, tissue
#> ... in  414  different cell(s) or tissue(s)
#> ... in 8 type(s) of disease state(s):
#> ... ... normal, tumor, Simpson_Golabi_Behmel_syndrome, progeria, metaplasia, unknown, immortalized, premetastatic
#> ... from the source(s): GTRD, MethMotif

# returned table
head(all_record)
#>                                               ID species             organ   sample_type
#> 1 GTRD-EXP000061_HSA_embryonic-stem-cells_PRDM14   human         stem_cell primary_cells
#> 2          GTRD-EXP000080_HSA_CD4pos-T-cells_YY1   human   blood_and_lymph primary_cells
#> 3                 GTRD-EXP000128_HSA_EWS502_FLI1   human connective_tissue     cell_line
#> 4                  GTRD-EXP000132_HSA_HUVEC_FLI1   human connective_tissue     cell_line
#> 5                GTRD-EXP000140_HSA_LS180_TCF7L2   human        colorectum     cell_line
#> 6                 GTRD-EXP000142_HSA_LS180_CEBPB   human        colorectum     cell_line
#>       cell_tissue_name                      description disease_state     TF source source_ID
#> 1 embryonic-stem-cells             embryonic stem cells        normal PRDM14   GTRD EXP000061
#> 2       CD4pos-T-cells                     CD4+ T-cells        normal    YY1   GTRD EXP000080
#> 3               EWS502                    Ewing sarcoma         tumor   FLI1   GTRD EXP000128
#> 4                HUVEC umbilical vein endothelial cells        normal   FLI1   GTRD EXP000132
#> 5                LS180                     colon cancer         tumor TCF7L2   GTRD EXP000140
#> 6                LS180                     colon cancer         tumor  CEBPB   GTRD EXP000142
#>   peak_num peak_with_motif_num Consistent_with_HOCOMOCO_JASPAR Ncor_between_MEME_ChIP_and_HOMER
#> 1    17482                5656                             YES                               NA
#> 2    18298                2038                             YES                               NA
#> 3    89523               28914                             YES                               NA
#> 4    71437               36126                             YES                               NA
#> 5    16517                2312                             YES                               NA
#> 6   128542               67747                             YES                               NA

# browse TFBSs in blood and lymph
blood_and_lymph_record <- dataBrowser(organ = "blood_and_lymph") # or TFBSBrowser() before v1.2.0
#> 494 record(s) found: ...
#> ... covering 197 TF(s)
#> ... from 1 species:
#> ... ...human
#> ... from 1 organ(s):
#> ... ... blood_and_lymph
#> ... in 3 sample type(s):
#> ... ... primary_cells, cell_line, tissue
#> ... in  129  different cell(s) or tissue(s)
#> ... in 2 type(s) of disease state(s):
#> ... ... normal, tumor
#> ... from the source(s): GTRD, MethMotif

# browse all CEBPB TFBSs
CEBPB_record <- dataBrowser(tf = "CEBPB") # or TFBSBrowser() before v1.2.0
#> 16 record(s) founded: ...
#> ... covering 1 TF(s)
#> ... from 1 species:
#> ... ...human
#> ... from 9 organ(s):
#> ... ... colorectum, uterus, blood_and_lymph, stem_cell, bone, lung, cervix, liver, 
#> breast
#> ... in 2 sample type(s):
#> ... ... cell_line, primary_cells
#> ... in  16  different cell(s) or tissue(s)
#> ... in 2 type(s) of disease state(s):
#> ... ... tumor, normal
#> ... from the source(s): GTRD, MethMotif
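Because dataBrowser returns an ordinary data.frame, the records can be filtered further with base R. The sketch below runs on a small mock table built from three rows of the output shown above, so it does not contact the server; the 0.3 cut-off is an arbitrary value chosen for illustration.

```r
# Mock subset of a dataBrowser() result, using values copied from the table above.
records <- data.frame(
  ID = c("GTRD-EXP000061_HSA_embryonic-stem-cells_PRDM14",
         "GTRD-EXP000080_HSA_CD4pos-T-cells_YY1",
         "GTRD-EXP000142_HSA_LS180_CEBPB"),
  TF = c("PRDM14", "YY1", "CEBPB"),
  source = "GTRD",
  peak_num = c(17482, 18298, 128542),
  peak_with_motif_num = c(5656, 2038, 67747),
  stringsAsFactors = FALSE
)

# Fraction of peaks that contain the de novo motif.
records$motif_fraction <- records$peak_with_motif_num / records$peak_num

# Keep records where more than 30% of peaks carry the motif (arbitrary cut-off).
high_motif <- records[records$motif_fraction > 0.3, c("ID", "TF", "motif_fraction")]
print(high_motif)
```

The same subsetting pattern applies to the full table returned by dataBrowser, for example to pick the TFregulomeR IDs passed to downstream functions.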

Retrieve datasets from TFregulomeR data warehouse

We provide several functions for retrieving data from the TFregulomeR compendium. searchMotif retrieves the motif matrix and, if the source is MethMotif, the DNA methylation matrix (no DNA methylation information is available if the source is GTRD). These matrices can be saved locally using exportMMPFM, and the corresponding (Meth)Motif logo (a MethMotif logo if the source is MethMotif; a motif logo if the source is GTRD) can be plotted using plotLogo. We introduced an S4 class, “MethMotif”, for easy and intuitive storage, manipulation and conversion (to other packages such as “TFBSTools”) of a MethMotif matrix, which contains a TF motif position weight matrix and its DNA methylation matrix (beta score matrix). In addition, users can directly load the peak regions of a TF of interest (all peaks or only peaks with motif) from the TFregulomeR compendium using loadPeaks.

# Based on the dataBrowser results of the CEBPB query above, we select two CEBPB TFBS records,
# one from MethMotif and one from GTRD: MM1_HSA_K562_CEBPB, GTRD-EXP040801_HSA_HL-60_CEBPB.

# loading MethMotif object in "MEME" format. Currently we support "MEME" and "TRANSFAC".
K562_CEBPB <- searchMotif(id = "MM1_HSA_K562_CEBPB", motif_format = "MEME")
#> There are a matched record exported in a MethMotif object.
HL60_CEBPB <- searchMotif(id = "GTRD-EXP040801_HSA_HL-60_CEBPB", motif_format = "MEME")
#> There are a matched record exported in a MethMotif object.
class(K562_CEBPB)
#> [1] "MethMotif"
#> attr(,"package")
#> [1] "TFregulomeR"

After obtaining a MethMotif object, users can call plotLogo to plot the logo as shown below (Figure 2). If the TFBS source is MethMotif, a MethMotif logo is saved. Two motif logo types are available, “entropy” and “frequency”, and the methylation bar charts can be restricted to different methylation levels (“all”, “methylated” or “unmethylated”). If the TFBS source is GTRD, only a motif logo is saved, because the DNA methylation within the motif is unknown.

The number of peaks with motif is printed in the plot. NOTE that the motif logo is generated from all possible TFBSs in the peak regions (+/-100bp around peak summits); these TFBSs were initially identified by FIMO, with a p-value below 1e-4, after de novo motif discovery using the MEME-ChIP suite. One peak region may contain more than one significant TFBS. Hence, the number of TFBSs in the peak regions can be larger than the number of peaks with motif.

In each MethMotif logo, the bar plot above the motif logo shows the number of cytosines in CpG context covered by WGBS at each base position of the motif. These CpGs are divided into three groups shown in different colours: homogeneously methylated (orange bar, beta score > 90%), heterogeneously methylated (green bar, beta score 10-90%), and homogeneously unmethylated (blue bar, beta score < 10%).

plotLogo(K562_CEBPB, logo_type = "entropy", meth_level = "all")
#> Success: a PDF named 'MM1_HSA_K562_CEBPB-logo-entropy.pdf' has been saved!
plotLogo(K562_CEBPB, logo_type = "entropy", meth_level = "methylated")
#> Success: a PDF named 'MM1_HSA_K562_CEBPB-logo-entropy-methylated-only.pdf' has been saved!
plotLogo(K562_CEBPB, logo_type = "entropy", meth_level = "unmethylated")
#> Success: a PDF named 'MM1_HSA_K562_CEBPB-logo-entropy-unmethylated-only.pdf' has been saved!
plotLogo(K562_CEBPB, logo_type = "frequency", meth_level = "all")
#> Success: a PDF named 'MM1_HSA_K562_CEBPB-logo-frequency.pdf' has been saved!
plotLogo(K562_CEBPB, logo_type = "frequency", meth_level = "methylated")
#> Success: a PDF named 'MM1_HSA_K562_CEBPB-logo-frequency-methylated-only.pdf' has been saved!
plotLogo(K562_CEBPB, logo_type = "frequency", meth_level = "unmethylated")
#> Success: a PDF named 'MM1_HSA_K562_CEBPB-logo-frequency-unmethylated-only.pdf' has been saved!

# plot motif logo for GTRD-EXP040801_HSA_HL-60_CEBPB. No DNA methylation states available here
plotLogo(HL60_CEBPB, logo_type = "entropy")
#> Success: a PDF named 'GTRD-EXP040801_HSA_HL-60_CEBPB-logo-entropy.pdf' has been saved!
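The three colour groups used in the MethMotif bar plots can be illustrated with a small base-R sketch. The helper classify_beta and the beta scores are hypothetical; the sketch assumes beta scores expressed as percentages and follows the thresholds stated above (below 10%, 10-90%, above 90%).

```r
# Assign CpG beta scores (in percent) to the three groups described in the text.
# 'classify_beta' is an illustrative helper, not part of TFregulomeR.
classify_beta <- function(beta_percent) {
  stopifnot(all(beta_percent >= 0 & beta_percent <= 100))
  ifelse(beta_percent < 10, "unmethylated",
         ifelse(beta_percent > 90, "methylated", "heterogeneous"))
}

# Hypothetical beta scores, including the boundary values 10 and 90.
beta <- c(2, 10, 55, 90, 97)
groups <- classify_beta(beta)

# Count CpGs per group, keeping a fixed group order.
group_counts <- table(factor(groups,
                             levels = c("unmethylated", "heterogeneous", "methylated")))
print(group_counts)
```

In this sketch the boundary values 10% and 90% fall into the heterogeneous group, following the strict inequalities in the figure description.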

Figure 2. CEBPB (Meth)Motif logos in K562 and HL-60

The motif matrix and, if available, the methylation matrix (beta score matrix) can be saved locally using exportMMPFM. Note that exportMMPFM can also export the (Meth)Motif matrices from the outputs of commonPeaks, exclusivePeaks and intersectPeakMatrix by specifying “fun =”. This is introduced in the following sections.

# export MethMotif matrix for MM1_HSA_K562_CEBPB
exportMMPFM(fun_output = K562_CEBPB, fun = "searchMotif", save_motif_PFM = TRUE, save_betaScore_matrix = TRUE)
#> Start exporting ... ...
#> ... ... You chose to save motif PFM and beta score matrix.
#> ... ... export searchMotif
#> ... ... ... ... Beta score matrix has been saved as 'MM1_HSA_K562_CEBPB-methScore.txt'.
#> ... ... ... ... Motif PFM has been saved as 'MM1_HSA_K562_CEBPB-motif-MEME.txt'.

# export motif matrix for GTRD-EXP040801_HSA_HL-60_CEBPB
exportMMPFM(fun_output = HL60_CEBPB, fun = "searchMotif", save_motif_PFM = TRUE, save_betaScore_matrix = TRUE)
#> Start exporting ... ...
#> ... ... You chose to save motif PFM and beta score matrix.
#> ... ... export searchMotif
#> ... ... ... ... No beta score matrix is available. Skip!
#> ... ... ... ... Motif PFM has been saved as 'GTRD-EXP040801_HSA_HL-60_CEBPB-motif-MEME.txt'.

# export motif matrix in TRANSFAC format
K562_CEBPB_TRANSFAC <- searchMotif(id = "MM1_HSA_K562_CEBPB", motif_format = "TRANSFAC")
#> There are a matched record exported in a MethMotif object.
exportMMPFM(fun_output = K562_CEBPB_TRANSFAC, fun = "searchMotif", save_motif_PFM = TRUE, save_betaScore_matrix = TRUE)
#> Start exporting ... ...
#> ... ... You chose to save motif PFM and beta score matrix.
#> ... ... export searchMotif
#> ... ... ... ... Beta score matrix has been saved as 'MM1_HSA_K562_CEBPB-methScore.txt'.
#> ... ... ... ... Motif PFM has been saved as 'MM1_HSA_K562_CEBPB-motif-TRANSFAC.txt'.

More importantly, users can load the peak regions (all peaks or only peaks with motif) of any TF in the TFregulomeR compendium using loadPeaks. Note that the peak regions of a given TF in the TFregulomeR compendium are peak summits (hg38 for human), and the TFBS is enriched in a +/- 100bp window surrounding each peak summit. For each peak region, we also provide its tag (read) fold change (fifth column of the returned peaks). This read enrichment value is obtained from MACS2 and denotes the fold change of reads in the TF ChIP-seq compared to the input ChIP-seq.

K562_CEBPB_peaks <- loadPeaks(id = "MM1_HSA_K562_CEBPB", includeMotifOnly = TRUE)
#> Success: peak file has been returned in a data frame!
head(K562_CEBPB_peaks)
#>    chr     start       end                                    id tag_fold_change
#> 1 chr3 101823721 101823722 MM1_HSA_K562_CEBPB_peaks_with_motif_1        22.91742
#> 2 chr3 101850619 101850620 MM1_HSA_K562_CEBPB_peaks_with_motif_2        13.09647
#> 3 chr3 102182290 102182291 MM1_HSA_K562_CEBPB_peaks_with_motif_3        17.28870
#> 4 chr3 105626970 105626971 MM1_HSA_K562_CEBPB_peaks_with_motif_4        23.36092
#> 5 chr3 105647238 105647239 MM1_HSA_K562_CEBPB_peaks_with_motif_5        34.80412
#> 6 chr3 105899733 105899734 MM1_HSA_K562_CEBPB_peaks_with_motif_6        13.61153
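Since loadPeaks returns peak summits, a +/- 100bp window has to be built around each summit before working with the regions where the TFBS is enriched. The following base-R sketch shows one way to do this, using the first three summits from the output above. It assumes the window is anchored on the start coordinate; how TFregulomeR places the window internally is not stated here.

```r
# First three peak summits copied from the loadPeaks() output above.
summits <- data.frame(
  chr = c("chr3", "chr3", "chr3"),
  start = c(101823721, 101850619, 102182290),
  end = c(101823722, 101850620, 102182291),
  stringsAsFactors = FALSE
)

# Assumption: extend 100bp on each side of the summit start position.
flank <- 100
windows <- data.frame(
  chr = summits$chr,
  start = pmax(summits$start - flank, 0),  # never go below coordinate 0
  end = summits$start + flank,
  stringsAsFactors = FALSE
)
windows$width <- windows$end - windows$start
print(windows)
```

Each resulting window spans 200bp, matching the 200bp window that the commonPeaks and exclusivePeaks messages refer to.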

Study TFBS propensity

Common peak regions

Figure 3. Functionality in TFregulomeR for common peak analysis

TFregulomeR provides the function commonPeaks to find common peak regions along with their DNA methylation profiles and read (tag) enrichments.

For the target peak sets, users can either use TFregulomeR TF peaks (hg38 for human) directly by supplying their TFregulomeR IDs in target_peak_id (all peaks or only peaks with motif, selected with motif_only_for_target_peak), or provide their own peak regions in user_target_peak_list. If customised peak sets are provided, they should all be stored in an R list(), and each peak set should be a bed-format data.frame whose first three columns are chromosome (starting with ‘chr’), start and end. Users are advised to provide UNIQUE IDs for their customised peak sets in user_target_peak_id (these should also differ from the TFregulomeR IDs given in target_peak_id). If no IDs are provided, the function automatically assigns IDs to the user’s peak sets. Note that if a customised peak set is derived from the TFregulomeR compendium, its TFregulomeR ID should be given in user_target_peak_id when DNA methylation profiling is requested. Although TFregulomeR peak sets are peak summits, the function recognises them through the TFregulomeR ID provided in user_target_peak_id and automatically expands them by +/- 100bp during the analysis.
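The input requirements for customised peak sets can be checked before calling commonPeaks. The sketch below is a minimal base-R check; check_bed_peaks is a hypothetical helper, not a TFregulomeR function, and the example coordinates are taken from the HCT116 CEBPB file shown in this section.

```r
# Check that a peak set is a bed-format data.frame: at least three columns,
# chromosome names starting with 'chr', and numeric start <= end.
# 'check_bed_peaks' is an illustrative helper, not part of TFregulomeR.
check_bed_peaks <- function(peaks) {
  is.data.frame(peaks) && ncol(peaks) >= 3 &&
    all(grepl("^chr", peaks[[1]])) &&
    is.numeric(peaks[[2]]) && is.numeric(peaks[[3]]) &&
    all(peaks[[2]] <= peaks[[3]])
}

my_peak <- data.frame(V1 = c("chr1", "chr12"),
                      V2 = c(58585814, 122925699),
                      V3 = c(58585827, 122925712),
                      stringsAsFactors = FALSE)
# A deliberately invalid peak set: chromosome names lack the 'chr' prefix.
bad_peak <- data.frame(V1 = c("1", "12"),
                       V2 = c(58585814, 122925699),
                       V3 = c(58585827, 122925712),
                       stringsAsFactors = FALSE)

# Customised peak sets go into a list; their IDs should be unique and
# should not collide with the TFregulomeR IDs in target_peak_id.
user_target_peak_list <- list(my_peak)
user_target_peak_id <- c("HCT116_CEBPB")
target_peak_id <- c("MM1_HSA_K562_CEBPB")

all_valid <- all(vapply(user_target_peak_list, check_bed_peaks, logical(1)))
ids_ok <- anyDuplicated(user_target_peak_id) == 0 &&
  length(intersect(user_target_peak_id, target_peak_id)) == 0
print(c(all_valid = all_valid, ids_ok = ids_ok))
```

Running such a check first makes it easier to locate a malformed input than to interpret an error raised in the middle of the peak comparison.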

For the compared peak sets, the same rules apply to the options compared_peak_id, motif_only_for_compared_peak, user_compared_peak_list and user_compared_peak_id.

During the analysis, EACH target peak set, whether loaded from the TFregulomeR compendium via target_peak_id or provided by the user via user_target_peak_list, is compared with ALL input compared peak sets (compared_peak_id and user_compared_peak_list) to obtain a final target sub-ensemble of peaks shared by all compared peak sets. If methylation_profile_in_narrow_region=TRUE, DNA methylation is profiled in the +/- 100bp window surrounding the peak summits of each target common peak sub-ensemble, provided that its ID in target_peak_id or user_target_peak_id matches a MethMotif ID (“MM1_HSA_”) in TFregulomeR.
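The comparison logic described above can be sketched in base R on toy data: a target peak is kept only if it overlaps at least one peak in every compared set. This is a simplified illustration under the assumption of half-open bed intervals, not the implementation used inside commonPeaks; the helper names and coordinates are hypothetical.

```r
# TRUE for each target peak that overlaps at least one peak in 'other'
# (same chromosome, half-open bed intervals). Illustrative helper only.
overlaps_any <- function(target, other) {
  vapply(seq_len(nrow(target)), function(i) {
    any(other$chr == target$chr[i] &
          other$start < target$end[i] &
          other$end > target$start[i])
  }, logical(1))
}

# Keep the target peaks that are shared with ALL compared peak sets.
common_subset <- function(target, compared_list) {
  keep <- rep(TRUE, nrow(target))
  for (cmp in compared_list) keep <- keep & overlaps_any(target, cmp)
  target[keep, , drop = FALSE]
}

# Toy target set and two toy compared sets.
target <- data.frame(chr = c("chr1", "chr1", "chr2"),
                     start = c(100, 1000, 500), end = c(300, 1200, 700),
                     stringsAsFactors = FALSE)
cmp_a <- data.frame(chr = c("chr1", "chr2"),
                    start = c(250, 600), end = c(450, 800),
                    stringsAsFactors = FALSE)
cmp_b <- data.frame(chr = c("chr1", "chr1", "chr2"),
                    start = c(150, 1100, 650), end = c(260, 1300, 900),
                    stringsAsFactors = FALSE)

common <- common_subset(target, list(cmp_a, cmp_b))
# Percentage of the original target peaks retained, analogous to the peak summary.
common_pct <- 100 * nrow(common) / nrow(target)
print(common)
```

The second target peak is dropped because it overlaps a peak in cmp_b but none in cmp_a, which is why the common sub-ensemble shrinks as more compared sets are added.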

# read my local file. Here we use the HCT116 CEBPB binding sites from the publication PMID: 30380113
my_peak_path <- system.file("extdata", "HCT116_CEBPb_binding_sites.txt", package = "TFregulomeR")
my_peak <- read.delim(my_peak_path, sep = "\t", header = FALSE)
head(my_peak)
#>      V1        V2        V3
#> 1  chr1  58585814  58585827
#> 2 chr12 122925699 122925712
#> 3  chr9   5818111   5818124
#> 4 chr19   5850889   5850902
#> 5 chr10   5951274   5951287
#> 6  chr1   8175025   8175038

# To get the sub-ensembles of K562 CEBPB peaks and of my peaks that are
# shared with the CEBPB peaks in all cell types from the TFregulomeR compendium,
# and at the same time to profile the DNA methylation states in the final subsets.

# 1) Get all CEBPB records in TFregulomeR compendium
CEBPB_record <- dataBrowser(tf = "CEBPB") # or TFBSBrowser() before v1.2.0
#> 16 records(s) founded: ...
#> ... covering 1 TF(s)
#> ... from 1 species:
#> ... ...human
#> ... from 9 organ(s):
#> ... ... colorectum, uterus, blood_and_lymph, stem_cell, bone, lung, cervix, liver, breast
#> ... in 2 sample type(s):
#> ... ... cell_line, primary_cells
#> ... in  16  different cell(s) or tissue(s)
#> ... in 2 type(s) of disease state(s):
#> ... ... tumor, normal
#> ... from the source(s): GTRD, MethMotif

# 2) Start commonPeaks analysis
commonPeak_output <- commonPeaks(target_peak_id = "MM1_HSA_K562_CEBPB",
                                 motif_only_for_target_peak = TRUE, 
                                 user_target_peak_list = list(my_peak), 
                                 user_target_peak_id = c("HCT116_CEBPB"), 
                                 compared_peak_id = CEBPB_record$ID, 
                                 motif_only_for_compared_peak = TRUE, 
                                 methylation_profile_in_narrow_region = TRUE)
#> TFregulomeR::commonPeaks() starting ... ...
#> You chose to profile the methylation levels in 200bp window around peak summits, 
#> if there is any peak loaded from TFregulome
#> Loading target peak list ... ...
#> ... You have 1 TFBS(s) requested to be loaded from TFregulomeR server
#> ... You chose to load TF peaks with motif only. Using 'motif_only_for_target_peak' tunes your options
#> ... loading TFBS(s) from TFregulomeR now
#> ... ... peak file loaded successfully for id 'MM1_HSA_K562_CEBPB'
#> ... Done loading TFBS(s) from TFregulome
#> ... You have 1 customised peak set(s)
#> Loading compared peak list ... ...
#> ... You have 16 TFBS(s) requested to be loaded from TFregulomeR server
#> ... You chose to load TF peaks with motif only. Using 'motif_only_for_compared_peak' tunes your options
#> ... loading TFBS(s) from TFregulomeR now
#> ... ... peak file loaded successfully for id 'GTRD-EXP000142_HSA_LS180_CEBPB'
#> ... ... peak file loaded successfully for id 'GTRD-EXP010975_HSA_Ishikawa_CEBPB'
#> ... ... peak file loaded successfully for id 'GTRD-EXP030173_HSA_LoVo_CEBPB'
#> ... ... peak file loaded successfully for id 'GTRD-EXP030702_HSA_blood-monocytes_CEBPB'
#> ... ... peak file loaded successfully for id 'GTRD-EXP034967_HSA_mesenchymal-stem-cells_CEBPB'
#> ... ... peak file loaded successfully for id 'GTRD-EXP036478_HSA_fetal-osteoblasts_CEBPB'
#> ... ... peak file loaded successfully for id 'GTRD-EXP040652_HSA_monocyte-derived-macrophages_CEBPB'
#> ... ... peak file loaded successfully for id 'GTRD-EXP040801_HSA_HL-60_CEBPB'
#> ... ... peak file loaded successfully for id 'MM1_HSA_A549_CEBPB'
#> ... ... peak file loaded successfully for id 'MM1_HSA_H1-hESC_CEBPB'
#> ... ... peak file loaded successfully for id 'MM1_HSA_HCT116_CEBPB'
#> ... ... peak file loaded successfully for id 'MM1_HSA_HeLa-S3_CEBPB'
#> ... ... peak file loaded successfully for id 'MM1_HSA_HepG2_CEBPB'
#> ... ... peak file loaded successfully for id 'MM1_HSA_IMR-90_CEBPB'
#> ... ... peak file loaded successfully for id 'MM1_HSA_K562_CEBPB'
#> ... ... peak file loaded successfully for id 'MM1_HSA_MCF-7_CEBPB'
#> ... Done loading TFBS(s) from TFregulome
#> Start analysing: MM1_HSA_K562_CEBPB... ...
#> Start analysing: HCT116_CEBPB... ...
#> Done analysing.

After obtaining the output of commonPeaks, you can use commonPeakResult to get 1) the summary, 2) the common peak regions, 3) the DNA methylation levels in the 200bp window around the common peak summits, if the input TF source is MethMotif in the TFregulomeR compendium, and 4) the (Meth)Motif logo, if the TF input is from the TFregulomeR warehouse or comes with a TFregulomeR ID.

commonPeak_result <- commonPeakResult(commonPeaks = commonPeak_output,
                                      return_common_peak_sites = TRUE, 
                                      save_MethMotif_logo = TRUE, 
                                      return_methylation_profile = TRUE, 
                                      return_summary = TRUE)
#> Start getting the results of commonPeakResult ...
#> ... ... You chose to return common peak sites;
#> ... ... You chose to return methylation profile;
#> ... ... You chose to return common peak summary;
#> ... ... ... ALL of common peak sets, methylation profiles and peak summary will be 
#> stored in a list, and named with 'common_peak_list', 'methylation_profile' and 
#> 'peak_summary' in the list. Use 'names()' in the output for its detials.
#> ... ... You chose to save MethMotif logo in PDF if any;
#> ... ... ... You chose entropy logo;
#> ... ... ... You chose to show all methylation levels;
#> Success: a PDF named 'MM1_HSA_K562_CEBPB_common_peaks-logo-entropy.pdf' has been saved!
#> ... ... ... The input peak set for the results 'HCT116_CEBPB_common_peaks' was 
#> not originated from TFregulomeR or the number of direct binding sites in the 
#> common peaks is 0, so no motif logo available.

# the contents in commonPeak_result
names(commonPeak_result)
#> [1] "common_peak_list"    "methylation_profile" "peak_summary" 
common_peak_list <- commonPeak_result$common_peak_list
methylation_profile <- commonPeak_result$methylation_profile
peak_summary <- commonPeak_result$peak_summary

# peak summary: 1.137225% of K562 CEBPB peaks and 2.517540% of my peaks are common with 
# the CEBPB peaks in all cell types available in TFregulomeR warehouse.
peak_summary
#>                                 percentage_in_original_inputs(%)
#> MM1_HSA_K562_CEBPB_common_peaks                         1.137225
#> HCT116_CEBPB_common_peaks                               2.517540

# common peak regions
K562_CEBPB_common_peak <- common_peak_list$MM1_HSA_K562_CEBPB_common_peaks
head(K562_CEBPB_common_peak)
#>      chr     start       end                                      id tag_fold_change
#> 39  chr3 126261153 126261154  MM1_HSA_K562_CEBPB_peaks_with_motif_39        91.77499
#> 145 chr3 152100699 152100700 MM1_HSA_K562_CEBPB_peaks_with_motif_145        74.01529
#> 276 chr3 188239782 188239783 MM1_HSA_K562_CEBPB_peaks_with_motif_276        13.06116
#> 283 chr3 194003766 194003767 MM1_HSA_K562_CEBPB_peaks_with_motif_283        26.67740
#> 435 chr4  73704747  73704748 MM1_HSA_K562_CEBPB_peaks_with_motif_435        54.26637
#> 456 chr4  77158137  77158138 MM1_HSA_K562_CEBPB_peaks_with_motif_456        31.97526

# methylation profile in common peak regions
names(methylation_profile)
#> [1] "MM1_HSA_K562_CEBPB_common_peaks" "HCT116_CEBPB_common_peaks"
methylation_profile$MM1_HSA_K562_CEBPB_common_peaks
#>         CpG_num
#> 0-10%       161  #161 CpG methylation scores are less than 0.1 (homogeneously unmethylated) in +/-100bp window around common peak summits
#> 10-20%        9
#> 20-30%       17
#> 30-40%        2
#> 40-50%        1
#> 50-60%        2
#> 60-70%        9
#> 70-80%        5
#> 80-90%       13
#> 90-100%      25 #25 CpG methylation scores are more than 0.9 (homogeneously methylated) in +/-100bp window around common peak summits

# the customised input peaks do not originate from the TFregulomeR compendium, so no DNA methylation states are available
methylation_profile$HCT116_CEBPB_common_peaks
#>      [,1]
#> [1,]   NA

Figure 4. MethMotif logo of K562 CEBPB common peaks

Exclusive peak regions

Figure 5. Functionality in TFregulomeR for exclusive peak analysis

Exclusive peak regions are important for studying a TF’s context-specific function. We therefore implemented exclusivePeaks, which extracts the context-dependent peak loci along with their DNA methylation profiles.

For the target peak sets, users can either use TFregulomeR TF peaks (hg38 for human) directly by supplying the TFregulomeR IDs in target_peak_id (all peaks or only peaks with motif, selected with motif_only_for_target_peak), or provide their own peak regions in user_target_peak_list. If customised peak sets are provided, they should all be stored in an R list(), and each peak set should be a bed-format data.frame whose first three columns are chromosome (starting with ‘chr’), start and end. Users are advised to provide UNIQUE IDs for their customised peak sets in user_target_peak_id (these should also differ from the TFregulomeR IDs given in target_peak_id). If no IDs are provided, the function automatically assigns IDs to the provided peak sets. Note that if a customised target peak set is derived from the TFregulomeR compendium, its TFregulomeR ID should be given in user_target_peak_id when DNA methylation profiling is requested. Although TFregulomeR peak sets are peak summits, the function recognises them through the TFregulomeR ID provided in user_target_peak_id and automatically expands them by +/- 100bp during the analysis.

For the excluded peak sets, the same rules apply to the options excluded_peak_id, motif_only_for_excluded_peak, user_excluded_peak_list and user_excluded_peak_id.

During the analysis, EACH target peak set, whether loaded from the TFregulomeR compendium via target_peak_id or provided by the user via user_target_peak_list, is compared with ALL input excluded peak sets (excluded_peak_id and user_excluded_peak_list) to obtain a final target peak sub-ensemble that is exclusive of all input excluded peak sets. If methylation_profile_in_narrow_region=TRUE, DNA methylation is profiled in the +/- 100bp window surrounding the peak summits of each target exclusive peak sub-ensemble, provided that its ID in target_peak_id or user_target_peak_id matches a MethMotif ID record (“MM1_HSA_”) in the TFregulomeR compendium.
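The exclusion step can likewise be sketched in base R on toy data: a target peak is retained only if it overlaps no peak in any of the excluded sets. As before, this is a simplified illustration assuming half-open bed intervals, not the code run by exclusivePeaks; the helper names and coordinates are hypothetical.

```r
# TRUE for each target peak that overlaps at least one peak in 'other'
# (same chromosome, half-open bed intervals). Illustrative helper only.
overlaps_any <- function(target, other) {
  vapply(seq_len(nrow(target)), function(i) {
    any(other$chr == target$chr[i] &
          other$start < target$end[i] &
          other$end > target$start[i])
  }, logical(1))
}

# Keep the target peaks that overlap NONE of the excluded peak sets.
exclusive_subset <- function(target, excluded_list) {
  hit <- rep(FALSE, nrow(target))
  for (ex in excluded_list) hit <- hit | overlaps_any(target, ex)
  target[!hit, , drop = FALSE]
}

# Toy target set and two toy excluded sets.
target <- data.frame(chr = c("chr1", "chr1", "chr3"),
                     start = c(100, 1000, 100), end = c(300, 1200, 300),
                     stringsAsFactors = FALSE)
excl_a <- data.frame(chr = "chr1", start = 250, end = 450,
                     stringsAsFactors = FALSE)
# Same coordinates as a target peak but on another chromosome: no overlap.
excl_b <- data.frame(chr = "chr2", start = 100, end = 300,
                     stringsAsFactors = FALSE)

exclusive <- exclusive_subset(target, list(excl_a, excl_b))
print(exclusive)
```

Only the first target peak is removed, because it overlaps the peak in excl_a; a single overlap with any excluded set is enough to disqualify a target peak.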

# To get the exclusive subset of K562 CEBPB peaks and at the same time to profile the DNA 
# methylation states in the exclusive subset

# 1) Get all CEBPB records in TFregulomeR warehouse
CEBPB_record <- dataBrowser(tf = "CEBPB") # or TFBSBrowser() before v1.2.0
#> 16 record(s) founded: ...
#> ... covering 1 TF(s)
#> ... from 1 species:
#> ... ...human
#> ... from 9 organ(s):
#> ... ... colorectum, uterus, blood_and_lymph, stem_cell, bone, lung, cervix, liver, breast
#> ... in 2 sample type(s):
#> ... ... cell_line, primary_cells
#> ... in  16  different cell(s) or tissue(s)
#> ... in 2 type(s) of disease state(s):
#> ... ... tumor, normal
#> ... from the source(s): GTRD, MethMotif

# 2) All CEBPB TFregulomeR IDs except MM1_HSA_K562_CEBPB
CEBPB_record_ID_noK562 <- CEBPB_record$ID[!(CEBPB_record$ID %in% "MM1_HSA_K562_CEBPB")]
exclusivePeak_output <- exclusivePeaks(target_peak_id = "MM1_HSA_K562_CEBPB", 
                                       motif_only_for_target_peak = TRUE, 
                                       excluded_peak_id = CEBPB_record_ID_noK562, 
                                       motif_only_for_excluded_peak = TRUE, 
                                       methylation_profile_in_narrow_region = TRUE)
#> TFregulomeR::exclusivePeaks() starting ... ...
#> You chose to profile the methylation levels in 200bp window around peak summits, 
#> if there is any peak loaded from TFregulome
#> Loading target peak list ... ...
#> ... You have 1 TFBS(s) requested to be loaded from TFregulomeR server
#> ... You chose to load TF peaks with motif only. Using 'motif_only_for_target_peak' tunes your options
#> ... loading TFBS(s) from TFregulomeR now
#> ... ... peak file loaded successfully for id 'MM1_HSA_K562_CEBPB'
#> ... Done loading TFBS(s) from TFregulome
#> Loading excluded peak list ... ...
#> ... You have 15 TFBS(s) requested to be loaded from TFregulomeR server
#> ... You chose to load TF peaks with motif only. Using 'motif_only_for_excluded_peak' tunes your options
#> ... loading TFBS(s) from TFregulomeR now
#> ... ... peak file loaded successfully for id 'GTRD-EXP000142_HSA_LS180_CEBPB'
#> ... ... peak file loaded successfully for id 'GTRD-EXP010975_HSA_Ishikawa_CEBPB'
#> ... ... peak file loaded successfully for id 'GTRD-EXP030173_HSA_LoVo_CEBPB'
#> ... ... peak file loaded successfully for id 'GTRD-EXP030702_HSA_blood-monocytes_CEBPB'
#> ... ... peak file loaded successfully for id 'GTRD-EXP034967_HSA_mesenchymal-stem-cells_CEBPB'
#> ... ... peak file loaded successfully for id 'GTRD-EXP036478_HSA_fetal-osteoblasts_CEBPB'
#> ... ... peak file loaded successfully for id 'GTRD-EXP040652_HSA_monocyte-derived-macrophages_CEBPB'
#> ... ... peak file loaded successfully for id 'GTRD-EXP040801_HSA_HL-60_CEBPB'
#> ... ... peak file loaded successfully for id 'MM1_HSA_A549_CEBPB'
#> ... ... peak file loaded successfully for id 'MM1_HSA_H1-hESC_CEBPB'
#> ... ... peak file loaded successfully for id 'MM1_HSA_HCT116_CEBPB'
#> ... ... peak file loaded successfully for id 'MM1_HSA_HeLa-S3_CEBPB'
#> ... ... peak file loaded successfully for id 'MM1_HSA_HepG2_CEBPB'
#> ... ... peak file loaded successfully for id 'MM1_HSA_IMR-90_CEBPB'
#> ... ... peak file loaded successfully for id 'MM1_HSA_MCF-7_CEBPB'
#> ... Done loading TFBS(s) from TFregulome
#> Start analysing: MM1_HSA_K562_CEBPB... ...
#> Done analysing.

After getting the output of exclusivePeaks, you can use exclusivePeakResult to get 1) the summary, 2) the exclusive peak regions, 3) DNA methylation levels in exclusive peak subsets if the input source is MethMotif in TFregulomeR compendium and 4) the (Meth)Motif logo if the input target is from TFregulomeR compendium or comes with a TFregulomeR ID.

exclusivePeak_result <- exclusivePeakResult(exclusivePeaks = exclusivePeak_output,
                                            return_exclusive_peak_sites = TRUE,
                                            save_MethMotif_logo = TRUE, 
                                            return_methylation_profile = TRUE,
                                            return_summary = TRUE)
#> Start getting the results of exclusivePeaks ...
#> ... ... You chose to return exclusive peak sites;
#> ... ... You chose to return methylation profile;
#> ... ... You chose to return exclusive peak summary;
#> ... ... ... ALL of exclusive peak sets, methylation profiles and peak summary will 
#> be stored in a list, and named with 'exclusive_peak_list', 'methylation_profile' and 
#> 'peak_summary' in the list. Use 'names()' in the output for its detials.
#> ... ... You chose to save MethMotif logo in PDF if any;
#> ... ... ... You chose entropy logo;
#> ... ... ... You chose to show all methylation levels;
#> Success: a PDF named 'MM1_HSA_K562_CEBPB_exclusive_peaks-logo-entropy.pdf' has been saved!

# the contents in exclusivePeak_result
names(exclusivePeak_result)
#> [1] "exclusive_peak_list" "methylation_profile" "peak_summary" 
exclusive_peak_list <- exclusivePeak_result$exclusive_peak_list
peak_summary <- exclusivePeak_result$peak_summary
methylation_profile <- exclusivePeak_result$methylation_profile

# peak summary, 9.110437% of K562 CEBPB peaks are unique compared with 
# all CEBPB TFBSs in TFregulomeR warehouse
peak_summary
#>                                    percentage_in_original_inputs(%)
#> MM1_HSA_K562_CEBPB_exclusive_peaks                         9.110437

K562_CEBPB_exclusive_peak <- exclusive_peak_list$MM1_HSA_K562_CEBPB_exclusive_peaks
head(K562_CEBPB_exclusive_peak)
#>     chr     start       end                                     id tag_fold_change
#> 4  chr3 105626970 105626971  MM1_HSA_K562_CEBPB_peaks_with_motif_4        23.36092
#> 22 chr3 119695686 119695687 MM1_HSA_K562_CEBPB_peaks_with_motif_22        10.89517
#> 45 chr3 128415444 128415445 MM1_HSA_K562_CEBPB_peaks_with_motif_45        40.57037
#> 57 chr3 130184388 130184389 MM1_HSA_K562_CEBPB_peaks_with_motif_57        23.26712
#> 58 chr3 130343212 130343213 MM1_HSA_K562_CEBPB_peaks_with_motif_58        13.14518
#> 68 chr3 133629332 133629333 MM1_HSA_K562_CEBPB_peaks_with_motif_68        20.32872

# methylation profile in exclusive peak regions
names(methylation_profile)
#> [1] "MM1_HSA_K562_CEBPB_exclusive_peaks"
methylation_profile$MM1_HSA_K562_CEBPB_exclusive_peaks
#>         CpG_num
#> 0-10%       852  #852 CpG methylation scores are less than 0.1 (homogeneously unmethylated) in +/-100bp window around exclusive peak summits
#> 10-20%      149
#> 20-30%       86
#> 30-40%       63
#> 40-50%       45
#> 50-60%       39
#> 60-70%       25
#> 70-80%       46
#> 80-90%       29
#> 90-100%      40 #40 CpG methylation scores are more than 0.9 (homogeneously methylated) in +/-100bp window around exclusive peak summits

Figure 6. MethMotif logo of K562 CEBPB exclusive peaks

Intersected peak matrix

Figure 7. Functionality in TFregulomeR for cofactor and interactome analysis

TFregulomeR allows users to portray the co-binding landscapes between two collections of TFs, along with the DNA methylation states in the pair-wise intersected peaks, using intersectPeakMatrix. This functionality is particularly useful for studying TF cofactors and the TF interactome in a cell type (see the TF interactome demo for details). Unlike commonPeaks, intersectPeakMatrix performs an exhaustive pair-wise intersection analysis between peak list x and peak list y, forming an x*y intersection matrix. Users are therefore required to provide both lists of peak sets.
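
The pair-wise idea can be illustrated without the package. The following base R sketch uses invented intervals and a hypothetical helper, pct_overlap() (neither is part of TFregulomeR), to build a table whose cells hold the percentage of each x peak set overlapped by each y peak set. The real function additionally profiles motifs, methylation and read enrichment, so this is only a simplified picture of the matrix layout.

```r
# Toy peak sets on a single chromosome (invented coordinates, for illustration only)
peak_list_x <- list(
  TF_A = data.frame(start = c(100, 500, 900), end = c(300, 700, 1100)),
  TF_B = data.frame(start = c(250, 2000),     end = c(450, 2200))
)
peak_list_y <- list(
  TF_C = data.frame(start = c(280, 950), end = c(480, 1150)),
  TF_D = data.frame(start = 5000,        end = 5200)
)

# Hypothetical helper: percentage of peaks in `a` overlapping at least one peak in `b`
pct_overlap <- function(a, b) {
  hit <- vapply(seq_len(nrow(a)), function(i) {
    any(a$start[i] <= b$end & a$end[i] >= b$start)
  }, logical(1))
  100 * mean(hit)
}

# Exhaustive pair-wise comparison: one row per x set, one column per y set
intersection_matrix <- sapply(peak_list_y, function(y) {
  sapply(peak_list_x, function(x) pct_overlap(x, y))
})
print(intersection_matrix)
```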

For peak list x, users can directly use TFregulomeR peaks by providing TFregulomeR IDs in peak_id_x and indicating, via motif_only_for_id_x, whether to load only the peaks with motif. Customised peak sets can also be supplied in user_peak_list_x. It is recommended that a UNIQUE ID (also distinct from the IDs in peak_id_x) be provided for each customised peak set in user_peak_x_id. If a customised peak set is derived from the TFregulomeR compendium, it is highly recommended that the corresponding TFregulomeR ID be provided in user_peak_x_id, which allows the function to recognise the source of the peak set and to properly profile the DNA methylation states (if methylation_profile_in_narrow_region=TRUE) as well as the read enrichments in the intersected regions. Even though TFregulomeR peak sets are peak summits, the function recognises them from the TFregulomeR ID provided in peak_id_x and automatically expands them by +/- 100bp during the analysis.
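
To picture the summit expansion, here is a minimal base R sketch on a toy table in the chr/start/end layout of the peak outputs. The variable names and the exact boundary handling are assumptions for illustration; the package performs this step internally.

```r
# Toy summit table (two 1-bp summits)
summits <- data.frame(chr   = c("chr3", "chr3"),
                      start = c(105626970, 119695686),
                      end   = c(105626971, 119695687))

# Expand each summit by 100 bp on both sides (assumed half-window),
# never letting the start coordinate drop below zero
half_window <- 100
regions <- transform(summits,
                     start = pmax(start - half_window, 0),
                     end   = end + half_window)
print(regions)
```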

The same principles apply to peak list y.

In addition to profiling the DNA methylation landscapes across the peak regions of different TF-pair combinations (TFx-TFy), from TFregulomeR v1.2.2 onwards users can input their own external signals in bed format (at least four columns: chromosome, start, end and score) and assess the external signal scores across the same peak regions at the same time. For example, if a DNase-seq bedgraph is provided (the fourth column denotes DNase-seq read intensity), DNase-seq read intensity values will be profiled during the TF intersection analysis. To run this analysis, simply pass the data via external_source in intersectPeakMatrix. Of note, it is highly recommended that users restrict the external signal file to the chromosome regions of interest, because a whole-genome file could substantially slow down the analysis (see the example in the TF interactome demo).
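
One way to restrict an external signal table to regions of interest before passing it on is sketched below in base R. The data are invented and the filtering approach is an assumption, not the procedure used by the authors; for genome-scale files a dedicated interval library would be more efficient.

```r
# Toy external signal in BED-like format: chromosome, start, end, score
signal <- data.frame(chr   = c("chr1", "chr1", "chr1", "chr2"),
                     start = c(0, 1000, 5000, 200),
                     end   = c(500, 1500, 5500, 700),
                     score = c(1.2, 8.5, 0.3, 4.1))

# Toy regions of interest (for example, merged peak regions)
regions <- data.frame(chr   = c("chr1", "chr2"),
                      start = c(1200, 100),
                      end   = c(1400, 300))

# Keep signal rows overlapping at least one region on the same chromosome
keep <- vapply(seq_len(nrow(signal)), function(i) {
  any(regions$chr == signal$chr[i] &
      regions$start < signal$end[i] &
      regions$end   > signal$start[i])
}, logical(1))
signal_subset <- signal[keep, ]
print(signal_subset)
```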

Advice: intersectPeakMatrix can take from several minutes to hours, depending on the number of input TFs. Save the intersectPeakMatrix output in R data format using saveRDS(object, file="my_data.rds") and restore it using readRDS(file="my_data.rds") if you need to re-use the output to extract other results with the function intersectPeakMatrixResult.
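
A simple caching pattern built on saveRDS and readRDS could look like the sketch below. Here slow_analysis() is a hypothetical stand-in for a long-running call such as intersectPeakMatrix, and the file is written to a temporary directory purely for illustration.

```r
cache_file <- file.path(tempdir(), "my_data.rds")

# Hypothetical stand-in for a slow call such as intersectPeakMatrix()
slow_analysis <- function() {
  matrix(c(100, 36.7, 12.2), nrow = 1,
         dimnames = list("TF_x", c("TF_a", "TF_b", "TF_c")))
}

# Recompute only when no cached copy exists
if (file.exists(cache_file)) {
  result <- readRDS(file = cache_file)
} else {
  result <- slow_analysis()
  saveRDS(result, file = cache_file)
}
```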

# profile the co-binding landscapes of all K562 TFs in TFregulomeR warehouse surrounding
# K562 CEBPB common and exclusive peaks

# browse all TFBS record in K562
K562_TFBS = dataBrowser(cell_tissue_name = "K562") # or TFBSBrowser() before v1.2.0
#> 131 record(s) found: ...
#> ... covering 131 TF(s)
#> ... from 1 species:
#> ... ...human
#> ... from 1 organ(s):
#> ... ... blood_and_lymph
#> ... in 1 sample type(s):
#> ... ... cell_line
#> ... in  1  different cell(s) or tissue(s)
#> ... in 1 type(s) of disease state(s):
#> ... ... tumor
#> ... from the source(s): MethMotif

# co-binding landscapes in K562 CEBPB common peaks
intersectMatrix_common <- intersectPeakMatrix(user_peak_list_x = list(K562_CEBPB_common_peak),
                                              user_peak_x_id =  c("MM1_HSA_K562_CEBPB"), 
                                              peak_id_y = K562_TFBS$ID, 
                                              motif_only_for_id_y = TRUE, 
                                              methylation_profile_in_narrow_region = TRUE)
#> TFregulomeR::intersectPeakMatrix() starting ... ...
#> You chose to profile the methylation levels in 200bp window around peak summits, 
#> if there is any peak loaded from TFregulome. It will make the program slow. 
#> Disable it if you want a speedy analysis and do not care about methylation
#> Loading peak list x ... ...
#> ... You have 1 customised peak set(s)
#> Loading peak list y ... ...
#> ... You have 131 TFBS(s) requested to be loaded from TFregulomeR server
#> ... You chose to load TF peaks with motif only. Using 'motif_only_for_id_y' tunes your options
#> ... loading TFBS(s) from TFregulomeR now
#> ... ... peak file loaded successfully for id 'MM1_HSA_K562_AFF1'
#> ... ... peak file loaded successfully for id 'MM1_HSA_K562_ARID2'
#> ... ... peak file loaded successfully for id 'MM1_HSA_K562_ARID3A'
#> ... ... peak file loaded successfully for id 'MM1_HSA_K562_ATF1'
#>     ... ...
#> ... ... peak file loaded successfully for id 'MM1_HSA_K562_ZSCAN29'
#> ... Done loading TFBS(s) from TFregulome
#> Start analysing list x:MM1_HSA_K562_CEBPB... ...
#> ... ... Start analysing list y:MM1_HSA_K562_AFF1
#> ... ... Start analysing list y:MM1_HSA_K562_ARID2
#> ... ... Start analysing list y:MM1_HSA_K562_ARID3A
#> ... ... Start analysing list y:MM1_HSA_K562_ATF1
#>     ... ...
#> ... ... Start analysing list y:MM1_HSA_K562_ZSCAN29

# K562 CEBPB exclusive peaks
intersectMatrix_exclusive <- intersectPeakMatrix(user_peak_list_x = list(K562_CEBPB_exclusive_peak), 
                                                 user_peak_x_id = c("MM1_HSA_K562_CEBPB"), 
                                                 peak_id_y = K562_TFBS$ID,
                                                 motif_only_for_id_y = TRUE,
                                                 methylation_profile_in_narrow_region = TRUE)
#> TFregulomeR::intersectPeakMatrix() starting ... ...
#> You chose to profile the methylation levels in 200bp window around peak summits, 
#> if there is any peak loaded from TFregulome. It will make the program slow. 
#> Disable it if you want a speedy analysis and do not care about methylation
#> Loading peak list x ... ...
#> ... You have 1 customised peak set(s)
#> Loading peak list y ... ...
#> ... You have 108 TFBS(s) requested to be loaded from TFregulomeR server
#> ... You chose to load TF peaks with motif only. Using 'motif_only_for_id_y' tunes your options
#> ... loading TFBS(s) from TFregulomeR now
#> .. ... peak file loaded successfully for id 'MM1_HSA_K562_AFF1'
#> .. ... peak file loaded successfully for id 'MM1_HSA_K562_ARID2'
#> .. ... peak file loaded successfully for id 'MM1_HSA_K562_ARID3A'
#> .. ... peak file loaded successfully for id 'MM1_HSA_K562_ATF1'
#>     ... ...
#> .. ... peak file loaded successfully for id 'MM1_HSA_K562_ZSCAN29'
#> ... Done loading TFBS(s) from TFregulome
#> Start analysing list x:MM1_HSA_K562_CEBPB... ...
#> ... ... Start analysing list y:MM1_HSA_K562_AFF1
#> ... ... Start analysing list y:MM1_HSA_K562_ARID2
#> ... ... Start analysing list y:MM1_HSA_K562_ARID3A
#> ... ... Start analysing list y:MM1_HSA_K562_ATF1
#>     ... ...
#> ... ... Start analysing list y:MM1_HSA_K562_ZSCAN29

We have implemented intersectPeakMatrixResult in the TFregulomeR package to allow easy extraction and interpretation of the intersectPeakMatrix output. It is worth noting that an intersection between sets A and B can be read in two ways: 1) the percentage of A overlapped with B, and 2) the percentage of B overlapped with A. The same principle applies to the output of intersectPeakMatrix.

The output of intersectPeakMatrix is an X*Y matrix table (X peak sets in peak list x and Y peak sets in peak list y), in which each table cell contains an intersectPeakMatrix class object. The IntersectPeakMatrix class is a purpose-designed S4 class, and each object contains:

Information of peak x subset overlapped with peak y:

overlap percentage in peak x;
motif in peak x overlapped with peak y;
tag enrichment in peak x overlapped with peak y;
DNA methylation in peak x overlapped with peak y, if peak x is derived from MethMotif in TFregulomeR compendium.
external signal score in peak x overlapped with peak y, if external signal file is provided.

and

Information of peak y subset overlapped with peak x:

overlap percentage in peak y;
motif in peak y overlapped with peak x;
tag enrichment in peak y overlapped with peak x;
DNA methylation in peak y overlapped with peak x, if peak y is derived from MethMotif in TFregulomeR compendium.
external signal score in peak y overlapped with peak x, if external signal file is provided.

By using return_intersection_matrix = TRUE and angle_of_matrix = "x" in the function intersectPeakMatrixResult, users can obtain an intersection matrix table. The table cell in row i and column j denotes the percentage of peak x(i) overlapped with peak y(j). If angle_of_matrix = "y", the table cell in row i and column j denotes the percentage of peak y(j) overlapped with peak x(i).
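
The difference between the two angles comes down to the denominator. The sketch below uses invented peak counts (all names and numbers are assumptions) to show that both tables share the same row/column layout, while the "x" reading divides by the size of peak set x(i) and the "y" reading divides by the size of peak set y(j).

```r
# Invented peak set sizes
n_peaks_x <- c(TF_A = 200, TF_B = 80)
n_peaks_y <- c(TF_C = 50, TF_D = 400)

# Invented numbers of overlapping peaks, counted from each side
x_hits <- matrix(c(40, 8, 100, 60), nrow = 2,
                 dimnames = list(names(n_peaks_x), names(n_peaks_y)))
y_hits <- matrix(c(35, 10, 90, 70), nrow = 2, dimnames = dimnames(x_hits))

# "x" angle: cell [i, j] is divided by the size of peak set x(i)
matrix_x <- 100 * x_hits / n_peaks_x
# "y" angle: same layout, but cell [i, j] is divided by the size of peak set y(j)
matrix_y <- 100 * sweep(y_hits, 2, n_peaks_y, "/")

print(matrix_x)
print(matrix_y)
```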

By using return_methylation_profile = TRUE and angle_of_methylation_profile = "x" in the function intersectPeakMatrixResult, users can obtain a DNA methylation matrix table. The table cell in row i and column j contains a vector with the statistics of the CpG methylation states within the peak x(i) overlapped with peak y(j). If angle_of_methylation_profile = "y", the table cell in row i and column j contains a vector with the statistics of the CpG methylation states within the peak y(j) overlapped with peak x(i).

By using save_MethMotif_logo = TRUE and angle_of_logo = "x" in the function intersectPeakMatrixResult, the function will plot and save the (Meth)Motif logo for each peak x intersected with peak y (if any of peak x is derived from the TFregulomeR compendium). If angle_of_logo = "y", the function will plot and save the (Meth)Motif logo for each peak y intersected with peak x (if any of peak y is derived from the TFregulomeR compendium). It is not necessary to plot the (Meth)Motif logos for every pair of intersections at once: one can focus on subsets of peak_list_x and peak_list_y, using saving_MethMotif_logo_x_id and saving_MethMotif_logo_y_id respectively.

By using return_tag_density = TRUE and angle_of_tag_density = "x" in the function intersectPeakMatrixResult, users can obtain a read enrichment matrix table. The table cell in row i and column j denotes a read enrichment value within the peak x(i) overlapped with peak y(j). If angle_of_tag_density = "y", the table cell in row i and column j denotes a read enrichment value within the peak y(j) overlapped with peak x(i). Five read enrichment values can be selected using tag_density_value: median, mean, SD (standard deviation), quartile_25 (first quartile) and quartile_75 (third quartile).
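
For orientation, the five summary values can be reproduced on a toy vector with base R. The input values are invented, and the quantile algorithm used inside the package is not stated here, so the default of quantile() is an assumption.

```r
# Invented read enrichment values for the peaks in one x(i)-y(j) intersection
tag_fold_change <- c(23.4, 10.9, 40.6, 23.3, 13.1, 20.3)

summary_values <- c(
  median      = median(tag_fold_change),
  mean        = mean(tag_fold_change),
  SD          = sd(tag_fold_change),
  quartile_25 = unname(quantile(tag_fold_change, 0.25)),
  quartile_75 = unname(quantile(tag_fold_change, 0.75))
)
print(round(summary_values, 2))
```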

Lastly, by using return_external_source = TRUE and angle_of_external_source = "x" in the function intersectPeakMatrixResult, users can obtain an external signal value matrix table. The table cell in row i and column j denotes an external signal value within the peak x(i) overlapped with peak y(j). If angle_of_external_source = "y", the table cell in row i and column j denotes an external signal value within the peak y(j) overlapped with peak x(i). Five external signal values can be selected using external_source_value: median, mean, SD (standard deviation), quartile_25 (first quartile) and quartile_75 (third quartile).

To make cofactor analysis more straightforward, a dedicated function, cofactorReport, has been designed. Simply by passing the output of intersectPeakMatrix to the function, a cofactor report will be saved as a PDF file for each peak x. The report lists the top cofactors of peak x along with their motif sequences, the DNA methylation within the motif and the 200bp peak regions, and the read enrichment scores (median, first quartile and third quartile).
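
The selection of top cofactors can be pictured with a short base R sketch. It assumes that a cofactor must reach a minimum co-binding percentage and that at most a fixed number of cofactors is kept, mirroring the 5% and 10 that cofactorReport prints in its log messages. Excluding the TF itself and the variable names used here are assumptions, not the package's documented behaviour.

```r
# Invented co-binding percentages for one peak set x
cobinding <- c(CEBPB = 100, CEBPD = 36.67, CTCF = 12.22, E4F1 = 10.00,
               FOS = 8.89, AFF1 = 1.11, ARID2 = 3.33)

# Assumed settings, mirroring the values reported in the log messages
cobinding_threshold <- 5   # minimum percent of co-binding peaks
max_cofactors <- 10        # maximum number of cofactors to report

# Drop the TF itself (assumption), keep candidates passing the threshold,
# rank them and keep the top ones
candidates <- cobinding[names(cobinding) != "CEBPB"]
passing <- sort(candidates[candidates >= cobinding_threshold], decreasing = TRUE)
top_cofactors <- head(passing, n = max_cofactors)
print(top_cofactors)
```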

# get the intersection matrix for K562 common peaks
IM_K562_common_result <- intersectPeakMatrixResult(intersectPeakMatrix = intersectMatrix_common, 
                                                   return_intersection_matrix = TRUE, 
                                                   angle_of_matrix = "x", 
                                                   return_methylation_profile = TRUE, 
                                                   angle_of_methylation_profile = "x",
                                                   return_tag_density = TRUE,
                                                   angle_of_tag_density = "x",
                                                   tag_density_value = "median")
#> Start getting the results of intersectPeakMatrix ...
#> ... ... You chose to return intersection matrix;
#> ... ... ... You chose x-wise intersection matrix;
#> ... ... You chose to return tag density;
#> ... ... ... the tag density value you chose to return is median
#> ... ... ... You chose to return tag density for peak list x;
#> ... ... You chose to return methylation profile;
#> ... ... ... You chose to return methylation profile for peak list x;
#> ... ... You chose NOT to save MethMotif logo in PDF if any;

names(IM_K562_common_result)
#> [1] "intersection_matrix"        "tag_density_matrix"         "methylation_profile_matrix"

# return intersection matrix table for K562 CEBPB shared peaks
IM_K562_common_intersect_matrix <- IM_K562_common_result$intersection_matrix
dim(IM_K562_common_intersect_matrix)
#> [1]   1 131 # 1 row represents the 1 input peak x, 131 columns represent the 131 peak y

# e.g. 1.111111%, 3.333333% and  1.111111% of K562 common peaks overlapped with MM1_HSA_K562_AFF1, MM1_HSA_K562_ARID2 and MM1_HSA_K562_ARID3A respectively.
IM_K562_common_intersect_matrix[1,1:3]
#>                    MM1_HSA_K562_AFF1 MM1_HSA_K562_ARID2 MM1_HSA_K562_ARID3A
#> MM1_HSA_K562_CEBPB          1.111111           3.333333            1.111111

# find the top 10 TFs co-binding in K562 CEBPB shared peaks
IM_K562_common_result_t <- as.data.frame(t(IM_K562_common_intersect_matrix))
# order by decreasing co-binding percentage (explicit column reference instead of attach/detach)
IM_K562_common_result_order <- IM_K562_common_result_t[order(-IM_K562_common_result_t$MM1_HSA_K562_CEBPB), , drop = FALSE]
head(IM_K562_common_result_order, n = 10)
#>                    MM1_HSA_K562_CEBPB
#> MM1_HSA_K562_CEBPB         100.000000
#> MM1_HSA_K562_CEBPD          36.666667
#> MM1_HSA_K562_CTCF           12.222222
#> MM1_HSA_K562_E4F1           10.000000
#> MM1_HSA_K562_JUND           10.000000
#> MM1_HSA_K562_FOS             8.888889
#> MM1_HSA_K562_JUN             8.888889
#> MM1_HSA_K562_CTCFL           7.777778
#> MM1_HSA_K562_ELF4            7.777778
#> MM1_HSA_K562_ATF4            6.666667

# The highest co-binding factor in K562 CEBPB common peaks is CEBPD.
# Then plot MethMotif logo for the K562 CEBPB common sites intersected with CEBPD peaks
intersectPeakMatrixResult(intersectPeakMatrix = intersectMatrix_common, 
                          save_MethMotif_logo = TRUE, 
                          angle_of_logo = "x", 
                          saving_MethMotif_logo_y_id = c("MM1_HSA_K562_CEBPD"))
#> Start getting the results of intersectPeakMatrix ...
#> ... ... You chose NOT to return intersection matrix;
#> ... ... You chose NOT to return methylation profile;
#> ... ... You chose to save MethMotif logo in PDF if any;
#> ... ... ... You chose x-wise MethMotif logo;
#> ... ... ... You chose entropy logo;
#> ... ... ... You chose to show all methylation levels;
#> Success: a PDF named 'MM1_HSA_K562_CEBPB_overlapped_with_MM1_HSA_K562_CEBPD-logo-entropy.pdf' 
#> has been saved!

# return DNA methylation matrix table for K562 CEBPB shared peaks
IM_K562_common_methylation <- IM_K562_common_result$methylation_profile_matrix
dim(IM_K562_common_methylation)
#> [1]   1 131 # 1 row represents the 1 input peak x, 131 columns represent the 131 peak y
rownames(IM_K562_common_methylation)
#> [1] "MM1_HSA_K562_CEBPB"
colnames(IM_K562_common_methylation)[1:5]
#> [1] "MM1_HSA_K562_AFF1"   "MM1_HSA_K562_ARID2"  "MM1_HSA_K562_ARID3A"
#> [4] "MM1_HSA_K562_ATF1"   "MM1_HSA_K562_ATF2"  

# the methylation level in the intersected regions between CEBPB and CEBPD in K562
# each cell in the matrix is a list()
IM_K562_common_methylation["MM1_HSA_K562_CEBPB","MM1_HSA_K562_CEBPD"][[1]]
#>         CpG_num
#> 0-10%        64 #64 CpG methylation scores are less than 0.1 (homogeneously unmethylated) in +/-100bp window around intersected peak summits
#> 10-20%        8
#> 20-30%        7
#> 30-40%        2
#> 40-50%        1
#> 50-60%        1
#> 60-70%        8
#> 70-80%        3
#> 80-90%       10
#> 90-100%      16 #16 CpG methylation scores are more than 0.9 (homogeneously methylated) in +/-100bp window around intersected peak summits


# return read enrichment matrix table for K562 CEBPB shared peaks
IM_K562_common_read_enrichment <- IM_K562_common_result$tag_density_matrix
IM_K562_common_read_enrichment[,seq(1,3,1)]
#>                    MM1_HSA_K562_AFF1 MM1_HSA_K562_ARID2 MM1_HSA_K562_ARID3A
#> MM1_HSA_K562_CEBPB          54.27878           34.56091            17.68113
# the median read enrichment value in the K562 CEBPB peaks overlapped with AFF1 is 54.27878

# Simply using cofactorReport(), cofactors for each peak x will be automatically reported
cofactorReport(intersectPeakMatrix = intersectMatrix_common)
#> Start cofactorReport ...
#> ... The maximum number of cofactors to be reported is 10
#> ... The minimum percent of co-binding peaks for a cofactor is 5%
#> ... Each peak set derived from TFregulomeR compendium in 'peak list x' will be reported in an individual PDF file
#> ... ... Start reporting peak id 'MM1_HSA_K562_CEBPB' ...
#> ... ... ... The number of cofactors passing 'cobinding_threshold' for peak id 'MM1_HSA_K562_CEBPB' is 15. Only top 10 will be selected.
#> ... ... ... Cofactor report for id 'MM1_HSA_K562_CEBPB' has been saved as MM1_HSA_K562_CEBPB_cofactor_report.pdf

# get the intersection matrix for K562 exclusive peaks
IM_K562_exclusive_result <- intersectPeakMatrixResult(intersectPeakMatrix = intersectMatrix_exclusive, 
                                                      return_intersection_matrix = TRUE, 
                                                      angle_of_matrix = "x", 
                                                      return_methylation_profile = TRUE, 
                                                      angle_of_methylation_profile = "x",
                                                      return_tag_density = TRUE,
                                                      angle_of_tag_density = "x",
                                                      tag_density_value = "median")
#> Start getting the results of intersectPeakMatrix ...
#> ... ... You chose to return intersection matrix;
#> ... ... ... You chose x-wise intersection matrix;
#> ... ... You chose to return tag density;
#> ... ... ... the tag density value you chose to return is median
#> ... ... ... You chose to return tag density for peak list x;
#> ... ... You chose to return methylation profile;
#> ... ... ... You chose to return methylation profile for peak list x;
#> ... ... You chose NOT to save MethMotif logo in PDF if any;

names(IM_K562_exclusive_result)
#> [1] "intersection_matrix"        "tag_density_matrix"         "methylation_profile_matrix"

# return intersection matrix table for K562 CEBPB exclusive peaks
IM_K562_exclusive_intersect_matrix <- IM_K562_exclusive_result$intersection_matrix
# e.g. 0.4160888%, 0% and  0.554785% of K562 exclusive peaks overlapped with MM1_HSA_K562_AFF1, MM1_HSA_K562_ARID2 and MM1_HSA_K562_ARID3A respectively.
IM_K562_exclusive_intersect_matrix[1,1:3]
#>                    MM1_HSA_K562_AFF1 MM1_HSA_K562_ARID2 MM1_HSA_K562_ARID3A
#> MM1_HSA_K562_CEBPB         0.4160888                  0            0.554785

# find the top 10 TFs co-binding in K562 exclusive results
IM_K562_exclusive_result_t <- as.data.frame(t(IM_K562_exclusive_intersect_matrix))
# order by decreasing co-binding percentage (explicit column reference instead of attach/detach)
IM_K562_exclusive_result_order <- IM_K562_exclusive_result_t[order(-IM_K562_exclusive_result_t$MM1_HSA_K562_CEBPB), , drop = FALSE]
head(IM_K562_exclusive_result_order, n = 10)
#>                      MM1_HSA_K562_CEBPB
#> MM1_HSA_K562_CEBPB           100.000000
#> MM1_HSA_K562_ATF4             35.367545
#> MM1_HSA_K562_TCF12            17.337032
#> MM1_HSA_K562_CBFA2T3          16.643551
#> MM1_HSA_K562_TAL1             16.366158
#> MM1_HSA_K562_GATA1            11.650485
#> MM1_HSA_K562_FOXM1             9.431345
#> MM1_HSA_K562_ATF7              8.876560
#> MM1_HSA_K562_NFE2              8.599168
#> MM1_HSA_K562_MAFG              7.073509

# The highest co-binding factor in K562 CEBPB exclusive peaks is ATF4.
# Then plot MethMotif logo for the K562 CEBPB exclusive sites intersected with ATF4 peaks
intersectPeakMatrixResult(intersectPeakMatrix = intersectMatrix_exclusive, 
                          save_MethMotif_logo = TRUE, 
                          angle_of_logo = "x", 
                          saving_MethMotif_logo_y_id = c("MM1_HSA_K562_ATF4"))
#> Start getting the results of intersectPeakMatrix ...
#> ... ... You chose NOT to return intersection matrix;
#> ... ... You chose NOT to return methylation profile;
#> ... ... You chose to save MethMotif logo in PDF if any;
#> ... ... ... You chose x-wise MethMotif logo;
#> ... ... ... You chose entropy logo;
#> ... ... ... You chose to show all methylation levels;
#> Success: a PDF named 'MM1_HSA_K562_CEBPB_overlapped_with_MM1_HSA_K562_ATF4-logo-entropy.pdf' 
#> has been saved!

# the methylation levels in the intersected regions between CEBPB and ATF4
IM_K562_exclusive_methylation <- IM_K562_exclusive_result$methylation_profile_matrix
IM_K562_exclusive_methylation["MM1_HSA_K562_CEBPB","MM1_HSA_K562_ATF4"][[1]]
#>         CpG_num
#> 0-10%       303
#> 10-20%       44
#> 20-30%       22
#> 30-40%       19
#> 40-50%       12
#> 50-60%       11
#> 60-70%        8
#> 70-80%       14
#> 80-90%       12
#> 90-100%       8

# Simply using cofactorReport(), cofactors for each peak x will be automatically reported
cofactorReport(intersectPeakMatrix = intersectMatrix_exclusive)
#> Start cofactorReport ...
#> ... The maximum number of cofactors to be reported is 10
#> ... The minimum percent of co-binding peaks for a cofactor is 5%
#> ... Each peak set derived from TFregulomeR compendium in 'peak list x' will be reported in an individual PDF file
#> ... ... Start reporting peak id 'MM1_HSA_K562_CEBPB' ...
#> ... ... ... The number of cofactors passing 'cobinding_threshold' for peak id 'MM1_HSA_K562_CEBPB' is 13. Only top 10 will be selected.
#> ... ... ... Cofactor report for id 'MM1_HSA_K562_CEBPB' has been saved as MM1_HSA_K562_CEBPB_cofactor_report.pdf

Figure 8. MethMotif logo of K562 CEBPB common peaks intersected with K562 CEBPD peaks

Figure 9. MethMotif logo of K562 CEBPB exclusive peaks intersected with K562 ATF4 peaks

Figure 10. cofactorReport output

Intersected peak matrix extended - TF interactome demo

The TF interactome is a global view of TF cooperativity in a cell type. The function intersectPeakMatrix implemented in TFregulomeR can perform TF interactome analysis using the TFregulomeR data compendium. By simply setting both the TF x and y lists in the input of intersectPeakMatrix to the same list of all TFs in a cell type, the pair-wise intersection profiles of every two TFs in the cell type will be charted. If the analysed cell type in the TFregulomeR data compendium originates from the MethMotif database, users can also profile the DNA methylation landscapes across the TF interactome by choosing methylation_profile_in_narrow_region = TRUE. This strategy is important for identifying mCpG-binding TF complexes in a cell type. Moreover, by taking advantage of the parameter external_source in intersectPeakMatrix, users can chart a signal of their own interest across the TF interactome. For example, if a DNase-seq bedgraph is provided (the fourth column is the DNase-seq read intensity), chromatin accessibility can be profiled across the TF interactome.

To facilitate TF interactome analysis and result interpretation, we have built a new function, interactome3D, in TFregulomeR (>= 1.2.2). The input of this function is the direct output of intersectPeakMatrix, and its output is a dynamic 3D html report showing the TF interactome coupled with CpG methylation and/or external signals (if return_interactome_with_mCpG = TRUE and/or return_interactome_with_external_source = TRUE in the function). The function proceeds as follows: 1) three matrix tables are obtained from the intersectPeakMatrix output, namely the TF intersection, CpG methylation and external signal value matrix tables; 2) both the rows and the columns of the TF intersection matrix table undergo hierarchical clustering with Euclidean distance to group the TFs that tend to co-localise with one another; 3) the CpG methylation and external signal value matrix tables are reordered row- and column-wise according to the clustered TF intersection matrix table; and 4) the TF intersection matrix is integrated with the CpG methylation and external signal value matrix tables separately to form two dynamic 3D graphs.
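
Steps 2 and 3 can be sketched with base R on invented matrices. The linkage method is not stated above, so the default of hclust() (complete linkage) is an assumption, as are all names and numbers; this is not the internal code of interactome3D.

```r
# Toy TF intersection matrix (invented percentages)
tf <- c("TF1", "TF2", "TF3", "TF4")
intersection <- matrix(c(100,  10,  80,  15,
                          12, 100,   9,  70,
                          75,  11, 100,  14,
                          13,  65,  10, 100),
                       nrow = 4, byrow = TRUE, dimnames = list(tf, tf))

# Companion matrix (for example, mCpG percentages) with the same layout
methylation <- matrix(seq_len(16), nrow = 4, dimnames = list(tf, tf))

# Cluster rows and columns separately using Euclidean distance
row_order <- hclust(dist(intersection, method = "euclidean"))$order
col_order <- hclust(dist(t(intersection), method = "euclidean"))$order

# Reorder both tables with the same permutations so that they stay aligned
intersection_clustered <- intersection[row_order, col_order]
methylation_reordered  <- methylation[row_order, col_order]
print(intersection_clustered)
```

In this toy example TF1 and TF3 share similar profiles, as do TF2 and TF4, so each pair ends up in adjacent rows and columns after clustering.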

In the dynamic 3D graph, the x and y axes show the TFs in a cell type, the color shade denotes the percentage of TFx peaks overlapped with TFy, and the z axis shows the percentage of mCpGs (the beta score threshold can be modified via mCpG_threshold in interactome3D) or the external signal value (selected via external_source_value in interactome3D). In particular, a z axis value of -1 means that either 1) there are no overlapped peak regions between TFx and TFy, or 2) the CpG methylation information or external signal value is not available (methylation_profile_in_narrow_region = FALSE, the analysed cell type is not from MethMotif, or no external signal was provided during the intersection matrix analysis).
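
As a rough illustration of how a percentage of mCpGs could be derived from a binned methylation profile, the sketch below applies a threshold of 0.8 to the CEBPB-CEBPD profile shown earlier. The helper pct_mCpG() is hypothetical, and the rule that a CpG counts as methylated when its bin starts at or above the threshold is an assumption; the package may compute this value differently.

```r
# CpG counts per beta-score bin, copied from the CEBPB-CEBPD profile shown earlier
cpg_num <- c("0-10%" = 64, "10-20%" = 8, "20-30%" = 7, "30-40%" = 2, "40-50%" = 1,
             "50-60%" = 1, "60-70%" = 8, "70-80%" = 3, "80-90%" = 10, "90-100%" = 16)

# Hypothetical helper (not a TFregulomeR function)
pct_mCpG <- function(cpg_num, mCpG_threshold = 0.8) {
  # Use -1 as a placeholder when no CpG is available
  if (sum(cpg_num) == 0) return(-1)
  # Lower edge of each 10% bin as a fraction: 0, 0.1, ..., 0.9
  bin_lower <- (seq_along(cpg_num) - 1) / 10
  # Small tolerance guards against floating-point comparison issues
  methylated <- bin_lower >= mCpG_threshold - 1e-9
  100 * sum(cpg_num[methylated]) / sum(cpg_num)
}

print(pct_mCpG(cpg_num))
```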

In the following small demo, we profile TF interactome among CEBPA, CEBPB, CEBPD, and CEBPG coupled with CpG methylation and chromatin accessibility in HepG2.

# We generated a DNase-seq bedgraph in HepG2 (the fourth column is the normalised read intensity). To speed up the analysis, we did not use the whole-genome DNase-seq bedgraph; instead, we selected only the regions overlapped by all TF cistromes in HepG2.
DNase <- read.table("HepG2_DNase-seq-UW_in_HepG2_cistrome.txt")

HepG2_CEBP_id <- c("MM1_HSA_HepG2_CEBPA","MM1_HSA_HepG2_CEBPB",
                   "MM1_HSA_HepG2_CEBPD","MM1_HSA_HepG2_CEBPG")

HepG2_CEBP_interactome <- intersectPeakMatrix(peak_id_x = HepG2_CEBP_id,
                                              motif_only_for_id_x = TRUE,
                                              peak_id_y = HepG2_CEBP_id,
                                              motif_only_for_id_y = TRUE, 
                                              methylation_profile_in_narrow_region = TRUE,
                                              external_source = DNase)
# Simply use interactome3D to generate TF interactome report
interactome3D(intersectPeakMatrix = HepG2_CEBP_interactome,
              return_interactome_with_mCpG = TRUE, 
              return_interactome_with_external_source = TRUE)
#> Start interactome3D ...
#> ... You chose to report TF interactome coupled with DNA methylation ...
#> ... ... The mCpG threshold you chose is 0.8
#> ... You chose to report TF interactome coupled with external source signal ...
#> ... ... The external source signal value you chose is median
#> ... report of TF interactome with mCpG portion has been saved as 'TF_interactome_with_mCpG.html'
#> ... report of TF interactome with external source signal has been saved as 'TF_interactome_with_external_source.html'

Export motif PFM and beta score matrix

As mentioned above, exportMMPFM is designed not only for searchMotif outputs but is also compatible with the outputs of commonPeaks, exclusivePeaks and intersectPeakMatrix. Here, we export the motif PFMs and beta score matrices for the K562 CEBPB common and exclusive peaks, as well as for the common peaks intersected with CEBPD peaks and the exclusive peaks intersected with ATF4 peaks.

# export motif PFM and beta score matrix for K562 CEBPB common peaks
exportMMPFM(fun_output = commonPeak_output, 
            fun = "commonPeaks", 
            save_motif_PFM = TRUE, 
            save_betaScore_matrix = TRUE)
#> Start exporting ... ...
#> ... ... You chose to save motif PFM and beta score matrix.
#> ... ... export commonPeaks...
#> ... ... ... export id = MM1_HSA_K562_CEBPB_common_peaks
#> ... ... ... ... Beta score matrix has been saved as 'MM1_HSA_K562_CEBPB_common_peaks-methScore.txt'.
#> ... ... ... ... Motif PFM has been saved as 'MM1_HSA_K562_CEBPB_common_peaks-motif-MEME.txt'.
#> ... ... ... export id = HCT116_CEBPB_common_peaks
#> ... ... ... the original peaks of HCT116_CEBPB_common_peaks is not loaded from TFregulomeR, 
#> or in the common peak the number of TFBS is zero. Hence no further action for this id!

# export motif PFM and beta score matrix for K562 CEBPB exclusive peaks
exportMMPFM(fun_output = exclusivePeak_output, 
            fun = "exclusivePeaks", 
            save_motif_PFM = TRUE, 
            save_betaScore_matrix = TRUE)
#> Start exporting ... ...
#> ... ... You chose to save motif PFM and beta score matrix.
#> ... ... export exclusivePeaks...
#> ... ... ... export id = MM1_HSA_K562_CEBPB_exclusive_peaks
#> ... ... ... ... Beta score matrix has been saved as 'MM1_HSA_K562_CEBPB_exclusive_peaks-methScore.txt'.
#> ... ... ... ... Motif PFM has been saved as 'MM1_HSA_K562_CEBPB_exclusive_peaks-motif-MEME.txt'.

# export motif PFM and beta score matrix for K562 CEBPB common peaks intersected with K562 CEBPD peaks
exportMMPFM(fun_output = intersectMatrix_common, 
            fun = "intersectPeakMatrix", 
            save_motif_PFM = TRUE, 
            save_betaScore_matrix = TRUE, 
            angle_of_matrix_for_intersectPeakMatrix = "x", 
            saving_id_y_for_intersectPeakMatrix = c("MM1_HSA_K562_CEBPD"))
#> Start exporting ... ...
#> ... ... You chose to save motif PFM and beta score matrix.
#> ... ... export intersectPeakMatrix...
#> ... ... we will export in the x wide of intersectPeakMatrix output since the input angle_of_matrix_for_intersectPeakMatrix = 'x'
#> ... ... ... export id = MM1_HSA_K562_CEBPB
#> ... ... ... ... Beta score matrix has been saved as 'MM1_HSA_K562_CEBPB_overlapped_with_MM1_HSA_K562_CEBPD-methScore.txt'.
#> ... ... ... ... Motif PFM has been saved as 'MM1_HSA_K562_CEBPB_overlapped_with_MM1_HSA_K562_CEBPD-motif-MEME.txt'.

# export motif PFM and beta score matrix for K562 CEBPB exclusive peaks intersected with K562 ATF4 peaks
exportMMPFM(fun_output = intersectMatrix_exclusive, 
            fun = "intersectPeakMatrix", 
            save_motif_PFM = TRUE, 
            save_betaScore_matrix = TRUE, 
            angle_of_matrix_for_intersectPeakMatrix = "x", 
            saving_id_y_for_intersectPeakMatrix = c("MM1_HSA_K562_ATF4"))
#> Start exporting ... ...
#> ... ... You chose to save motif PFM and beta score matrix.
#> ... ... export intersectPeakMatrix...
#> ... ... we will export in the x wide of intersectPeakMatrix output since the input angle_of_matrix_for_intersectPeakMatrix = 'x'
#> ... ... ... export id = MM1_HSA_K562_CEBPB
#> ... ... ... ... Beta score matrix has been saved as 'MM1_HSA_K562_CEBPB_overlapped_with_MM1_HSA_K562_ATF4-methScore.txt'.
#> ... ... ... ... Motif PFM has been saved as 'MM1_HSA_K562_CEBPB_overlapped_with_MM1_HSA_K562_ATF4-motif-MEME.txt'.
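
After several exports, the working directory contains one PFM file and one beta score matrix per exported ID. The sketch below shows one way to inventory them in base R. It assumes only the file-name suffixes visible in the log messages above; the temporary directory and the empty placeholder files are created purely so that the example is self-contained.

```r
# Suffixes taken from the log messages above.
pfm_suffix  <- "-motif-MEME.txt"
beta_suffix <- "-methScore.txt"

# For a self-contained demonstration, create empty placeholder files in a
# temporary directory; in practice, point `out_dir` at the export folder.
out_dir <- tempfile("exportMMPFM_demo_")
dir.create(out_dir)
ids <- c("MM1_HSA_K562_CEBPB_common_peaks",
         "MM1_HSA_K562_CEBPB_exclusive_peaks")
file.create(file.path(out_dir, paste0(ids, pfm_suffix)))
file.create(file.path(out_dir, paste0(ids, beta_suffix)))

# Collect the PFM files and recover the export ID from each file name.
pfm_files    <- list.files(out_dir, pattern = "-motif-MEME\\.txt$")
exported_ids <- sub("-motif-MEME\\.txt$", "", pfm_files)

# Check that each PFM file has a matching beta score matrix.
has_beta <- file.exists(file.path(out_dir, paste0(exported_ids, beta_suffix)))
print(data.frame(id = exported_ids, has_beta = has_beta))
```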

Motif distribution

TFregulomeR also allows users to plot the distribution of a given TFBS from the TFregulomeR warehouse across a list of peak sets, using motifDistrib and plotDistrib sequentially. Given a TFregulomeR ID in id, motifDistrib calculates the occurrences of the TFBS in each peak set supplied in peak_list. A unique ID for each element of peak_list must be provided at the same time using peak_id. If a peak set is derived from the TFregulomeR compendium, provide its TFregulomeR ID; if it is self-provided, you can give it any unique ID of your own.

Note that although the peak regions loaded from the TFregulomeR compendium are peak summits, you do not need to expand them. Once you tell motifDistrib that a peak set comes from TFregulomeR by providing its TFregulomeR ID in peak_id, it will handle those peak regions automatically.

The output of motifDistrib is the input of plotDistrib. In each motif distribution plot, the x-axis is the relative distance (bp) to the peak center, and the y-axis is the percentage of TFBSs at that position.

Here, we show the distributions of the K562 CEBPB motif in the K562 CEBPB exclusive peaks and in our own peaks, loaded from a local file.

# loading my peaks
my_peak_path <- system.file("extdata", "HCT116_CEBPb_binding_sites.txt", package = "TFregulomeR")
my_peak <- read.delim(my_peak_path, sep = "\t", header = FALSE)

motifDistrib_output <- motifDistrib(id = "MM1_HSA_K562_CEBPB", 
                                    peak_list = list(K562_CEBPB_exclusive_peak,
                                                     my_peak),
                                    peak_id = c("MM1_HSA_K562_CEBPB","my_peak"))
#> motifDistrib starts analysing for MethMotif ID = MM1_HSA_K562_CEBPB
#> ... ... analysing peak set MM1_HSA_K562_CEBPB
#> ... ... analysing peak set my_peak
plotDistrib(motifDistrib = motifDistrib_output)
#> Distribution of motif MM1_HSA_K562_CEBPB in peak set MM1_HSA_K562_CEBPB has been saved!
#> Distribution of motif MM1_HSA_K562_CEBPB in peak set my_peak has been saved!

Figure 11. Motif distribution
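
The quantity shown on the y-axis can be illustrated with a small base R sketch. It assumes that each motif occurrence has already been reduced to its distance (bp) from the peak center. The positions and the window of 5 bp on either side are made up for illustration, and the window and binning that motifDistrib uses internally are not specified here.

```r
# Made-up distances (bp) of motif occurrences relative to the peak center.
rel_pos <- c(-3, -1, 0, 0, 0, 1, 1, 2, 5, -1)

# Count occurrences at each position within a window of +/- 5 bp;
# positions without any occurrence get a count of zero.
window <- -5:5
counts <- table(factor(rel_pos, levels = window))

# Convert counts to percentages of all motif occurrences in the window.
pct <- 100 * as.numeric(counts) / sum(counts)
names(pct) <- window

# x-axis: relative distance to the peak center; y-axis: percentage of TFBSs.
print(round(pct, 1))
```

A distribution that peaks sharply at 0 indicates that the motif tends to sit at the peak center, whereas a flat profile suggests no positional preference within the peaks.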

Annotate TFBS locations

TFregulomeR can annotate TFBS genomic locations using genomeAnnotate. Annotation proceeds in the following order: promoter, TTS, exon, 5’ UTR exon, 3’ UTR exon, intron and intergenic region. By default, a promoter is defined as the range from 1000 bp upstream to 100 bp downstream of the TSS, and a TTS region as the range from 100 bp upstream to 1000 bp downstream of the TTS. Users can change these ranges using promoter_range and TTS_range, respectively. genomeAnnotate returns a data.frame containing the annotation results and also saves an HTML report.

# annotate the locations of K562 CEBPB exclusive peaks
# loading UCSC knownGene
library(TxDb.Hsapiens.UCSC.hg38.knownGene)

K562_CEBPB_exclusivePeak_loc <- genomeAnnotate(peaks = K562_CEBPB_exclusive_peak, 
                                               return_annotation = TRUE, 
                                               return_html_report = TRUE)
#> Start genomeAnnotate ...
#> ... ... You chose to return annotated results in a data.frame.
#> ... ... You chose to return an HTML report.
#> ... ... annotating promoters defined as upstream 1000bp and downstream 100bp
#> ... ... annotating TTS defined as upstream 100bp and downstream 1000bp
#> ... ... annotating exons
#> ... ... annotating 5' UTR
#> ... ... annotating 3' UTR
#> ... ... annotating introns
#> ... ... annotating intergenic regions
#> ... ... An html report has been generated as 'genomeAnnotate_result.html'!
#> ... ... The annotation results have been returned in a data.frame!

head(K562_CEBPB_exclusivePeak_loc)
#>    chr     start       end                      id   annotation geneName
#> 1 chr3 133629333 133629333  genomeAnnotate_peak_68 promoter-TSS   TOPBP1
#> 2 chr1 155858041 155858041 genomeAnnotate_peak_255 promoter-TSS    GON4L
#> 3 chr3 193662882 193662882 genomeAnnotate_peak_280 promoter-TSS     OPA1
#> 4 chr4  72038972  72038972 genomeAnnotate_peak_426 promoter-TSS   NPFFR2
#> 5 chr4  74448009  74448009 genomeAnnotate_peak_447 promoter-TSS     AREG
#> 6 chr4 143905637 143905637 genomeAnnotate_peak_635 promoter-TSS     GYPE
#>                                                                                                    transcript
#> 1                                                                         ENST00000513818.1;ENST00000506779.1
#> 2 ENST00000368331.5;ENST00000271883.9;ENST00000620426.4;ENST00000361040.9;ENST00000471341.5;ENST00000622608.1
#> 3                                                                                           ENST00000445863.1
#> 4                                                                         ENST00000395999.5;ENST00000358749.3
#> 5                                                                                           ENST00000511560.1
#> 6                                                       ENST00000358615.8;ENST00000506264.5;ENST00000437468.2
#>             distanceToTSS
#> 1                 941;602
#> 2 745;745;745;836;823;745
#> 3                       8
#> 4                 205;401
#> 5                     629
#> 6               77;117;73
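
The default promoter window can be made concrete with a strand-aware sketch in base R. in_promoter is a hypothetical helper, not a TFregulomeR function, and the coordinates are invented. How genomeAnnotate treats strand and window boundaries internally is an assumption here.

```r
# Hypothetical helper (not part of TFregulomeR): is a position inside the
# promoter window around a TSS? The defaults mirror the documented
# 1000 bp upstream / 100 bp downstream definition.
in_promoter <- function(pos, tss, strand, upstream = 1000, downstream = 100) {
  # Signed offset along the direction of transcription:
  # negative = upstream of the TSS, positive = downstream.
  offset <- ifelse(strand == "+", pos - tss, tss - pos)
  offset >= -upstream & offset <= downstream
}

# Invented coordinates for illustration.
in_promoter(pos = 9500,  tss = 10000, strand = "+")  # 500 bp upstream: TRUE
in_promoter(pos = 10500, tss = 10000, strand = "+")  # 500 bp downstream: FALSE
in_promoter(pos = 10500, tss = 10000, strand = "-")  # 500 bp upstream on "-": TRUE
```

Because the window is asymmetric, the same genomic position can fall inside or outside a promoter depending on the strand of the gene, which is why the offset is computed along the direction of transcription.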

TFregulomeR - genomeAnnotate Result

Table - genomic annotations of the peaks:

chr	start	end	annotation	geneName	transcript	distanceToTSS
chr3	133629333	133629333	promoter-TSS	TOPBP1	uc062nzf.1;uc062nzg.1	941;602
chr1	155858041	155858041	promoter-TSS	GON4L	uc001flz.4;uc057lsw.1;uc057lsx.1;uc001fmc.5;uc057lta.1;uc057ltb.1	745;745;745;836;823;745
chr3	193662882	193662882	promoter-TSS	OPA1	uc062rmv.1	8
chr4	72038972	72038972	promoter-TSS	NPFFR2	uc003hgi.3;uc003hgh.3	205;401
chr4	74448009	74448009	promoter-TSS	AREG	uc062xhs.1	629
chr4	143905637	143905637	promoter-TSS	GYPE	uc003ijj.4;uc062zwx.1;uc003ijk.5	77;117;73
chr4	144019438	144019438	promoter-TSS	GYPB	uc062zxb.1;uc003ijm.2;uc062zxc.1;uc062zxd.1;uc062zxe.1;uc011chw.2;uc011chx.2;uc062zxf.1;uc062zxg.1;uc062zxh.1;uc062zxi.1;uc062zxj.1;uc062zxk.1;uc062zxl.1;uc062zxm.1	98;98;100;113;113;113;113;113;113;113;113;135;100;92;92
chr4	144140785	144140785	promoter-TSS	GYPA	uc003ijo.5;uc011cib.3;uc062zxp.1;uc062zxq.1;uc010ioq.4;uc003ijp.5;uc010ior.4;uc062zxr.1;uc062zxs.1;uc062zxt.1	33;91;115;112;149;149;149;149;91;91
chr5	34526833	34526833	promoter-TSS		uc063cvq.1	531
chr5	65654961	65654961	promoter-TSS	TRAPPC13	uc063eaw.1	672
chr5	154835258	154835258	promoter-TSS	FAXDC2	uc063jab.1;uc063jac.1;uc063jaj.1	303;169;15
chr1	202441817	202441817	promoter-TSS	PPP1R12B	uc057ome.2	708
chr7	151969726	151969726	promoter-TSS	GALNTL5	uc064jlx.1	888
chr8	65790050	65790050	promoter-TSS	PDE7A	uc003xvp.4	965
chr8	67013338	67013338	promoter-TSS	PPP1R42	uc064nlb.1;uc064nle.1	356;332
chr8	141340554	141340554	promoter-TSS	LINC01300	uc003ywe.3	4
chr9	97232341	97232341	promoter-TSS	AL590705.2	uc064upp.1	859
chr9	131078469	131078469	promoter-TSS	LAMC3	uc064wol.1	773
chr1	246306156	246306156	promoter-TSS	SMYD3	uc057qyk.1	146
chr10	1159665	1159665	promoter-TSS	LINC00200	uc010qag.1;uc057rfe.1	102;138
chr11	6563474	6563474	promoter-TSS	DNHD1	uc057ymf.1;uc057ymg.1	602;962
chr11	13924915	13924915	promoter-TSS	AC022240.1	uc057zhm.1	51
chr11	60056606	60056606	promoter-TSS	MS4A3	uc001noo.4;uc058bso.1;uc001non.4;uc001nom.4;uc058bsp.1	18;21;21;76;96
chr11	71705585	71705585	promoter-TSS	AP003498.1	uc058evw.1	180
chr11	71706193	71706193	promoter-TSS	AP003498.1	uc058evw.1	788
chr1	31945341	31945341	promoter-TSS	PTP4A2	uc057eht.1	484
chr11	116776684	116776684	promoter-TSS	ZPR1	uc058hqk.1	807
chr11	119108733	119108733	promoter-TSS	DPAGT1	uc031yhj.1	401
chr12	57767233	57767233	promoter-TSS	CYP27B1	uc058qcl.1;uc001spz.2;uc058qco.1;uc058qcp.1	137;17;721;776
chr14	72724354	72724354	promoter-TSS	DPF3	uc001xnd.3;uc059czs.1	248;582
chr15	51603818	51603818	promoter-TSS	DMXL2	uc059jgn.1	116
chr16	153259	153259	promoter-TSS	HBM	uc059ogw.1	632
chr19	45263801	45263801	promoter-TSS	MARK4	uc060zxf.1	885
chr1	94902064	94902064	promoter-TSS	CNN3	uc057imh.1	190
chr22	27990887	27990887	promoter-TSS	TTC28	uc062cto.1	810
chr3	9978368	9978368	promoter-TSS	EMC3	uc062gqg.1	70
chr3	57661617	57661617	promoter-TSS	DENND6A	uc062kxv.1	790
chr5	43486535	43486535	TTS	C5orf34	uc003jnz.3	28609
chr6	2990804	2990804	TTS	LINC01011	uc063llg.1;uc063llk.1	2836;2145
chr6	6655187	6655187	TTS	LY86	uc063lqc.1;uc003mwy.2	67078;66517
chr6	41934725	41934725	TTS	CCND3	uc003orn.4;uc032wvt.2;uc063omo.1;uc003orm.5;uc011duk.4;uc003orp.5;uc003oro.5;uc063omp.1	7122;114168;115631;6793;114168;114242;7029;113982
chr6	159806249	159806249	TTS	PNLDC1	uc063sqf.1;uc063sqg.1	5977;5954
chr8	123512396	123512396	TTS	FBXO32	uc064pyk.1	28792
chrX	48940870	48940870	TTS	AC233300.1	uc064zcn.1	877
chr1	227749098	227749098	TTS	SNAP47	uc057qag.1;uc057qar.1	20479;12924
chr11	18278601	18278601	TTS	HPS5	uc001moe.2;uc001mod.2;uc001mof.2	43572;43572;43572
chr11	117171490	117171490	TTS	PAFAH1B2	uc001pqe.4	27222
chr12	130669440	130669440	TTS	AC095350.1	uc058ved.1;uc058vef.1	10671;1970
chr16	20826112	20826112	TTS	REXO5	uc059rtb.1;uc059rtg.1;uc059rth.1	19494;19354;18954
chr16	31083329	31083329	TTS	PRSS53	uc002eaq.4;uc059tkn.1;uc059tko.1	5633;6298;11272
chr16	75578332	75578332	TTS	GABARAPL2	uc002fen.3;uc059xgx.1;uc059xgy.1;uc059xha.1	11980;11942;11733;2723
chr17	8120204	8120204	TTS	HES7	uc002gkb.3	3879
chr19	4654843	4654843	TTS	AC005339.1	uc060rye.1	680
chr19	11835775	11835775	TTS	ZNF440	uc002msp.2	21490
chr2	20592243	20592243	TTS	HS1BP3	uc061gzb.1	32529
chr2	68283668	68283668	TTS	CNRIP1	uc002sej.5	35731
chr2	74163128	74163128	TTS	MOB1A	uc002ski.3	15739
chr2	106521250	106521250	TTS	AC108868.1	uc061mqn.1	16828
chr2	169095822	169095822	TTS	DHRS9	uc061phj.1	26245
chr2	236073466	236073466	TTS	RN7SL204P	uc061tyg.1	1170
chr21	9328528	9328528	TTS	CR381653.1	uc061yzz.1	3468
chr3	101325638	101325638	TTS	SENP7	uc003dus.5;uc003duv.5	7630;187584
chr3	133748573	133748573	exon	TF	uc003epv.2;uc062nzm.1;uc062nzn.1;uc062nzp.1;uc062nzq.1;uc062nzr.1;uc062nzu.1;uc062nzv.1	2616;2532;2244;2179;2179;2179;1422;432
chr3	142713294	142713294	exon	PLS1	uc010huv.4;uc003euz.4	116906;89907
chr3	182784793	182784793	exon	AC069431.1	uc062qml.1	8600
chr4	8479388	8479388	exon	TRMT44	uc003gli.1	5871
chr4	124543668	124543668	exon	LINC02516	uc003ife.1	14765
chr5	10734238	10734238	exon	DAP	uc063cjd.1	26960
chr6	13979603	13979603	exon	RNF182	uc021ylw.2;uc021ylx.2;uc003nbe.4;uc003nbg.4	55156;54735;54735;54409
chr6	107699530	107699530	exon	AL096816.1	uc063qkh.1	687
chr8	76853173	76853173	exon	ZFHX4	uc003yau.3;uc064nvd.1	171953;149127
chr8	102257371	102257371	exon	AP002907.1	uc064pfv.1	978
chr9	6893515	6893515	exon	KDM4C	uc064sbq.1	135411
chr9	108903654	108903654	exon	ELP1	uc004bdm.5;uc010mtq.4	30461;30298
chrX	53374700	53374700	exon	SMC1A	uc004dsg.4;uc011moe.3	48027;48027
chr16	76635537	76635537	exon	AC104151.1	uc059xij.1;uc002fey.1	82119;538
chr17	33931279	33931279	exon		uc060dxq.1	4926
chr17	76689992	76689992	exon	MXRA7	uc060klv.1	226
chr19	32628841	32628841	exon	ANKRD27	uc002ntn.2;uc060wrv.1	46357;14458
chr19	46410798	46410798	exon	CCDC8	uc002pep.4	2785
chr1	86455890	86455890	exon	CLCA2	uc001dlr.5	31803
chr2	10057244	10057244	exon	CYS1	uc002rag.3	23166
chr2	30352194	30352194	exon	LINC01936	uc002rng.4	5534
chr2	130340353	130340353	exon	CCDC115	uc061ntn.1	1781
chr2	151658013	151658013	exon	NEB	uc061opi.1;uc002txu.3;uc021vrc.1;uc031rpp.1;uc032ofo.1;uc061opl.1;uc061opm.1	76473;76473;76473;76473;76473;75142;75142
chr2	151853483	151853483	exon	CACNB4	uc002tya.5;uc061oqf.1;uc002txy.5;uc061oqg.1;uc002txz.5;uc061oqh.1;uc010fnz.5	245595;244977;120456;120456;245237;120451;245539
chr2	178402071	178402071	exon	OSBPL6	uc002ulx.4	207421
chr1	11796319	11796319	exon	MTHFR	uc057chn.1;uc057cho.1;uc001atc.3;uc001atd.2	6925;7194;9738;9093
chr21	22210190	22210190	exon	AP000561.1	uc061zfw.1	250
chr21	36278075	36278075	exon	DOP1B	uc002yvg.4	113539
chr21	39323043	39323043	exon	BRWD1-AS1	uc031rvp.2	7335
chr3	133851829	133851829	intron	RAB6B	uc003epy.4;uc062oad.1;uc062oae.1;uc011blu.2;uc062oaf.1;uc062oag.1;uc062oah.1	43986;43765;43803;43623;43656;44006;42889
chr3	140237275	140237275	intron	CLSTN2	uc003etn.4;uc062ojy.1	302089;302087
chr3	150451455	150451455	intron	TSC22D2	uc003exv.4;uc062ozq.1;uc062ozr.1;uc062ozs.1	43119;40485;40445;40275
chr3	151047864	151047864	intron	CLRN1-AS1	uc011bny.2;uc003eyl.2	195379;75185
chr3	169781416	169781416	intron	MYNN	uc003fft.4;uc062pus.1;uc003ffu.4;uc010hwo.3;uc062puu.1	8584;8010;7988;7150;2126
chr3	171077467	171077467	intron	TNIK	uc003fhh.3;uc062pyb.1;uc003fhl.3;uc003fhi.3;uc003fhj.3;uc003fhk.3;uc003fhm.3;uc003fhn.3;uc003fho.3;uc003fhp.4;uc062pyd.1	382940;24604;382940;382595;382595;382595;382595;382595;382595;5214;2115
chr3	171443371	171443371	intron	TNIK	uc003fhh.3;uc003fhl.3;uc003fhi.3;uc003fhj.3;uc003fhk.3;uc003fhm.3;uc003fhn.3;uc003fho.3;uc062pyg.1	17036;17036;16691;16691;16691;16691;16691;16691;17033
chr3	179375349	179375349	intron	MFN1	uc003fjs.4;uc062qem.1;uc062qeo.1;uc062qeq.1	27656;27613;26587;12990
chr3	184270272	184270272	intron		uc062qqt.1;uc003fni.5	134895;20576
chr3	195714907	195714907	intron	MUC20-OT1	uc062ruk.1;uc062rul.1;uc062rvb.1;uc062rvc.1;uc062rvd.1;uc062rve.1;uc062rvf.1;uc062rvg.1;uc062rvh.1;uc062rvi.1;uc062rvj.1;uc062rvk.1;uc062rvl.1;uc062rvm.1;uc062rvn.1;uc062rvo.1;uc062rvp.1;uc062rvq.1;uc062rvr.1;uc062rvs.1;uc062rvt.1;uc062rvu.1;uc062rvv.1;uc062rvw.1;uc062rvx.1;uc062rvy.1;uc062rvz.1;uc062rwa.1;uc062rwb.1;uc062rwc.1;uc062rwd.1;uc062rwi.1;uc062rwj.1;uc062rwk.1;uc062rwl.1;uc062rwm.1;uc062rwn.1;uc062rwo.1;uc062rwp.1;uc062rwq.1;uc062rwr.1;uc062rws.1;uc062rwt.1;uc062rwu.1;uc062rwv.1;uc062rww.1;uc062rwx.1;uc062rwy.1;uc062rwz.1;uc062rxa.1;uc062rxb.1;uc062rxc.1;uc062rxd.1;uc062rxe.1;uc062rxf.1;uc062rxg.1;uc062rxh.1	13349;13267;6752;6752;6752;6752;6752;6752;6752;6752;6752;6752;6752;6752;6752;6752;6752;6752;6752;6752;6752;6752;6752;6752;6752;6752;6752;6752;6751;6748;6744;6728;6728;6728;6728;6728;6728;6728;6728;6728;6728;6728;6728;6728;6728;6726;6723;6721;6700;6664;6656;6620;6618;6426;6323;6249;6180
chr3	196143570	196143570	intron	LINC00885	uc011bts.2	933
chr4	9925832	9925832	intron	SLC2A9	uc003gmc.4;uc003gmd.4;uc062vdm.1;uc062vdn.1	95650;129042;114415;114377
chr4	17771099	17771099	intron	FAM184B	uc003gpm.5	10412
chr4	26346435	26346435	intron	RBPJ	uc062vsk.1;uc062vsl.1;uc062vsm.1;uc062vsn.1;uc003gry.3;uc003grx.3;uc003grz.3;uc062vso.1;uc062vsp.1;uc003gsb.3;uc062vsq.1;uc003gsa.3;uc062vsr.1;uc062vss.1;uc062vst.1;uc062vsu.1;uc062vsv.1;uc062vsw.1;uc062vsx.1;uc062vsy.1;uc062vsz.1	182979;73827;26772;26748;26724;26724;25871;25709;25647;25634;25627;25627;25427;25420;25414;25069;24832;24270;24225;23564;3294
chr4	56002114	56002114	intron	CEP135	uc003hbi.5;uc062wub.1	53132;37247
chr4	83495630	83495630	intron	ABRAXAS1	uc062xwm.1	27717
chr4	86917985	86917985	intron	AC093827.5	uc062xzo.1;uc062xzt.1	16746;18192
chr4	109148639	109148639	intron	COL25A1	uc062yyd.1;uc062yyf.1;uc062yyg.1;uc062yyh.1	153727;154003;153727;154003
chr4	109149626	109149626	intron	COL25A1	uc062yyd.1;uc062yyf.1;uc062yyg.1;uc062yyh.1	152740;153016;152740;153016
chr4	109174280	109174280	intron	COL25A1	uc062yyd.1;uc062yyf.1;uc062yyg.1;uc062yyh.1	128086;128362;128086;128362
chr4	109177243	109177243	intron	COL25A1	uc062yyd.1;uc062yyf.1;uc062yyg.1;uc062yyh.1	125123;125399;125123;125399
chr4	113614462	113614462	intron	CAMK2D	uc003ibj.4;uc003ibk.3;uc003ibo.5;uc003ibi.4;uc062zdz.1;uc062zea.1;uc003ibm.3;uc062zeb.1;uc003ibl.3;uc003ibn.3	146982;147464;146755;146605;146605;146605;147100;146720;146667;146605
chr4	127968202	127968202	intron	ABHD18	uc021xrt.2;uc062zmm.1;uc062zmn.1;uc062zmo.1;uc062zmp.1;uc062zmq.1	2895;2886;2824;2790;2727;2695
chr4	128941182	128941182	intron	SCLT1	uc003igp.3;uc003ign.3;uc010iob.2;uc062zns.1;uc062znt.1	152424;31189;152227;152229;56102
chr4	140492568	140492568	intron	MGAT4D	uc062ztr.1;uc062zts.1;uc062ztu.1;uc062ztv.1;uc031shc.2;uc062ztw.1;uc062ztx.1;uc062zty.1	5784;5683;5801;5763;5653;5653;5761;5808
chr4	144104991	144104991	intron	AC139713.2	uc062zwz.1	151355
chr4	154691442	154691442	intron	LRAT	uc063akf.1;uc063akg.1;uc003iol.4	64496;64480;64369
chr4	174658502	174658502	intron	GLRA3	uc003ity.3;uc003itz.3	170811;170724
chr5	14273025	14273025	intron	TRIO	uc063ckk.1;uc063ckl.1;uc003jff.4;uc063ckm.1;uc011cna.1	129322;129322;129322;129187;89226
chr1	167863292	167863292	intron	ADCY10	uc010plj.2;uc001ger.4;uc009wvk.4	50922;50809;50809
chr5	16593790	16593790	intron	RETREG1	uc003jfs.4	23267
chr5	18972453	18972453	intron	AC106744.1	uc063cny.1	169892
chr1	167869681	167869681	intron	ADCY10	uc010plj.2;uc001ger.4;uc009wvk.4	44533;44420;44420
chr5	36880872	36880872	intron	NIPBL	uc003jkl.5;uc003jkk.5;uc063dar.1	4112;4081;3870
chr5	37225834	37225834	intron	CPLANE1	uc063day.1;uc011cpa.1	21957;23593
chr5	39006855	39006855	intron	RICTOR	uc003jlp.4;uc063ddm.1;uc003jlo.4;uc063ddr.1;uc063dds.1;uc063ddt.1	67552;67533;67543;67533;67543;67533
chr1	14774515	14774515	intron	KAZN	uc057cmd.1;uc001avm.5;uc031tqj.2	175810;175810;175797
chr5	55304769	55304769	intron	DHX29	uc003jpx.4;uc063dpv.1;uc010ivw.4;uc063dpx.1	2952;2904;2924;2770
chr5	69305706	69305706	intron	CCDC125	uc003jvv.2;uc063efy.1;uc003jvy.4;uc063efz.1;uc003jvw.3	27009;14876;14876;27102;27086
chr5	72108067	72108067	intron	MAP1B	uc063ekc.1;uc003kbw.5;uc063ekd.1;uc063eke.1;uc063ekf.1	832;832;589;589;580
chr5	73126622	73126622	intron	TMEM171	uc003kcm.3;uc003kcn.4	6329;6052
chr5	80234449	80234449	intron	SERINC5	uc003kgk.4;uc063exm.1;uc284pwz.1;uc011ctj.2;uc063exo.1;uc063exq.1	21569;21569;21569;21629;21564;20748
chr1	14800970	14800970	intron	KAZN	uc057cmd.1;uc001avm.5;uc031tqj.2	202265;202265;202252
chr1	172402145	172402145	intron	DNM3	uc057nia.1;uc001gie.5;uc001gif.5;uc001gih.2;uc057nif.1;uc057nih.1;uc001gii.2	560644;560644;560644;23126;13180;13230;41937
chr5	89343862	89343862	intron	MEF2C-AS1	uc063fgz.1	454416
chr5	90575983	90575983	intron	ADGRV1	uc063fie.1;uc003kju.4	46638;17182
chr5	118894955	118894955	intron	DTWD2	uc063ggh.1;uc003ksa.4;uc063ggi.1;uc063ggj.1	93264;93589;93583;93583
chr5	120507352	120507352	intron	PRR16	uc003ksq.4;uc003ksp.4;uc063gil.1	43073;43027;41720
chr5	122863586	122863586	intron	SNX24	uc063gkv.1;uc063gkw.1;uc011cwo.3;uc010jcy.4;uc063gkx.1;uc003ktf.3;uc063gky.1	20146;19353;18136;18067;17980;17959;17922
chr5	126882023	126882023	intron	MARCH3	uc003kuf.4	148784
chr5	126984058	126984058	intron	MARCH3	uc003kuf.4;uc063gox.1	46749;46110
chr1	174746423	174746423	intron	RABGAP1L	uc001gjx.4;uc057nln.2;uc001gkc.5;uc057nlo.2;uc057nlp.1;uc001gkd.5;uc057nlq.1	586905;46009;45897;45691;45691;45402;45400
chr5	138571540	138571540	intron	HSPA9	uc003ldf.4;uc063hly.1;uc063hmb.1;uc063hmc.1;uc063hmd.1;uc063hme.1;uc063hmf.1;uc063hmg.1	3903;3860;3799;3862;3860;3625;3870;3468
chr5	149968454	149968454	intron	SLC26A2	uc003lrh.4;uc063ipj.1	7716;7668
chr1	176879933	176879933	intron	ASTN1	uc001glb.3;uc001glc.5;uc001gld.3	284933;284756;284749
chr5	160048221	160048221	intron	TTC1	uc032vqo.2;uc003lxu.5;uc063jft.1;uc063jfu.1;uc063jfv.1	39107;39107;11714;9305;5087
chr5	171940555	171940555	intron	FBXW11	uc003mbl.2;uc003mbm.2;uc003mbn.2;uc063jqn.1;uc063jqr.1;uc063jqs.1;uc063jqt.1;uc063jqu.1;uc063jqv.1	66317;66084;66094;66093;66103;65992;66093;37200;66013
chr5	179927994	179927994	intron	RNF130	uc063kss.1	143726
chr6	1631412	1631412	intron	GMDS	uc021ykn.2;uc003mtq.4;uc063ljy.1	544578;614279;299586
chr6	10102587	10102587	intron	OFCC1	uc010joj.2;uc003myp.2;uc063lsx.1	12186;109020;108952
chr6	10887496	10887496	intron	AL024498.2	uc063luy.1;uc063lvn.1;uc003mzo.4	139701;664;664
chr6	11097700	11097700	intron	SMIM13	uc011dio.3;uc063lvv.1	3666;3468
chr6	15020316	15020316	intron	AL138720.1	uc063lzj.1;uc063lzk.1	1647;69686
chr6	20292385	20292385	intron	AL158198.1	uc063mcj.1	80297
chr6	21101963	21101963	intron	CDKAL1	uc003ndd.3;uc021ymk.2	567505;555621
chr6	26058824	26058824	intron		uc063mgc.1	65079
chr6	28615827	28615827	intron	ZBED9	uc063mlu.1	384
chr6	30872029	30872029	intron	DDR1-DT	uc063mvd.1	3888
chr6	34695006	34695006	intron	C6orf106	uc003ojr.4;uc003ojs.4	1840;1852
chr6	34969194	34969194	intron	ANKS1A	uc003ojx.5	79928
chr6	35267174	35267174	intron	ZNF76	uc011dsx.3;uc063obh.1;uc063obi.1;uc011dsy.2;uc063obj.1;uc063obk.1;uc003oki.2;uc003okj.2	8264;8191;7702;7675;7645;7569;7501;7399
chr6	35409455	35409455	intron	PPARD	uc003okm.3;uc011dtb.2;uc011dtc.2;uc063obq.1;uc003okn.4	66896;66896;66896;66840;66840
chr6	41976445	41976445	intron	CCND3	uc032wvt.2;uc063omo.1;uc011duk.4;uc003orp.5;uc063omp.1;uc063omq.1;uc063oms.1;uc063omu.1;uc063omx.1;uc063omy.1;uc063omz.1;uc063ona.1;uc063onb.1;uc063onc.1	72448;73911;72448;72522;72262;72258;72244;72242;72230;72244;72244;72242;72240;72242
chr6	42712126	42712126	intron	PRPH2	uc003osk.4	10447
chr1	185208384	185208384	intron	SWT1	uc001grg.5;uc001grh.5	51303;51224
chr6	47669886	47669886	intron	ADGRF2	uc063oyn.1;uc063oyo.1;uc003oyy.3	13449;13398;13295
chr6	52393164	52393164	intron	PAQR8	uc063pae.1;uc003pao.4;uc063paf.1	31742;31035;30717
chr6	63638016	63638016	intron	PHF3	uc063pif.1;uc063pig.1;uc063pih.1;uc063pii.1;uc003pep.2;uc063pij.1;uc003peo.4	2195;2192;2190;2179;2179;1904;1534
chr6	68728311	68728311	intron	ADGRB3	uc003pev.5;uc063pjs.1	92943;89634
chr6	79524004	79524004	intron	LCA5	uc003pix.4;uc003piy.3;uc063psb.1	13403;13425;13453
chr6	96788056	96788056	intron	AL033379.1	uc063qea.1	2918
chr6	100876037	100876037	intron	ASCC3	uc003pqk.5;uc063qha.1;uc003pql.5;uc063qhb.1;uc063qhc.1;uc003pqm.5	5326;5334;5326;5263;5310;5273
chr6	105035511	105035511	intron	LIN28B	uc003pqv.2	78462
chr6	108574662	108574662	intron	FOXO3	uc003psk.3;uc003psm.2	14826;13795
chr6	111439359	111439359	intron	REV3L	uc063qqq.1;uc063qqr.1;uc063qqs.1;uc003puz.5;uc003puy.6;uc063qqt.1	43559;43851;43869;44355;44355;44355
chr6	111782358	111782358	intron	FYN	uc063qry.1;uc003pvk.3;uc003pvj.4;uc063qrz.1;uc063qsk.1;uc063qsm.1;uc063qsn.1;uc063qsp.1;uc063qsq.1;uc063qsr.1;uc063qss.1;uc063qst.1;uc063qsu.1;uc063qsv.1;uc063qsw.1	90708;91093;11527;91071;90678;90922;91081;16064;76024;62585;90942;37719;11553;37621;64256
chr1	15862825	15862825	intron	SPEN	uc001axk.2	14960
chr6	118696476	118696476	intron	CEP85L	uc003pya.3;uc063qze.1;uc003pyc.5	13598;13598;13591
chr6	119066797	119066797	intron	FAM184A	uc063qzu.1;uc003pyj.5;uc003pyk.6;uc003pyl.6;uc063raa.1;uc063rab.1	11849;11945;82589;82389;11574;12381
chr6	122712776	122712776	intron	PKIB	uc031spm.2;uc003pyz.4;uc003pza.4;uc063rba.1;uc003pzb.4;uc003pzc.5	240858;240844;102543;102543;102543;60043
chr6	129508039	129508039	intron	LAMA2	uc063rgw.1;uc063rgx.1;uc063rgy.1	624897;624897;624841
chr6	143167460	143167460	intron	AIG1	uc063ryr.1;uc063rys.1;uc003qjh.5;uc011edw.3;uc003qjg.5;uc063ryu.1;uc063ryv.1;uc063ryx.1;uc063ryy.1	106963;106606;106583;106560;106558;106558;106514;41209;2515
chr6	146771213	146771213	intron	ADGB	uc010khx.4;uc063scd.1;uc063scf.1;uc063scg.1;uc063sch.1;uc063sci.1;uc063scj.1	172247;172207;54329;34694;773;767;767
chr6	146914687	146914687	intron	STXBP5-AS1	uc063scl.1;uc003qlt.3;uc003qlu.1;uc063sco.1;uc063scp.1;uc063scr.1	1197;289852;289926;217230;204775;217298
chr6	157521810	157521810	intron	ZDHHC14	uc003qqs.4;uc003qqt.4	140676;140676
chr6	157527842	157527842	intron	ZDHHC14	uc003qqs.4;uc003qqt.4	146708;146708
chr6	157842390	157842390	intron	SNX9	uc063smu.1;uc063smv.1;uc003qqv.3	142002;126343;19125
chr6	157892513	157892513	intron	SNX9	uc003qqv.3;uc063smx.1;uc063sna.1	69248;25263;272
chr6	167856032	167856032	intron	AFDN	uc063sxm.1;uc063sxn.1;uc063sxo.1;uc003qwc.3;uc021zij.1;uc063sxp.1;uc063sxq.1;uc063sxr.1;uc021zim.2;uc021zik.2	29109;29070;29070;29040;29040;29031;28898;28898;28898;28396
chr7	2960101	2960101	intron	CARD11	uc003smv.5;uc064bbr.1	83843;83743
chr7	5453240	5453240	intron	FBXL18	uc064bec.1	60511
chr7	30154455	30154455	intron	MTURN	uc003tar.2;uc064cmc.1;uc064cmf.1	19644;19466;8653
chr7	30896204	30896204	intron	AC004691.2	uc064cor.1	43930
chr7	39126088	39126088	intron	POU6F2	uc003thb.3;uc064czc.1;uc010kxo.4;uc064czd.1;uc064cze.1;uc064czf.1;uc064czg.1;uc064czh.1	148089;119279;113006;91752;42526;41032;40322;40226
chr7	65273031	65273031	intron	AC104073.4	uc284qac.1	3656
chr7	87825634	87825634	intron	RUNDC3B	uc003ujb.4;uc003ujc.4;uc011khe.3;uc064fbo.1	197220;197179;197018;125217
chr7	93651783	93651783	intron	GNGT1	uc064fkd.1	60209
chr7	97127083	97127083	intron	SDHAF3	uc003uoo.5;uc064fpq.1;uc064fpr.1	10492;9381;9359
chr7	100745458	100745458	intron	ZAN	uc032zzh.1;uc032zzi.1;uc032zzj.2;uc032zzk.2;uc032zzl.2;uc064ghk.1;uc064ghl.1;uc064ghm.1	11831;11831;11831;11831;11831;11831;11436;11436
chr7	103510850	103510850	intron	AC005064.1	uc033abm.1;uc010liz.3;uc064grw.1;uc064grx.1	65642;478665;478665;478665
chr1	204401079	204401079	intron	AL606489.1	uc057oqp.1	23228
chr7	108605940	108605940	intron	AC005487.1	uc064hby.1	7564
chr7	112968039	112968039	intron	HRAT17	uc033adw.2	27594
chr7	119838635	119838635	intron	LINC02476	uc064hll.1;uc064hlm.1	68739;68739
chr7	119905708	119905708	intron	LINC02476	uc064hll.1;uc064hlm.1	1666;1666
chr7	129635734	129635734	intron	NRF1	uc003voz.4;uc064hxl.1;uc003vpa.4;uc064hxm.1;uc003vpb.4;uc064hxn.1	24019;23986;23977;23977;5665;5655
chr7	130436456	130436456	intron	CEP41	uc011kpg.3;uc003vpz.5;uc064hzu.1;uc003vqa.5;uc064hzz.1;uc064iab.1;uc064iac.1;uc064iad.1;uc064iaf.1;uc003vqb.2;uc064iag.1;uc064iah.1	4753;4780;4570;4538;5976;4600;4569;4391;4599;4753;4572;4567
chr7	136049312	136049312	intron	AC024084.1	uc064iij.1	23550
chr1	206717628	206717628	intron	MAPKAPK2	uc001hel.3;uc001hem.3	32683;32590
chr7	136070332	136070332	intron	AC024084.1	uc064iij.1	44570
chr7	136079586	136079586	intron	AC024084.1	uc064iij.1	53824
chr7	143413985	143413985	intron	EPHA1-AS1	uc003wda.6;uc064iwj.1	6171;6151
chr1	207500453	207500453	intron	CR1	uc057ozn.1;uc057ozo.1;uc057ozp.1;uc057ozq.1;uc057ozr.1;uc057ozs.1;uc057ozt.1;uc001hfx.4;uc057ozu.1;uc057ozv.1;uc057ozw.1;uc057ozx.1	4305;4295;4295;4295;4224;4214;4191;4184;4111;4111;4111;4111
chr7	152424380	152424380	intron	KMT2C	uc003wla.3;uc064jml.1;uc064jmm.1;uc064jmv.1	11624;11624;11624;11418
chr7	158703157	158703157	intron	NCAPG2	uc064jsz.1;uc003wnv.3;uc003wnx.3;uc011kwe.3	546;1671;1646;1640
chr8	783049	783049	intron	DLGAP2	uc033azs.2;uc064jxd.1	45397;45306
chr8	6570764	6570764	intron	MCPH1	uc003wqi.4;uc064kao.1;uc064kap.1	164171;93497;93404
chr8	30134053	30134053	intron	LEPROTL1	uc064lrw.1;uc003xhx.3;uc064lrx.1;uc010lvg.4	38627;38572;38565;10632
chr8	39952207	39952207	intron	IDO2	uc003xno.2;uc010lwy.1;uc064mgi.1;uc064mgb.1	17592;17251;3040;43040
chr8	42033966	42033966	intron	KAT6A	uc003xon.4;uc010lxb.3;uc010lxc.3;uc010lxd.3	18020;18020;18020;18020
chr8	42251984	42251984	intron	AC083973.1	uc064mjy.1;uc064mjz.1	18926;11421
chr8	65768571	65768571	intron	MTFR1	uc064nhx.1;uc064nhy.1;uc003xvq.4;uc003xvp.4;uc003xvr.4;uc064nig.1;uc064nih.1;uc064nii.1	61525;61517;73380;20512;72936;73516;70176;73750
chr8	68077517	68077517	intron	PREX2	uc003xxv.2;uc003xxu.2;uc064nml.1	125398;125148;125027
chr8	72113799	72113799	intron	MSC-AS1	uc064nqk.1	269640
chr8	76993048	76993048	intron	PEX2	uc003yax.4;uc022awf.2;uc003yay.4;uc064nvm.1;uc064nvn.1	7177;7898;7218;7995;7181
chr1	17674943	17674943	intron	ARHGEF10L	uc001ban.4;uc001bao.4;uc057cum.1;uc010ocs.3;uc057cun.1	135107;94389;59854;56626;48935
chr8	95226570	95226570	intron	C8orf37-AS1	uc064otk.1	22113
chr8	95251764	95251764	intron	C8orf37-AS1	uc064otk.1;uc003yho.3	47307;17436
chr8	95278147	95278147	intron	C8orf37-AS1	uc064otk.1;uc064otl.1	73690;9310
chr8	98762059	98762059	intron	STK3	uc010mbm.3;uc003yip.5;uc003yio.5;uc064oxn.1;uc064oxq.1;uc064oxu.1	63611;63621;180511;63611;63586;63606
chr8	123106286	123106286	intron	TBC1D31	uc003ypp.3;uc064pwn.1;uc011lig.3;uc064pwo.1;uc064pwq.1;uc064pwr.1;uc064pww.1	33605;33603;33572;33572;33526;33519;5332
chr8	128067211	128067211	intron	PVT1	uc003ysl.4;uc064qeg.1;uc064qeh.1;uc064qei.1;uc064qek.1	176582;78026;77993;77949;57552
chr8	129582389	129582389	intron	CCDC26	uc064qez.1	97849
chr9	357761	357761	intron	DOCK8	uc003zgf.2;uc022bcu.1;uc064rul.1;uc010mgv.4;uc064run.1	142895;84712;84701;84690;84676
chr9	5632746	5632746	intron	RIC1	uc003zjh.5;uc003zjl.5;uc003zji.5	3720;3626;3626
chr1	219170394	219170394	intron	LYPLAL1-DT	uc057pmi.1	3566
chr9	97281099	97281099	intron	SUGT1P4-STRA6LP	uc011luu.3;uc011lur.3;uc064upr.1;uc064upt.1;uc064upu.1	42649;42615;42601;30269;30269
chr9	98688949	98688949	intron	GABBR2	uc004ays.5	20247
chr9	105760245	105760245	intron	TMEM38B	uc004bcu.3;uc064uxk.1;uc064uxl.1	65700;59260;54569
chr9	121253263	121253263	intron	GSN	uc064vku.1;uc064vkv.1	45468;45462
chr9	128914147	128914147	intron	LRRC8A	uc004bwl.5;uc010myp.4;uc010myq.4;uc064whi.1	32034;31950;31644;1402
chrX	9545498	9545498	intron	TBL1X	uc010ndq.3;uc010ndr.3;uc064xxu.1;uc004csq.5;uc064xxv.1;uc004csr.4;uc064xxw.1;uc064xxx.1	82202;82202;82177;82168;82166;80489;80203;10554
chrX	9864143	9864143	intron	SHROOM2	uc004csu.2	77686
chrX	10882685	10882685	intron	AC073529.1	uc064xyt.1	228423
chrX	11410043	11410043	intron	ARHGAP6	uc064xyw.1;uc004cup.2;uc064xyx.1;uc064xyy.1;uc004cur.1	17729;255657;255657;255039;255657
chrX	11588232	11588232	intron	ARHGAP6	uc004cup.2;uc064xyx.1;uc064xyy.1;uc004cur.1	77468;77468;76850;77468
chrX	37760241	37760241	intron	AF241726.2	uc064yne.1	410910
chrX	38139127	38139127	intron	AF241726.2	uc064yne.1	789796
chrX	44314343	44314343	intron	EFHC2	uc004dgb.5	29328
chrX	66024776	66024776	intron	VSIG4	uc064zqm.1;uc004dwh.3;uc004dwj.4;uc004dwi.3	8887;15330;15348;15283
chrX	77527434	77527434	intron	ATRX	uc004ecp.5;uc004ecq.6;uc065aay.1;uc065aaz.1	258834;258781;258781;69440
chrX	110327508	110327508	intron	AMMECR1	uc004eoq.4	112724
chrX	119549118	119549118	intron	CXorf56	uc004erj.3;uc011mtu.3	16254;16289
chrX	132046707	132046707	intron	STK26	uc004ewk.2;uc011mux.2;uc010nrj.2;uc004ewl.2;uc004ewm.2	23441;23404;23404;23401;23098
chr1	228290711	228290711	intron	OBSCN	uc001hsn.4;uc001hsq.2;uc009xez.2;uc057qea.1;uc057qeh.1	82580;82580;82580;82580;9937
chrX	152156085	152156085	intron	AC116666.1	uc065brk.1	17201
chrX	152343507	152343507	intron	GABRA3	uc010ntk.2;uc065brl.1	107850;45989
chrX	152412797	152412797	intron	GABRA3	uc010ntk.2	38560
chrX	152444521	152444521	intron	GABRA3	uc010ntk.2	6836
chr1	230127422	230127422	intron	GALNT2	uc057qgq.1;uc010pwa.2	69431;60212
chr1	231166591	231166591	intron	TRIM67	uc057qih.1;uc057qii.1;uc009xfn.2	4478;3620;3620
chr1	231418937	231418937	intron	EGLN1	uc001huv.2;uc057qiv.1	6106;1405
chr1	233665711	233665711	intron	KCNK1	uc010pxo.1;uc057qlj.1;uc031pst.2;uc057qlk.1;uc057qll.1;uc057qlm.1	51706;36152;36103;29186;15769;10015
chr1	235113597	235113597	intron	TOMM20	uc001hwl.4;uc057qmx.1;uc057qmy.1	15338;6301;8861
chr1	241697509	241697509	intron	WDR64	uc001hzg.3;uc057qtb.1;uc057qtc.1	45230;13928;13928
chr1	241698346	241698346	intron	WDR64	uc001hzg.3;uc057qtb.1;uc057qtc.1	46067;14765;14765
chr1	244999874	244999874	intron	EFCAB2	uc057qxh.1;uc001ibc.3;uc001ibd.3;uc057qxj.1;uc010pyp.2;uc057qxk.1;uc057qxl.1;uc001ibe.3;uc057qxm.1;uc057qxn.1	30168;30004;29891;29587;29546;29522;29517;29483;29468;29450
chr1	247858827	247858827	intron	TRIM58	uc001ido.4	1627
chr1	21777643	21777643	intron	USP48	uc001bfc.4;uc001bfb.4;uc010odq.3;uc001bff.4;uc057dci.1;uc057dcj.1;uc057dck.1	5551;5962;5347;5962;4376;5582;5299
chr10	17680486	17680486	intron	STAM	uc001ipj.3;uc057sbp.1;uc057sbq.1	36360;36277;36277
chr10	21962078	21962078	intron	DNAJC1	uc001irc.4;uc057sei.1;uc057sej.1;uc057sek.1	41646;41506;41320;41690
chr10	35182841	35182841	intron	CREM	uc001ixy.4;uc057ssd.1;uc057ssh.1;uc057ssi.1;uc057ssj.1;uc057ssl.1;uc057ssm.1;uc001ixz.4;uc001iya.4;uc057ssq.1;uc001iyc.4;uc001iyb.4;uc057ssr.1;uc001iyd.4;uc001iye.4;uc057sst.1;uc057ssu.1;uc057ssv.1;uc057ssw.1;uc001iyf.4;uc031pty.2;uc057ssy.1;uc057ssz.1;uc031pua.2;uc001iyh.4;uc057sta.1;uc057stb.1;uc057std.1	56049;56030;55892;55881;55851;55777;55749;55723;55678;55383;55383;55380;55363;45004;45004;45004;45004;34468;34439;15324;15114;15114;7253;7247;7247;7244;7242;3951
chr10	42680478	42680478	intron	LINC01518	uc057svq.1	11244
chr10	45152826	45152826	intron	AL358394.2	uc001jca.3	5431
chr1	23830366	23830366	intron	HMGCL	uc057dho.1	8253
chr10	50572096	50572096	intron	SGMS1	uc021pqo.2;uc001jje.4;uc057tgg.1;uc057tgj.1;uc057tgn.1	51880;51886;51880;52889;53066
chr10	58350782	58350782	intron	UBE2D1	uc001jke.3;uc021prc.2	15806;15802
chr10	61803469	61803469	intron	AC022398.1	uc057tna.1	17776
chr10	69334079	69334079	intron	HK1	uc057tth.1;uc057tti.1;uc057ttj.1;uc001jpg.5;uc001jph.5;uc057ttl.1;uc057ttm.1;uc057ttn.1;uc057ttp.1;uc057ttq.1;uc001jpk.5;uc001jpl.5;uc057ttr.1	64094;64078;64078;64078;64078;64073;64051;55550;51523;18293;18224;15234;14967
chr10	86759613	86759613	intron	BMPR1A	uc001kdy.4	2962
chr10	86981176	86981176	intron	ADIRF	uc057urb.1;uc057urd.1	12685;10434
chr10	89701911	89701911	intron	KIF20B	uc001kgr.3;uc001kgs.3;uc057uur.1	300;293;231
chr10	95532746	95532746	intron	SORBS1	uc001kkw.4;uc001kkv.4;uc001kku.3;uc057vai.1;uc057vaj.1;uc057vak.1;uc057van.1	28631;28631;28667;28631;28631;28608;28626
chr1	26287240	26287240	intron	UBXN11	uc001bly.4;uc057dpy.1;uc057dpz.1;uc057dqa.1;uc057dqb.1;uc057dqc.1;uc001blw.4;uc001bma.4;uc057dqd.1;uc057dqe.1;uc057dqf.1;uc057dqg.1;uc057dqh.1	19395;19399;19379;19395;19391;19392;19404;31024;31122;10785;19368;15926;15775
chr10	107047383	107047383	intron	SORCS1	uc001kym.4	117150
chr10	117272839	117272839	intron	SLC18A2	uc001ldd.3;uc057wfh.1	31745;31745
chr10	119075783	119075783	intron	EIF3A	uc001ldu.4;uc057whi.1	5020;5039
chr10	124640874	124640874	intron	AC068896.1	uc001lhu.2;uc001lhv.2;uc057wpv.1	150852;103175;102095
chr11	3517121	3517121	intron	AC127526.1	uc057yce.1;uc057ycf.1;uc057ycg.1	47801;64089;54865
chr11	10447867	10447867	intron	AMPD3	uc057yzt.1	130195
chr11	14652639	14652639	intron	PDE3B	uc001mln.4;uc010rcr.3;uc057ziu.1	8915;8807;8378
chr1	5917936	5917936	intron	NPHP4	uc057bsd.1;uc057bse.1;uc001alq.3;uc057bsf.1;uc057bsj.1	74487;74536;74534;74510;74536
chr11	16364467	16364467	intron	SOX6	uc001mmg.3;uc001mmf.4;uc001mmd.4;uc057zjt.1;uc057zjz.1;uc001mmh.2;uc001mmj.4;uc057zkb.1;uc057zkc.1	111920;44416;38378;38395;33094;374175;241523;17930;69982
chr11	17606821	17606821	intron	OTOG	uc031pzc.1;uc057znr.1;uc001mnh.1	59447;59447;28556
chr11	19174211	19174211	intron	ZDHHC13	uc001mpi.4;uc001mpj.4;uc057zti.1	57081;57065;57054
chr11	32424146	32424146	intron	WT1	uc001mtl.3;uc058ade.1;uc001mtn.4;uc001mtm.3;uc058adf.1;uc058adg.1;uc058adj.1	6666;11240;11483;6666;11198;11388;4882
chr11	44177047	44177047	intron	EXT2	uc001mxz.4;uc009ykt.4;uc001mya.4;uc058aqk.1;uc058aqm.1;uc058aqo.1	81497;81329;80849;80823;52177;47003
chr11	46543768	46543768	intron	AMBRA1	uc001ncu.3;uc001ncw.4;uc001ncx.4;uc010rgu.3;uc058awt.1	47595;50179;50252;47654;50276
chr11	57544768	57544768	intron	SMTNL1	uc021qjh.1;uc058bmq.1	2126;2124
chr11	65457389	65457389	intron	AP000769.1	uc058djw.1	2075
chr11	82793961	82793961	intron	AP001646.3	uc058fym.1	2298
chr11	87315417	87315417	intron	TMEM135	uc001pci.4;uc058ghp.1;uc001pch.4	277572;277466;277409
chr11	93344963	93344963	intron	DEUP1	uc058gly.1;uc001pdo.2;uc001pdp.4;uc058glz.1;uc058gma.1;uc058gmb.1;uc001pdq.5;uc058gmc.1;uc058gmd.1;uc058gme.1	14991;14972;14964;14963;14360;14317;14245;12749;12746;12740
chr11	109766151	109766151	intron	AP002444.1	uc058hdc.1	57695
chr11	114265282	114265282	intron	NNMT	uc058hnf.1;uc001por.2;uc058hng.1;uc058hnc.1	7494;7450;2393;91288
chr12	982154	982154	intron	RAD52	uc001qiu.2;uc001qiy.4;uc058jqx.1;uc058jqy.1	7898;7886;8967;9035
chr12	7748943	7748943	intron	CLEC4C	uc001qth.1;uc058kts.1	529;2661
chr12	8325307	8325307	intron	AC092745.1	uc058kwf.1;uc058kwg.1;uc058kwe.1	3430;3353;71495
chr12	12392674	12392674	intron	BORCS5	uc058lih.1;uc001ral.3;uc058lii.1;uc009zhu.3	35594;35552;35315;35255
chr12	23127887	23127887	intron	AC084816.1	uc001rfu.1;uc058lya.1;uc058lyb.1	163314;73649;28518
chr12	23716081	23716081	intron	SOX5	uc058lyf.1;uc058lyg.1;uc001rfw.4;uc058lyh.1;uc058lyi.1;uc010siv.4;uc058lyj.1;uc058lyk.1;uc058lym.1;uc058lyn.1	179944;233621;233621;233565;179945;234950;234838;22424;22424;18633
chr12	23887627	23887627	intron	SOX5	uc058lyf.1;uc058lyg.1;uc001rfw.4;uc058lyh.1;uc058lyi.1;uc010siv.4;uc058lyj.1;uc058lyp.1;uc001rga.3;uc058lyq.1	8398;62075;62075;62019;8399;63404;63292;63279;63392;56724
chr1	36247779	36247779	intron	THRAP3	uc001cae.5;uc001caf.5;uc057eud.1	23362;23346;23345
chr12	50416279	50416279	intron	LARP4	uc058nxs.1;uc001rwr.3;uc058nxt.1;uc058nxu.1;uc058nxv.1;uc058nxw.1;uc001rwp.3;uc058nxx.1;uc058nxy.1;uc001rwm.4;uc058nxz.1;uc001rws.3;uc058nya.1;uc058nyb.1;uc001rwn.4;uc001rwq.3;uc058nyc.1;uc058nyd.1;uc058nye.1	23895;15469;15421;15389;15383;15382;15379;15371;15370;15370;15349;15349;15346;15328;15322;15293;15277;15170;15085
chr12	51021044	51021044	intron	SLC11A2	uc001rxc.5;uc058nza.1;uc001rxe.5;uc001rxi.4;uc001rxk.2;uc058nzc.1;uc001rxh.3;uc058nzd.1;uc058nzh.1;uc058nzi.1;uc058nzj.1;uc058nzk.1;uc058nzl.1;uc058nzn.1;uc058nzo.1;uc058nzq.1;uc058nzr.1;uc058nzs.1	5314;5323;5293;5339;7230;7521;5096;5355;5348;5339;5346;5339;5318;5339;5339;5320;5334;3798
chr12	68181099	68181099	intron	IFNG-AS1	uc058qot.1	191569
chr12	89494093	89494093	intron	POC1B	uc001tbb.4;uc001tbc.4;uc001tba.4;uc058row.1;uc058rox.1;uc058roy.1;uc058rpa.1;uc058rpc.1;uc058rpd.1;uc058rpe.1	31521;31930;31469;31906;31903;31111;31899;31321;31926;31906
chr12	93938014	93938014	intron	AC012464.1	uc058ruc.1	5588
chr1	40182390	40182390	intron	RLF	uc001cfc.5	21016
chr12	110421268	110421268	intron	AC144548.1	uc058tdf.1	24279
chr1	40277083	40277083	intron	ZMPSTE24	uc001cfg.5	18975
chr12	110687911	110687911	intron	HVCN1	uc001trq.1;uc001trs.3;uc058tfo.1;uc010syd.2;uc058tfp.1;uc058tfr.1;uc058tfs.1	1505;1245;797;1866;1897;17038;1907
chr12	111585962	111585962	intron		uc058tha.1;uc058thb.1;uc001tsj.3;uc058thc.1;uc058thd.1;uc058the.1;uc001tsh.4;uc001tsi.4;uc001tsm.2;uc058thu.1;uc058thv.1	12859;13387;13713;13713;13713;13313;13710;13710;12603;13099;11844
chr12	121738505	121738505	intron	TMEM120B	uc058uil.1;uc001ubc.5	25752;25709
chr12	123801783	123801783	intron	DNAH10	uc001uft.5;uc058uve.1;uc010taw.2;uc058uvf.1	39594;39287;20678;20655
chr12	125111609	125111609	intron	AACS	uc001uhc.4;uc058uza.1;uc058uzb.1	46229;10714;6457
chr12	126945284	126945284	intron	LINC02405	uc001uho.4	115112
chr13	25200974	25200974	intron	LINC01076	uc058vyn.1	9320
chr13	50626245	50626245	intron	DLEU1	uc058xcb.1;uc058xcc.1;uc001vet.1	236046;236046;225018
chr13	67136316	67136316	intron	PCDH9	uc001vik.4;uc001vil.4;uc010thl.3	93501;94128;93501
chr14	37039208	37039208	intron	SLC25A21	uc021rsf.2;uc001wtz.3;uc059axu.1	133214;133657;133137
chr14	55387284	55387284	intron	ATG14	uc059bth.1;uc001xbx.3	4017;24573
chr14	76009852	76009852	intron	IFT43	uc059dpi.1;uc010asm.1;uc059dpr.1;uc059dps.1;uc059dpt.1;uc059dpu.1;uc001xsg.3;uc001xse.4;uc059dpv.1	107715;24098;24081;24071;24071;24069;24069;24067;24057
chr14	78320369	78320369	intron	NRXN3	uc284nlf.1;uc059dwi.1;uc284nlg.1;uc059dwj.1;uc059dwk.1	149995;77977;77331;76766;41725
chr14	89458012	89458012	intron	FOXN3	uc001xxo.5;uc010atk.4;uc059eep.1;uc059ees.1;uc059eet.1;uc059eeu.1	161117;161136;161124;36038;496646;36172
chr14	96803287	96803287	intron	VRK1	uc001yft.4;uc059exy.1	5982;5901
chr15	34271735	34271735	intron	SLC12A6	uc001zhv.4;uc001zhw.4;uc001zhx.4;uc001zic.4;uc001zia.4;uc059hhc.1;uc059hhd.1;uc001zid.4;uc001zhu.4;uc059hhe.1;uc010bau.4;uc001zib.4;uc059hhk.1;uc059hhl.1;uc059hhm.1;uc059hhn.1	47043;65108;65108;66016;66024;66024;66050;66324;27922;66321;66321;66021;27900;66321;66321;66006
chr15	39788970	39788970	intron	AC023908.2	uc059hob.1	12362
chr15	49619422	49619422	intron	FAM227B	uc001zxl.3;uc001zxm.4;uc059jbh.1;uc059jbj.1;uc059jbl.1;uc059jbm.1;uc059jbn.1;uc059jbo.1	1508;1506;1399;1395;1324;1386;1474;1393
chr1	51616084	51616084	intron	OSBPL9	uc057glp.1	38904
chr1	51639882	51639882	intron	OSBPL9	uc057glp.1;uc057glq.1;uc001csu.4;uc057glr.1;uc001cst.5;uc009vza.4;uc057gls.1	62702;22789;22789;22789;22772;22772;22767
chr15	59063216	59063216	intron	RNF111	uc002afs.5;uc002aft.5;uc002afv.5;uc002afw.5	75560;75263;75077;32392
chr15	64233551	64233551	intron	CSNK1G1	uc002anf.5;uc002anh.3;uc284nqe.1;uc284nqf.1;uc284nqg.1;uc284nqh.1;uc059kfm.1;uc284nql.1;uc284nqm.1;uc002anj.5;uc284nqp.1;uc284nqq.1;uc284nqr.1;uc284nqs.1;uc284nqt.1	122707;122621;67145;67171;122519;122580;67075;114826;122458;140217;67080;83444;122580;147868;83455
chr15	64564107	64564107	intron	ZNF609	uc002ann.4;uc059kgk.1	64814;61728
chr15	65733836	65733836	intron	RAB11A	uc059klf.1;uc002api.4;uc002aph.4;uc284nrh.1;uc059klb.1;uc059kle.1	7781;58287;58287;27600;58456;58370
chr15	65852188	65852188	intron	RAB11A	uc059klf.1;uc059klh.1;uc059kli.1	126133;59740;59740
chr15	74291384	74291384	intron	CCDC33	uc002axo.4;uc059lhq.1	55094;24665
chr15	74934378	74934378	intron	COX5A	uc002azi.5;uc059loe.1;uc059lof.1;uc059log.1;uc059loh.1;uc059loi.1	3789;3691;3701;3704;3397;3687
chr15	82802780	82802780	intron	FSD2	uc002bjd.4;uc010uom.3;uc059moy.1	3288;3273;3289
chr15	92661287	92661287	intron	FAM174B	uc059nln.1;uc059nlp.1	6755;148510
chr15	96316928	96316928	intron	NR2F2-AS1	uc059npo.1;uc002bto.2;uc059npz.1;uc059nqa.1;uc059nqb.1;uc059nqd.1;uc059nqe.1;uc031que.2	10092;10432;10419;5716;10187;10428;10168;10398
chr16	7150446	7150446	intron	RBFOX1	uc059qmm.1;uc002cyt.3;uc002cys.2;uc059qmn.1;uc010uxz.1;uc010uya.2;uc059qmr.1;uc059qms.1;uc059qmt.1;uc010uyb.2	1131351;1131314;1131314;1130852;1130742;667266;667121;667059;666652;376636
chr16	9598297	9598297	intron	AC007221.2	uc284nvl.1	56251
chr16	10359029	10359029	intron	ATF7IP2	uc059qrv.1	32594
chr16	20382434	20382434	intron	PDILT	uc002dhc.2	22302
chr16	27969779	27969779	intron	GSG1L	uc002doz.3;uc010bya.2;uc059slg.1	93729;93721;93408
chr16	27982443	27982443	intron	GSG1L	uc002doz.3;uc010bya.2;uc059slg.1	81065;81057;80744
chr16	28124440	28124440	intron	XPO6	uc002dpb.4;uc002dpa.4;uc059slo.1;uc059slp.1	87109;87428;9484;7922
chr16	29689755	29689755	intron	AC009086.2	uc059str.1;uc010vdu.3;uc002dto.4;uc059stt.1	26475;10746;10717;10572
chr16	48269185	48269185	intron	LONP2	uc002efi.2;uc002efj.2;uc059txi.1;uc059txj.1;uc059txk.1	24888;24851;24829;24796;6408
chr16	57687556	57687556	intron	ADGRG3	uc002emh.4;uc010vhv.3;uc059uwm.1;uc059uwo.1;uc059uwp.1	19368;19247;19238;7757;14173
chr16	83689438	83689438	intron	CDH13	uc002fgx.4;uc010vns.3;uc010vnu.3;uc059xrm.1;uc059xru.1;uc059xrv.1;uc059xsb.1	1062634;1062472;1062467;1062467;831084;831081;18859
chr1	61200145	61200145	intron	NFIA	uc057hbg.1;uc057hbk.1;uc001czy.4;uc057hbl.1;uc010oos.3;uc001czw.4;uc001czv.4;uc057hbm.1;uc057hbn.1;uc057hbo.1	334885;146314;122870;118377;118282;117836;117523;117421;117396;117390
chr17	19305559	19305559	intron	EPN2	uc002gvd.5;uc002gve.4;uc060cjk.1;uc002gvf.5;uc060cjl.1;uc060cjn.1;uc010vyo.3;uc060cjt.1;uc060cju.1;uc002gvh.2;uc010vyp.3;uc031qzg.1	68217;68181;68162;68162;68161;68142;68142;23603;23603;22579;22511;701
chr17	32247429	32247429	intron	RHOT1	uc060dts.1;uc060dtv.1	43491;39126
chr17	32786387	32786387	intron	MYO1D	uc002hho.2;uc060dvv.1;uc002hhp.2;uc010wcb.3	90789;76655;90756;90789
chr1	63403607	63403607	intron	ALG6	uc021oof.1;uc057hej.1;uc057hek.1	36016;32836;32836
chr17	35001108	35001108	intron	LIG3	uc002hik.3;uc002hij.4;uc060dze.1;uc060dzf.1	20613;20561;4073;3389
chr17	39283290	39283290	intron	FBXL20	uc002hrt.4;uc032fht.1;uc010cvu.4;uc060eql.1	118372;118365;118332;119232
chr17	40076774	40076774	intron	THRA	uc060ewm.1;uc060ewn.1;uc060ewo.1;uc021twy.2;uc002htw.4;uc002htx.4;uc002htv.4;uc060ewp.1;uc060ewq.1	18483;14869;14580;14580;13963;13963;13874;7870;4682
chr17	51331003	51331003	intron	LINC02071	uc060hpf.1	4161
chr17	63770463	63770463	intron	CCDC47	uc002jbs.5;uc010ddx.4;uc002jbt.3;uc060iqd.1;uc060iqe.1	3311;3106;3106;3100;5887
chr17	78272367	78272367	intron	LINC01993	uc031ren.1	6124
chr18	967247	967247	intron	AP005328.2	uc060muu.1	9635
chr18	1311568	1311568	intron	LINC00470	uc002klb.2;uc002kld.1;uc010dki.2;uc002klc.2;uc002klf.3;uc060mva.1;uc002kle.2;uc060mvb.1;uc002klg.3	47870;95611;48081;48021;47988;95609;47193;47193;48060
chr18	2002795	2002795	intron	AP005230.1	uc060mvj.1	238051
chr18	2180729	2180729	intron	AP005230.1	uc060mvj.1;uc060mvk.1	60117;59150
chr18	2855942	2855942	intron	EMILIN2	uc002kln.4	8911
chr18	3676221	3676221	intron	DLGAP1	uc002kme.3;uc060myl.1;uc010wyw.3;uc060mym.1;uc010wyx.2;uc060myn.1;uc002kmf.4;uc010wyy.3;uc010dkn.4;uc060myo.1;uc002kmg.4;uc010wyz.3	169137;198052;198058;198103;198545;475068;779113;198545;169136;198031;169074;779044
chr18	9172659	9172659	intron	AP005263.1	uc060nfc.1;uc060nfd.1;uc060nfo.1;uc002knv.4;uc060nfp.1;uc060nfq.1;uc060nfr.1;uc002knx.4	69922;69922;36430;35881;35862;35853;35819;35095
chr18	12477160	12477160	intron		uc060nnr.1;uc002kre.4;uc010wzy.3;uc010wzx.3;uc060nns.1;uc010wzw.3;uc060nnu.1;uc060nnv.1	70162;180973;180884;179571;159179;179575;15988;29358
chr18	80029869	80029869	intron	TXNL4A	uc010drg.4;uc060qll.1;uc060qls.1	4079;4021;4024
chr1	71043241	71043241	intron	PTGER3	uc001dfg.2;uc001dfl.1;uc057hmm.1;uc057hmn.1;uc057hmo.1;uc057hmp.1;uc001dfk.2;uc001dfo.4;uc001dfp.2;uc057hmr.1;uc057hms.1;uc001dfq.4	4546;4566;4566;4546;4566;4566;4391;4546;4546;4566;4566;4566
chr19	2739280	2739280	intron	AC006538.2	uc060rlo.1;uc010xgy.2;uc002lwg.4;uc060rlq.1;uc060rlr.1;uc002lwh.4;uc060rls.1	767;736;871;736;194;747;736
chr19	6667321	6667321	intron	TNFSF14	uc060ska.1;uc002mfk.3	2795;3266
chr19	8387886	8387886	intron		uc060ssp.1;uc060suu.1;uc002mjt.5	57154;2237;2798
chr1	75602672	75602672	intron	SLC44A5	uc001dgt.3;uc001dgu.4;uc057hrg.1	8443;8411;8423
chr19	21461208	21461208	intron	AC010615.4	uc060wgx.1;uc060wha.1;uc060whb.1;uc002npz.4;uc060whc.1;uc060whd.1;uc060whe.1	56048;2699;2671;2675;2664;2663;2688
chr19	28532995	28532995	intron	AC005394.2	uc002nsa.3;uc060wmz.1;uc060wnd.1	194684;2281;99732
chr19	29952848	29952848	intron	URI1	uc002nsq.4;uc060wpb.1;uc060wpc.1;uc002nsr.4;uc060wpf.1;uc060wpg.1	29203;29190;29156;10596;10586;10536
chr19	30538841	30538841	intron	ZNF536	uc060wpp.1;uc002nsu.2	310326;166428
chr19	32909497	32909497	intron	CEP89	uc002nty.4;uc060wsn.1;uc060wso.1	62465;50454;62457
chr19	38255187	38255187	intron	SPINT2	uc060xzg.1;uc060xzi.1;uc010efv.4;uc002ohq.3;uc060xzj.1	11151;1184;1344;1403;1344
chr19	39864306	39864306	intron	FCGBP	uc060ynu.1	70319
chr19	50375122	50375122	intron	NR1H2	uc061bpl.1;uc061bpm.1	45468;45429
chr19	52825949	52825949	intron	ZNF28	uc061cfw.1;uc061cfx.1	31650;31650
chr19	54654664	54654664	intron	LILRB4	uc061csn.1	10748
chr2	8923327	8923327	intron	MBOAT2	uc002qzg.2;uc061gmh.1;uc061gmj.1;uc061gmk.1;uc061gml.1;uc061gmm.1	80485;80330;80381;35262;80337;35314
chr2	11273893	11273893	intron	ROCK2	uc002rbd.2;uc061grr.1;uc061grv.1;uc061grx.1;uc061grz.1	70691;70691;70247;13843;74436
chr2	12279204	12279204	intron	MIR3681HG	uc061gts.1;uc002rbu.3	413318;272075
chr2	25054979	25054979	intron	EFR3B	uc002rfw.3;uc010eyh.4;uc061heq.1	12848;12848;12515
chr2	26040196	26040196	intron	RAB10	uc061hgs.1;uc002rgv.4	6088;6085
chr2	28916126	28916126	intron	WDR43	uc002rmo.3;uc061htn.1	21482;11119
chr2	32558688	32558688	intron	BIRC6	uc010ezu.4;uc061hyh.1;uc061hyk.1	201659;33929;16129
chr2	32768490	32768490	intron	TTC27	uc002rom.4;uc061hyr.1	140427;31704
chr2	38691096	38691096	intron	GALM	uc002rqy.4;uc061iiu.1;uc061iiv.1;uc061ijb.1	25185;24978;24852;9800
chr2	46316318	46316318	intron	EPAS1	uc061iux.1;uc061iuy.1;uc002ruv.3;uc061iuz.1	22650;19709;18915;18553
chr2	53769795	53769795	intron	CHAC2	uc002rxk.2;uc002rxg.3;uc061jcz.2;uc021vhl.2;uc002rxh.3;uc061jdc.1;uc061jdd.1;uc061jdh.1;uc061jdi.1	2002;17146;90364;17148;17213;90194;17053;90124;17814
chr2	68536108	68536108	intron	APLF	uc002sep.4;uc061jyk.1;uc061jyl.1;uc002seq.2	68546;68535;68504;6469
chr2	80362116	80362116	intron	CTNNA2	uc010yse.3;uc010ysf.3;uc010ysg.3;uc061lcq.1;uc010ysi.4;uc061lcz.1;uc032nsw.2;uc061lda.1;uc061ldb.1;uc061ldc.1	1176884;849115;848980;710563;57010;56997;48736;48736;48736;33840
chr2	121555728	121555728	intron	CLASP1	uc061nlf.1;uc061nlg.1;uc061nlh.1;uc061nli.1;uc010yyz.3	93747;93747;93747;93858;93709
chr2	134195565	134195565	intron	MGAT5	uc061nyf.1;uc061nyg.1;uc061nyh.1	75581;75581;75382
chr2	144451632	144451632	intron	ZEB2	uc002tvu.4;uc061ofw.1;uc061ofx.1;uc010zbm.3;uc061ofy.1;uc061ofz.1;uc061oga.1;uc061ogb.1;uc061ogc.1;uc061oge.1;uc061ogg.1;uc061ogh.1;uc061ogi.1;uc061ogj.1;uc061ogk.1;uc061ogl.1;uc061ogm.1;uc061ogn.1;uc061ogo.1;uc061ogp.1;uc061ogq.1	68762;68469;66192;68486;65918;68373;68758;66085;66628;65873;66817;68728;66773;68486;65924;68753;65128;66264;68486;68767;66393
chr2	144500779	144500779	intron	ZEB2	uc002tvu.4;uc061ofw.1;uc061ofx.1;uc010zbm.3;uc061ofy.1;uc061ofz.1;uc061oga.1;uc061ogb.1;uc061ogc.1;uc061oge.1;uc061ogg.1;uc061ogh.1;uc061ogi.1;uc061ogj.1;uc061ogk.1;uc061ogl.1;uc061ogm.1;uc061ogn.1;uc061ogo.1;uc061ogp.1;uc061ogq.1;uc002tvx.2	19615;19322;17045;19339;16771;19226;19611;16938;17481;16726;17670;19581;17626;19339;16777;19606;15981;17117;19339;19620;17246;16852
chr2	144716183	144716183	intron	TEX41	uc061ohn.1;uc061oho.1;uc061ohp.1;uc061ohq.1;uc061ohr.1;uc061ohs.1;uc061oht.1;uc061ohu.1;uc061ohv.1;uc061ohw.1;uc061ohx.1;uc061ohy.1;uc021vqn.3;uc061ohz.1;uc061oia.1;uc061oib.1;uc061oic.1;uc061oid.1;uc061oie.1;uc061oif.1;uc061oig.1;uc061oih.1;uc061oii.1;uc061oij.1;uc061oik.1;uc061oil.1;uc061oim.1;uc061oin.1;uc031rpo.2;uc061oio.1;uc061oip.1;uc061oiq.1;uc061oir.1;uc061ois.1;uc061oit.1;uc061oiu.1;uc061oiv.1;uc061oiw.1;uc061oix.1;uc061oiy.1;uc061oiz.1;uc061oja.1;uc061ojc.1;uc061ojd.1	48215;48215;48215;48204;48204;48204;48204;48204;48204;48204;48199;48197;48197;48197;48197;48197;48197;48197;48197;48196;48192;48188;48175;48170;48170;48168;48163;48162;48156;48152;48148;48145;48141;48141;48141;48139;48138;48133;48125;48104;48095;48095;25118;7986
chr2	152573288	152573288	intron	FMNL2	uc002tye.3	238050
chr2	157935504	157935504	intron	UPP2	uc061otz.1	58801
chr2	159722770	159722770	intron	MARCH7	uc010zcn.3;uc010foq.5;uc002uax.5;uc061owl.1;uc061owm.1	10312;10280;10270;10262;10246
chr2	167964827	167964827	intron	STK39	uc061pgc.1;uc002uea.4;uc061pgd.1	148278;283313;47833
chr2	169248574	169248574	intron	LRP2	uc002ues.4;uc010zdf.2	114110;113959
chr2	170299054	170299054	intron	MYO3B	uc002ufv.6;uc061pjy.1;uc002ufy.4;uc002ufz.4;uc061pjz.1;uc061pka.1;uc002uga.4;uc002ugb.4	120908;120908;120908;120908;120908;118904;99845;99801
chr2	171208820	171208820	intron	TLK1	uc002ugp.3	22493
chr2	173891591	173891591	intron	AC106900.1	uc061pto.1	7836
chr2	177735603	177735603	intron	PDE11A	uc002ulq.4;uc002ulr.4;uc002uls.2;uc002ulp.4;uc061qaz.1;uc061qba.1	337151;372735;187191;153134;169570;191405
chr2	187084666	187084666	intron	AC007319.1	uc061qmh.1;uc061qmi.1	81445;81391
chr1	108912696	108912696	intron	GPSM2	uc010ovc.3;uc057iwo.1;uc057iwm.1	37345;35714;50787
chr2	205090146	205090146	intron	PARD3B	uc002var.2;uc010fub.3;uc002vao.3;uc002vap.3;uc002vaq.3;uc061rrh.1;uc061rri.1;uc061rrj.1	544352;544305;544145;544145;544145;124993;124993;124993
chr2	208296048	208296048	intron	PIKFYVE	uc002vcv.4;uc061rxa.1;uc002vcz.3;uc002vcw.4;uc002vcx.4;uc061rxc.1	29780;29780;29780;29769;29769;29684
chr2	216261336	216261336	intron	AC012513.1	uc061sew.1;uc002vgb.4	1740;110690
chr20	18176445	18176445	intron	KAT14	uc021wbb.2;uc010zru.3;uc002wqk.3	34124;33992;31345
chr20	22414447	22414447	intron	AL133464.1	uc284pqs.1;uc010zsn.2;uc284pqu.1	6199;6195;38912
chr20	38160501	38160501	intron	TGM2	uc002xhr.5;uc031rtj.2;uc061wxc.1;uc061wxd.1;uc061wxe.1	4870;4780;4757;6006;6076
chr20	53323641	53323641	intron	TSHZ2	uc002xwo.3;uc021wex.2;uc061xun.1;uc061xum.1	351233;138357;67661;163632
chr1	113784277	113784277	intron	RSBN1	uc001edq.4;uc057jnv.1;uc057jnw.1;uc057jnx.1;uc057jny.1	28198;28170;28015;27997;28011
chr20	61797527	61797527	intron	CDH4	uc032pok.1;uc061yfw.1;uc032pop.2	545100;227391;197765
chr21	15558160	15558160	intron	AJ009632.2	uc284psa.1;uc284psb.1;uc284psf.1	69097;69097;69001
chr21	29643037	29643037	intron	GRIK1	uc061zls.1;uc002ynn.4;uc011acs.3;uc061zlt.1;uc061zlu.1;uc061zlv.1;uc002yno.3;uc061zlw.1;uc061zlx.1;uc002ynr.4	296494;296587;296636;296984;51026;296494;296874;296874;51026;296995
chr21	35475840	35475840	intron	RUNX1	uc061zwk.1;uc031rvg.2	528826;104919
chr21	35994750	35994750	intron	RUNX1	uc061zwk.1	9916
chr21	38113163	38113163	intron		uc062abb.1;uc002ywp.3;uc062abd.1	8175;8196;8167
chr22	19419952	19419952	intron	HIRA	uc010grn.2;uc002zpf.2;uc062bll.1;uc062bln.1;uc062blo.1	11743;11780;27738;10310;27497
chr22	21620216	21620216	intron	UBE2L3	uc031rxe.1;uc002zva.2;uc011aig.3;uc031rxf.2	70768;52668;52668;52485
chr22	30966743	30966743	intron	MORC2	uc062dgc.1;uc003aje.2	1554;1457
chr22	40476528	40476528	intron	MRTFA	uc003ayw.3;uc010gye.3;uc032qph.2;uc010gyf.3;uc062enz.1;uc062eod.1;uc003ayy.3	160156;160156;57351;160162;75782;57282;160158
chr22	40621983	40621983	intron	MRTFA	uc003ayw.3;uc010gye.3;uc010gyf.3;uc003ayy.3;uc062eoe.1	14701;14701;14707;14703;14718
chr22	47645594	47645594	intron	AL117329.1	uc003bik.2	13919
chr3	9410067	9410067	intron	SETD5	uc062gjv.1;uc062gjw.1;uc003brt.3;uc062gjx.1;uc062gjy.1;uc062gjz.1;uc062gka.1;uc062gkb.1	12451;12347;12347;12210;12203;12193;12166;12162
chr1	120939893	120939893	intron	LINC00623	uc031upg.2;uc057jww.1;uc057jwx.1;uc057jwy.1;uc057jwz.1	26617;26617;26586;26564;26513
chr3	10571878	10571878	intron	ATP2B2	uc003bvw.3	136152
chr3	14047157	14047157	intron	TPRXL	uc062gyb.1;uc062gyc.1;uc062gyd.1;uc062gye.1;uc062gyf.1;uc062gyg.1;uc062gyh.1;uc003byg.4;uc062gyi.1;uc062gyj.1;uc062gyk.1	109900;109777;109777;109760;109755;103813;44732;29755;29676;29603;27456
chr3	14836110	14836110	intron	FGD5	uc003bzc.4;uc011avk.3;uc062hal.1	17147;17147;14677
chr3	17076155	17076155	intron	PLCL2	uc032rgc.2;uc011awd.3;uc062hfp.1	191195;143064;65660
chr3	17445062	17445062	intron	TBC1D5	uc003cbf.3;uc062hfs.1;uc062hft.1;uc010hev.4;uc062hfv.1;uc062hfw.1;uc062hfx.1;uc062hfy.1;uc062hfz.1;uc062hga.1;uc062hgb.1;uc062hgc.1;uc062hgd.1;uc062hge.1;uc062hgf.1;uc062hgg.1;uc062hgh.1;uc062hgi.1;uc062hgj.1	295844;297435;999754;254957;297589;297544;297538;297552;297552;297541;297552;297531;297552;294416;297501;297538;254943;297512;178853
chr3	20323478	20323478	intron	SGO1-AS1	uc284pts.1	137122
chr3	20352762	20352762	intron	SGO1-AS1	uc284pts.1	166406
chr3	20809059	20809059	intron	SGO1-AS1	uc284pts.1;uc284ptu.1	622703;76674
chr3	24303609	24303609	intron	THRB	uc003ccx.5;uc003ccy.5;uc003ccz.5;uc062hnq.1;uc062hnr.1;uc062hns.1;uc062hnt.1;uc062hnu.1;uc062hnv.1;uc062hnw.1;uc062hnx.1	191212;191212;191201;191395;191121;191141;191352;191187;191672;191161;191193
chr3	24484020	24484020	intron	THRB	uc003ccx.5;uc003ccy.5;uc003ccz.5;uc062hnq.1;uc062hnr.1;uc062hns.1;uc062hnt.1;uc062hnu.1;uc062hnv.1;uc062hnw.1;uc062hnx.1	10801;10801;10790;10984;10710;10730;10941;10776;11261;10750;10782
chr3	32049016	32049016	intron	OSBPL10	uc062hug.1	28563
chr3	41291849	41291849	intron	ULK4	uc003ckv.5	670580
chr3	44968622	44968622	intron	ZDHHC3	uc003cod.4;uc003cog.4;uc062iwo.1;uc062iwq.1;uc062iws.1	7559;7554;7543;7562;1308
chr1	149634059	149634059	intron	LINC00869	uc057kko.1;uc057kkp.1;uc057kkq.1;uc031uxj.2;uc031uxh.2	27724;27026;26751;26714;26649
chr3	77048385	77048385	intron	ROBO2	uc062lpj.1;uc062lpk.1;uc062lpn.1;uc003dpy.5;uc062lpp.1;uc062lpq.1	1141689;1141582;737288;8498;7654;7654
chr1	151010550	151010550	intron	PRUNE1	uc057kro.1;uc057krp.1;uc057krq.1;uc057krr.1;uc001ewh.2;uc057krs.1;uc057krt.1;uc001ewi.2;uc001ewj.2	2129;2099;2082;2072;2072;2071;2067;2066;2019
chr3	105626970	105626971	intergenic	CBLB	uc062mdd.1	47021
chr3	119695686	119695687	intergenic	MAATS1	uc010hqy.3	7334
chr3	128415444	128415445	intergenic	DNAJB8-AS1	uc003ekl.1	48148
chr3	130184388	130184389	intergenic	LINC02021	uc062ntu.1	71736
chr3	130343212	130343213	intergenic	COL6A5	uc062ntv.1	2302
chr1	153435288	153435289	intergenic	S100A7L2	uc010pdx.3	4659
chr3	142715089	142715090	intergenic	TRPC1	uc003evc.4	8983
chr3	143964137	143964138	intergenic	C3orf58	uc003evo.4	7659
chr3	152097965	152097966	intergenic	AC108718.1	uc062pci.1	53757
chr3	179456168	179456169	intergenic	GNB4	uc003fjv.5	4578
chr3	185572358	185572359	intergenic	SENP2	uc062qyw.1	10136
chr4	4037492	4037493	intergenic		uc032tbc.2	46840
chr1	156804192	156804193	intergenic	PRCC	uc057mde.1	8380
chr1	157263086	157263087	intergenic	AL138900.3	uc057meg.1	17628
chr4	73989248	73989249	intergenic	PPBP	uc003hhj.4	1051
chr4	80233185	80233186	intergenic	FGF5	uc003hmd.4	33412
chr4	108696369	108696370	intergenic	RNU6-431P	uc062yxt.1	44219
chr4	126734670	126734671	intergenic	RF00019	uc062zlh.1	293547
chr1	163359348	163359349	intergenic	NUF2	uc057mwv.1	19093
chr4	128676415	128676416	intergenic	AC078850.2	uc062zni.1	75008
chr1	166691690	166691691	intergenic	POGK	uc057nad.1	147755
chr4	183395611	183395612	intergenic		uc063bgg.1	11058
chr4	183534023	183534024	intergenic	AC107214.2	uc063bgl.1	17129
chr5	6846425	6846426	intergenic	RN7SKP79	uc063cdg.1	2035
chr1	168220713	168220714	intergenic	SFT2D2	uc057ncw.1	5223
chr1	168317404	168317405	intergenic	TBX19	uc057ndc.1	9387
chr5	108622468	108622469	intergenic	LINC01023	uc031sko.2	105790
chr5	109567572	109567573	intergenic	AC091917.1	uc063fwv.1	69695
chr5	138489093	138489094	intergenic	EGR1	uc003ldb.2	23603
chr5	148682226	148682227	intergenic	HTR4	uc063imc.1	4991
chr5	157613637	157613638	intergenic	RNU6-390P	uc063jdt.1	34140
chr5	177114353	177114354	intergenic	FGFR4	uc063kbq.1	17162
chr6	4167384	4167385	intergenic	AL136309.3	uc063loq.1	20723
chr6	4399876	4399877	intergenic	RNA5SP202	uc032wjm.2	28085
chr6	10643548	10643549	intergenic	RF00019	uc063luo.1	31218
chr6	15101359	15101360	intergenic	AL138720.1	uc063lzk.1	11356
chr6	16031393	16031394	intergenic	LINC02543	uc063maa.1	36449
chr6	19968458	19968459	intergenic	AL022068.1	uc063mca.1	129378
chr6	20526104	20526105	intergenic	CDKAL1	uc003ndd.3	8351
chr6	26595078	26595079	intergenic	ABT1	uc003nii.4	1872
chr6	27716017	27716018	intergenic	LINC01012	uc003njk.3	21938
chr6	28711988	28711989	intergenic		uc021ysf.3	53671
chr6	28794799	28794800	intergenic	AL662890.1	uc063mly.1	44205
chr6	37711434	37711435	intergenic	MDGA1	uc063ogr.1	12128
chr1	15674991	15674992	intergenic	AL121992.2	uc057cok.1	8135
chr6	43969658	43969659	intergenic	C6orf223	uc063ouz.1	30920
chr6	49653639	49653640	intergenic	RHAG	uc063ozd.1	16800
chr1	185409515	185409516	intergenic	RNU7-183P	uc057nyf.1	24727
chr6	52330748	52330749	intergenic	PAQR8	uc063pae.1	30671
chr6	67352901	67352902	intergenic	RNA5SP208	uc063pjh.1	114405
chr6	67433328	67433329	intergenic	RNA5SP208	uc063pjh.1	33978
chr6	84789379	84789380	intergenic	TBX18	uc003pkl.4	24860
chr6	86683158	86683159	intergenic	RN7SL643P	uc063pyg.1	247310
chr6	87869251	87869252	intergenic		uc063qaa.1	21709
chr6	118723930	118723931	intergenic	CEP85L	uc063qze.1	13855
chr1	15996656	15996657	intergenic	SRARP	uc001axn.4	7578
chr6	148157766	148157767	intergenic	AL359382.1	uc063sdg.1	20362
chr6	149140125	149140126	intergenic		uc063sdq.1	36667
chr6	154518096	154518097	intergenic	CNKSR3	uc003qpy.4	7437
chr6	154714547	154714548	intergenic	SCAF8	uc003qpz.5	18776
chr6	164331276	164331277	intergenic	AL358972.1	uc063stz.1	17366
chr6	170590900	170590901	intergenic	PDCD2	uc021ziq.2	6208
chr7	12725756	12725757	intergenic	RF00264	uc064bps.1	24867
chr7	12826233	12826234	intergenic	RN7SKP228	uc064bpt.1	50449
chr7	13777452	13777453	intergenic	AC005019.2	uc064bpx.1	81571
chr7	24763535	24763536	intergenic	GSDME,GSDME	uc003sxa.2	5422
chr7	25930676	25930677	intergenic	AC010719.1	uc064cdq.1	18725
chr7	30252125	30252126	intergenic	ZNRF2	uc003tat.3	32180
chr7	47217820	47217821	intergenic	TNS3	uc022aco.2	62496
chr7	47686014	47686015	intergenic	C7orf65	uc010kyp.2	30770
chr7	64434971	64434972	intergenic	ZNF680	uc064dxr.1	109246
chr7	65851983	65851984	intergenic	RNU6-973P	uc064dyt.1	7781
chr7	72550634	72550635	intergenic	CALN1	uc064edf.1	103483
chr7	73526282	73526283	intergenic	BAZ1B	uc064eeg.1	4004
chr7	73562534	73562535	intergenic	BCL7B	uc003tyf.3	4532
chr7	73886042	73886043	intergenic	TMEM270	uc064ehn.1	24870
chr7	76168138	76168139	intergenic	SRRM3	uc032ztw.2	33760
chr7	81209251	81209252	intergenic	AC005008.2	uc064ewe.1	21461
chr7	97515740	97515741	intergenic	RN7SKP104	uc064fpu.1	83518
chr7	100699214	100699215	intergenic	POP7	uc003uwh.5	6837
chr7	101265985	101265986	intergenic	RNU6-1104P	uc064gln.1	4054
chr7	106862949	106862950	intergenic	PIK3CG	uc003vdv.6	2327
chr7	106979759	106979760	intergenic	PRKAR2B	uc003vdx.4	64888
chr7	107002621	107002622	intergenic	PRKAR2B	uc003vdx.4	42026
chr7	107018168	107018169	intergenic	PRKAR2B	uc003vdx.4	26479
chr7	107025755	107025756	intergenic	PRKAR2B	uc003vdx.4	18892
chr7	128099379	128099380	intergenic	SND1	uc064htb.1	9890
chr7	128373833	128373834	intergenic	PRRT4	uc064htp.1	12148
chr7	135845570	135845571	intergenic		uc064iie.1	47461
chr7	139492497	139492498	intergenic	KLRG2	uc003vvb.4	8785
chr7	146058092	146058093	intergenic	AC073308.1	uc064iym.1	8030
chr7	148465156	148465157	intergenic	RN7SL72P	uc064iza.1	26847
chr8	213268	213269	intergenic	ZNF596;ZNF596	uc064jvr.1	18867
chr8	30925191	30925192	intergenic	TEX15	uc064ltx.1	34585
chr8	37970831	37970832	intergenic	ADRB3	uc003xkr.3	3866
chr8	59555554	59555555	intergenic	AC087664.1	uc064nch.1	38145
chr8	73396993	73396994	intergenic	STAU2-AS1	uc064nsb.1	23009
chr8	90167736	90167737	intergenic	LINC00534	uc064ojb.1	53750
chr8	90169094	90169095	intergenic	LINC00534	uc064ojb.1	52392
chr8	90194400	90194401	intergenic	LINC00534	uc064ojb.1	27086
chr8	95927321	95927322	intergenic	AC012339.1	uc064otr.1	65780
chr8	100982519	100982520	intergenic	RN7SL685P	uc064pdd.1	24636
chr8	102501395	102501396	intergenic		uc064pgf.1	20470
chr8	126323003	126323004	intergenic	AC087667.1	uc064qcp.1	2490
chr8	128788820	128788821	intergenic		uc064qeu.1	30972
chr8	129024123	129024124	intergenic	CCDC26	uc064qex.1	58122
chr8	143194965	143194966	intergenic	GPIHBP1	uc033cbs.1	18226
chr9	73981618	73981619	intergenic	RNA5SP286	uc064ttm.1	230541
chr9	112181453	112181454	intergenic	SUSD1	uc004bfu.5	6045
chr1	223210645	223210646	intergenic	SUSD4	uc057prx.1	18803
chr9	126759377	126759378	intergenic		uc064vtl.1	41044
chr1	224037775	224037776	intergenic	AC138393.3	uc057ptk.1	43513
chrX	25072378	25072379	intergenic	RN7SL91P	uc064yke.1	11439
chrX	39863149	39863150	intergenic	AL592164.1	uc064yop.1	14791
chr1	226001301	226001302	intergenic	SDE2	uc001hpu.5	1970
chrX	48398020	48398021	intergenic	SSX4B	uc004djh.3	13877
chrX	48750963	48750964	intergenic	GLOD5	uc011mmh.3	10785
chrX	66012262	66012263	intergenic	AL034397.3	uc004dwg.1	3197
chr1	226331791	226331792	intergenic	LIN9	uc057pyh.1	21922
chrX	68587750	68587751	intergenic	RNU6-245P	uc064zrl.1	48201
chr1	226483370	226483371	intergenic	RN7SKP165	uc057pys.1	37433
chrX	70960934	70960935	intergenic	SLC7A3	uc004dyo.4	29809
chrX	71000385	71000386	intergenic	RNU1-56P	uc064ztc.1	20051
chrX	109959686	109959687	intergenic	TMEM164	uc065arl.1	42943
chrX	111447505	111447506	intergenic	RNU6-496P	uc065asj.1	25897
chr1	227526702	227526703	intergenic	RNA5SP77	uc057qaa.1	34477
chrX	136884675	136884676	intergenic	RBMX	uc065bio.1	3911
chr1	234491445	234491446	intergenic	TARBP1	uc001hwd.3	12342
chr1	241804888	241804889	intergenic	WDR64	uc057qtf.1	34433
chr1	242026001	242026002	intergenic		uc057qtp.1	1947
chr10	7028390	7028391	intergenic	AL590095.1	uc057rof.1	68760
chr10	22227263	22227264	intergenic	AL157831.2	uc057sem.1	9189
chr10	43984273	43984274	intergenic	LINC00841	uc001jbe.1	45270
chr1	23932247	23932248	intergenic	MIR378F	uc021oik.1	3177
chr10	61492456	61492457	intergenic	TMEM26	uc001jlo.3	39006
chr10	62724633	62724634	intergenic	ADO	uc001jmg.4	80222
chr10	71160102	71160103	intergenic	UNC5B	uc001jro.4	52466
chr10	78072674	78072675	intergenic		uc057ukl.1	33076
chr10	80130161	80130162	intergenic	PLAC9	uc057umr.1	1519
chr10	80757024	80757025	intergenic	AL731544.1	uc057unx.1	107227
chr10	80770015	80770016	intergenic	AL731544.1	uc057unx.1	120218
chr1	26012619	26012620	intergenic	RF00283	uc057doo.1	6403
chr10	124454170	124454171	intergenic	NKX1-2	uc010quf.3	3986
chr10	128579539	128579540	intergenic	AL390763.1	uc057wve.1	259325
chr11	3307790	3307791	intergenic	AC123788.1	uc057ybf.1	28929
chr11	10921016	10921017	intergenic	AC069360.1	uc001mjn.2	21694
chr11	18649921	18649922	intergenic	SPTY2D1	uc057zss.1	15130
chr11	45826668	45826669	intergenic	AC044839.4	uc058atn.1	13449
chr11	47946870	47946871	intergenic	PTPRJ	uc058bio.1	33686
chr11	59968226	59968227	intergenic		uc058bsl.1	56680
chr11	86197669	86197670	intergenic	AP003097.2	uc058gfp.1	4187
chr11	118878381	118878382	intergenic	CXCR5	uc001pue.5	5383
chr12	13288749	13288750	intergenic	EMP1	uc058lkv.1	75297
chr1	35788265	35788266	intergenic	CLSPN	uc057esz.1	18298
chr12	54195947	54195948	intergenic	SMUG1	uc284nfy.1	6939
chr1	38265835	38265836	intergenic	LINC01343	uc021olp.2	51068
chr1	38310233	38310234	intergenic	RNU6-753P	uc057eyz.1	86424
chr1	38523684	38523685	intergenic	LINC01685	uc057eza.1	48488
chr12	89393825	89393826	intergenic	AC010201.1	uc058rot.1	22005
chr12	95460328	95460329	intergenic	METAP2	uc058ryc.1	13190
chr12	95600810	95600811	intergenic	RF00019	uc058ryp.1	16877
chr12	103907793	103907794	intergenic	HSP90B1	uc058smf.1	22312
chr12	105895646	105895647	intergenic	AC011595.2	uc058sqs.1	162606
chr12	108270711	108270712	intergenic		uc058suf.1	30883
chr1	40131127	40131128	intergenic	RLF	uc001cfc.5	30244
chr1	40340204	40340205	intergenic	SMAP2	uc057fff.1	4644
chr1	40555078	40555079	intergenic	ZNF684	uc057fgb.1	13764
chr12	116303271	116303272	intergenic	MED13L,MED13L,MED13L,MED13L,MED13L	uc001tvw.4	25933
chr12	127158869	127158870	intergenic		uc058vbf.1	7256
chr13	22428260	22428261	intergenic	LINC00540	uc001uoj.4	217975
chr13	60183149	60183150	intergenic	RNY4P28	uc058xim.1	4679
chr14	20794846	20794847	intergenic	RNASE1	uc001vyf.4	7558
chr14	22647072	22647073	intergenic		uc058zib.1	8599
chr14	33866876	33866877	intergenic	EGLN3	uc059aqy.1	64458
chr14	65144859	65144860	intergenic	RNU2-14P	uc059cmj.1	20317
chr14	68759379	68759380	intergenic	RNU6-921P	uc059csx.1	23351
chr15	50802895	50802896	intergenic	SPPL2A	uc001zyv.4	37087
chr15	55106388	55106389	intergenic	AC025272.1	uc059jkv.1	18389
chr15	81942466	81942467	intergenic	AC025219.1	uc059mmk.1	53198
chr15	95176521	95176522	intergenic	AC087633.2	uc059not.1	18292
chr16	7998164	7998165	intergenic		uc032dqe.2	98733
chr1	54753516	54753517	intergenic	PARS2	uc001cxy.4	10996
chr16	17014678	17014679	intergenic	AC109446.4	uc059rib.1	70369
chr16	25052706	25052707	intergenic	LINC02175	uc021tfn.2	14229
chr16	27107336	27107337	intergenic	AC092725.1	uc059sii.1	39478
chr16	31981092	31981093	intergenic	ZNF267	uc059tpt.1	69238
chr16	55376922	55376923	intergenic	MMP2	uc059uht.1	12776
chr16	59368830	59368831	intergenic	RNU4-58P	uc059vev.1	171294
chr16	65769299	65769300	intergenic	AC022164.1	uc059vhh.1	94483
chr16	79673868	79673869	intergenic	LINC01229	uc059xli.1	2238
chr17	8714065	8714066	intergenic	CCDC42	uc002glo.4	30720
chr17	21523159	21523160	intergenic	AC233702.5	uc060csu.1	3645
chr17	77220947	77220948	intergenic	SEC14L1	uc060kpc.1	8232
chr1	67558633	67558634	intergenic	RNU6-1031P	uc057hjy.1	17506
chr18	1438662	1438663	intergenic	LINC00470	uc060mvd.1	30318
chr18	1498432	1498433	intergenic	AC008109.1	uc060mvf.1	10749
chr18	22066172	22066173	intergenic	AC091043.1	uc060nvj.1	21886
chr18	56238731	56238732	intergenic	AC006305.1	uc060pkt.1	47469
chr18	62450609	62450610	intergenic	ZCCHC2	uc002lip.5	72396
chr19	4074613	4074614	intergenic	ZBTB7A	uc002lzi.4	7668
chr19	32061631	32061632	intergenic	LINC01533	uc060wqt.1	13009
chr19	40435416	40435417	intergenic	SERTAD3	uc060ysh.1	5967
chr19	52692622	52692623	intergenic	ZNF83	uc061cey.1	2126
chr19	53769432	53769433	intergenic	RNU6-165P	uc061clb.1	2800
chr2	21208006	21208007	intergenic	AC018742.1	uc061gzu.1	13167
chr2	38654227	38654228	intergenic	GALM	uc002rqy.4	11681
chr2	55824310	55824311	intergenic	EFEMP1	uc284ppi.1	93468
chr2	64061369	64061370	intergenic	AC012368.2	uc061jsy.1	26875
chr2	70130872	70130873	intergenic	LINC01816	uc002sgg.6	5555
chr2	103525994	103525995	intergenic	AC013727.1	uc061mnj.1	339969
chr2	105601161	105601162	intergenic	NCK2	uc002tdg.4	143734
chr1	95108422	95108423	intergenic	TMEM56	uc001drb.4	8914
chr2	136434408	136434409	intergenic	RN7SKP141	uc021vqa.1	44076
chr1	99768615	99768616	intergenic	FRRS1	uc057ipp.1	1984
chr1	99772978	99772979	intergenic	FRRS1	uc057ipp.1	6347
chr2	170085137	170085138	intergenic	UBR3	uc061pjw.1	5113
chr2	170806773	170806774	intergenic	GAD1	uc061pni.1	6438
chr2	176934291	176934292	intergenic	RNU6-187P	uc061pzp.1	4201
chr2	177554554	177554555	intergenic	TTC30B	uc002uln.4	1540
chr2	184205386	184205387	intergenic	MIR548AE1	uc032omi.1	173587
chr2	186532930	186532931	intergenic	ZC3H15	uc061qly.1	26217
chr2	190378206	190378207	intergenic	INPP1	uc061qrk.1	9562
chr1	109085281	109085282	intergenic	TMEM167B	uc001dwn.4	4520
chr2	215283439	215283440	intergenic	AC073284.1	uc061sce.1	8447
chr2	215288305	215288306	intergenic	AC073284.1	uc061sce.1	13313
chr2	217108810	217108811	intergenic		uc061sgr.1	67686
chr2	222798075	222798076	intergenic		uc032otq.2	4895
chr1	11562862	11562863	intergenic	DISP3	uc057cgg.1	29186
chr20	23660695	23660696	intergenic	AL121894.1	uc061vvy.1	4305
chr20	36067381	36067382	intergenic	AL035420.3	uc061wsf.1	2818
chr20	41694994	41694995	intergenic	RNU6-1018P	uc021wdv.1	9903
chr20	50195466	50195467	intergenic	CEBPB-AS1	uc061xrw.1	3968
chr20	53638295	53638296	intergenic	ZNF217	uc061xuv.1	28388
chr20	58110191	58110192	intergenic	AL354984.2	uc061xzn.1	38121
chr21	22136986	22136987	intergenic	LINC00308	uc021whu.1	38369
chr21	22190755	22190756	intergenic	RNU4-45P	uc061zfv.1	14435
chr22	16441326	16441327	intergenic	AP000547.2	uc062bev.1	79731
chr22	19578285	19578286	intergenic	LINC00895	uc011ahb.2	11446
chr22	20685555	20685556	intergenic	TMEM191A;TMEM191A	uc002zsx.2	15557
chr22	21648373	21648374	intergenic	AP000553.1	uc062bzn.1	3895
chr22	22002895	22002896	intergenic	TOP3B	uc002zvt.6	20079
chr1	116872058	116872059	intergenic		uc057jsq.1	33090
chr3	4482052	4482053	intergenic	ITPR1-DT	uc032rcf.2	11109
chr1	12015683	12015684	intergenic	MIIP	uc057ciw.1	3781
chr3	9590360	9590361	intergenic	LHFPL4	uc003bry.3	36558
chr3	9638320	9638321	intergenic	MTMR14	uc003bsa.4	11111
chr3	14634063	14634064	intergenic	AC090952.1	uc062had.1	15367
chr1	144494334	144494335	intergenic	RF00003	uc057jzc.1	29685
chr3	36971413	36971414	intergenic	AC011816.2	uc062ibc.1	2257
chr3	43071469	43071470	intergenic	AC092042.1	uc062itc.1	16008
chr3	46013934	46013935	intergenic	XCR1	uc062izi.1	13551
chr3	47481920	47481921	intergenic	SCAP	uc062jdq.1	4794
chr3	65052248	65052249	intergenic	LINC02040	uc062lgz.1	122708
chr3	69692098	69692099	intergenic	MITF;MITF	uc003dnz.4	47335
chr1	150907696	150907697	intergenic	SETDB1	uc057kqi.1	18565
chr3	87795045	87795046	intergenic		uc062lsg.1	1416

Annotate TFBS functions

The key function of transcription factors is to regulate gene expression. By working with the Genomic Regions Enrichment of Annotations Tool (GREAT), TFregulomeR allows users to annotate the functions of TFBSs using greatAnnotate. Earlier versions of the GREAT server did not support hg38, so the liftOver R package was incorporated in TFregulomeR to convert hg38 coordinates to hg19. Since v1.16.1, however, rGREAT links to GREAT version 4.0.4, which supports hg38. Accordingly, TFregulomeR allows direct annotation of hg38 genomic regions from development version v1.2.1 onwards (this feature will be added to the future stable version v1.3.0). greatAnnotate returns a data.frame containing the annotation results and can also save an HTML report. The HTML report uses the rbokeh package to present the results in an interactive interface.
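The version rule described above can be written down as a small check. The following is only a sketch in base R: needs_liftover is a hypothetical helper, not a TFregulomeR function, and it simply encodes the two version thresholds mentioned in the text (TFregulomeR v1.2.1 and rGREAT v1.16.1).

```r
# Hypothetical helper (not part of TFregulomeR): decide whether hg38 peaks
# must be converted to hg19 before GREAT analysis, based on version strings.
needs_liftover <- function(tfregulomer_version, rgreat_version) {
  # Direct hg38 annotation requires both packages to be recent enough
  direct_hg38 <- package_version(tfregulomer_version) >= "1.2.1" &&
    package_version(rgreat_version) >= "1.16.1"
  !direct_hg38
}

needs_liftover("1.2.0", "1.16.1")  # TRUE: TFregulomeR is too old
needs_liftover("2.0.0", "1.16.1")  # FALSE: hg38 can be annotated directly
needs_liftover("2.0.0", "1.14.0")  # TRUE: rGREAT is too old
```

With both packages installed, the version strings could be obtained with as.character(utils::packageVersion("TFregulomeR")) and as.character(utils::packageVersion("rGREAT")).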

# annotate the functions of K562 CEBPB exclusive peaks
# loading GREAT R package 'rGREAT'
library(rGREAT)
# the peak assembly is "hg38", and 'liftOver' is needed for conversion (no need if using TFregulomeR >= v1.2.1 and rGREAT >= 1.16.1)
library(liftOver)
# 'rbokeh' is required for an HTML report generation
library(rbokeh)

K562_CEBPB_exclusivePeak_func <- greatAnnotate(peaks = K562_CEBPB_exclusive_peak, 
                                               return_annotation = TRUE, 
                                               return_html_report = TRUE)
#> Start greatAnnotate ...
#> ... ... You chose to return annotated results in a data.frame.
#> ... ... You chose to return an HTML report.
#> ... ... assembly is hg38. Now converting to hg19 using liftOver...
#> ... ... number of the original input regions is 721
#> ... ... number of the regions successfully converted to hg19 is 721
#> ... ... start GREAT analysis
#> ... ... An html report has been generated as 'greatAnnotate_result.html'!
#> ... ... The annotation results have been returned in a data.frame!

head(K562_CEBPB_exclusivePeak_func)
#>   category         ID                             name
#> 1       MF GO:0005488                          binding
#> 2       MF GO:0005515                  protein binding
#> 3       BP GO:0065007            biological regulation
#> 4       BP GO:0050794   regulation of cellular process
#> 5       BP GO:0019222  regulation of metabolic process
#> 6       BP GO:0050789 regulation of biological process
#>   number_of_targeting_genes adjusted_pvalue
#> 1                       817    0.0001415278
#> 2                       497    0.0007221064
#> 3                       653    0.0059313854
#> 4                       602    0.0059313854
#> 5                       386    0.0072758686
#> 6                       628    0.0072758686
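Because the result is an ordinary data.frame, it can be filtered and sorted with base R. The sketch below rebuilds the six rows printed above by hand so that it runs on its own; in practice K562_CEBPB_exclusivePeak_func would be used directly. The cut-off of 0.006 is an arbitrary value chosen for illustration.

```r
# Rebuild the six rows shown by head() above (values copied from the output)
go_res <- data.frame(
  category = c("MF", "MF", "BP", "BP", "BP", "BP"),
  ID = c("GO:0005488", "GO:0005515", "GO:0065007",
         "GO:0050794", "GO:0019222", "GO:0050789"),
  name = c("binding", "protein binding", "biological regulation",
           "regulation of cellular process",
           "regulation of metabolic process",
           "regulation of biological process"),
  number_of_targeting_genes = c(817, 497, 653, 602, 386, 628),
  adjusted_pvalue = c(0.0001415278, 0.0007221064, 0.0059313854,
                      0.0059313854, 0.0072758686, 0.0072758686),
  stringsAsFactors = FALSE
)

# Keep biological process (BP) terms below an illustrative cut-off
bp_terms <- subset(go_res, category == "BP" & adjusted_pvalue < 0.006)

# Sort by adjusted p-value, then by decreasing number of targeting genes
bp_terms <- bp_terms[order(bp_terms$adjusted_pvalue,
                           -bp_terms$number_of_targeting_genes), ]
bp_terms[, c("ID", "name", "number_of_targeting_genes")]
```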

TFregulomeR - GreatAnnotate Result

Figure - Gene ontology analyses of targeting genes:

Table - Gene ontology analyses of targeting genes:

GO ID	GO name	adjusted p-value	number of targeting genes
GO:0065007	biological regulation	0.0059313853615	653
GO:0050794	regulation of cellular process	0.0059313853615	602
GO:0019222	regulation of metabolic process	0.0072758685718	386
GO:0050789	regulation of biological process	0.0072758685718	628
GO:0015669	gas transport	0.0072758685718	5
GO:0080090	regulation of primary metabolic process	0.00896198756116667	351
GO:0005488	binding	0.00014152778558	817
GO:0005515	protein binding	0.0007221064418	497
GO:0005622	intracellular	9.507918375e-06	812
GO:0044424	intracellular part	6.3673425e-05	787
GO:0044464	cell part	0.0008246806875	1025
GO:0005623	cell	0.0008246806875	1025
GO:0043231	intracellular membrane-bounded organelle	0.0008246806875	615
GO:0043227	membrane-bounded organelle	0.0008246806875	615
GO:0043229	intracellular organelle	0.00404576196428571	673
GO:0043226	organelle	0.00494924765625	673

Connect with TFBSTools

TFregulomeR is designed to work with other packages. The function toTFBSTools converts a motif matrix from the TFregulomeR warehouse into a PFMatrix object of TFBSTools.

library(TFBSTools)
K562_CEBPB_TFBS_PFM <- toTFBSTools(id = "MM1_HSA_K562_CEBPB")
K562_CEBPB_TFBS_PFM
#> An object of class PFMatrix
#> ID: MM1_HSA_K562_CEBPB
#> Name: CEBPB
#> Matrix Class: Unknown
#> strand: *
#> Tags: 
#> list()
#> Background: 
#>      A      C      G      T 
#> 0.2717 0.2283 0.2283 0.2717 
#> Matrix: 
#>   [,1] [,2] [,3] [,4] [,5] [,6] [,7] [,8] [,9] [,10] [,11]
#> A   93  305    0    0   49    0  280   91  290   533    15
#> C   61   74    0    0    0  533   59  205  242     0   215
#> G  192  141    0    0  390    0  139    3    0     0    73
#> T  187   13  533  533   94    0   55  234    1     0   230
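To illustrate what the count matrix encodes, the sketch below copies the counts printed above into a plain base-R matrix, converts them to per-position frequencies and derives a consensus sequence. It is a minimal example that does not require TFBSTools; with the package loaded, the counts would normally be taken from the PFMatrix object itself (for example via the Matrix() accessor) rather than typed by hand.

```r
# Position frequency matrix copied from the PFMatrix printed above
pfm <- matrix(
  c( 93, 305,   0,   0,  49,   0, 280,  91, 290, 533,  15,   # A
     61,  74,   0,   0,   0, 533,  59, 205, 242,   0, 215,   # C
    192, 141,   0,   0, 390,   0, 139,   3,   0,   0,  73,   # G
    187,  13, 533, 533,  94,   0,  55, 234,   1,   0, 230),  # T
  nrow = 4, byrow = TRUE,
  dimnames = list(c("A", "C", "G", "T"), NULL)
)

# Every position is supported by the same number of sites
colSums(pfm)

# Convert counts to per-position nucleotide frequencies (columns sum to 1)
ppm <- sweep(pfm, 2, colSums(pfm), "/")

# Consensus: the most frequent nucleotide at each position
consensus <- paste(rownames(pfm)[apply(pfm, 2, which.max)], collapse = "")
consensus  # "GATTGCATAAT"
```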