Cargando…
iTAR: a web server for identifying target genes of transcription factors using ChIP-seq or ChIP-chip data
BACKGROUND: Chromatin immunoprecipitation followed by massively parallel DNA sequencing (ChIP-seq) or microarray hybridization (ChIP-chip) has been widely used to determine the genomic occupation of transcription factors (TFs). We have previously developed a probabilistic method, called TIP (Target...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4983039/ https://www.ncbi.nlm.nih.gov/pubmed/27519564 http://dx.doi.org/10.1186/s12864-016-2963-0 |
_version_ | 1782447847919058944 |
---|---|
author | Yang, Chia-Chun Andrews, Erik H. Chen, Min-Hsuan Wang, Wan-Yu Chen, Jeremy J. W. Gerstein, Mark Liu, Chun-Chi Cheng, Chao |
author_facet | Yang, Chia-Chun Andrews, Erik H. Chen, Min-Hsuan Wang, Wan-Yu Chen, Jeremy J. W. Gerstein, Mark Liu, Chun-Chi Cheng, Chao |
author_sort | Yang, Chia-Chun |
collection | PubMed |
description | BACKGROUND: Chromatin immunoprecipitation followed by massively parallel DNA sequencing (ChIP-seq) or microarray hybridization (ChIP-chip) has been widely used to determine the genomic occupation of transcription factors (TFs). We have previously developed a probabilistic method, called TIP (Target Identification from Profiles), to identify TF target genes using ChIP-seq/ChIP-chip data. To achieve high specificity, TIP applies a conservative method to estimate significance of target genes, with the trade-off being a relatively low sensitivity of target gene identification compared to other methods. Additionally, TIP’s output does not render binding-peak locations or intensity, information highly useful for visualization and general experimental biological use, while the variability of ChIP-seq/ChIP-chip file formats has made input into TIP more difficult than desired. DESCRIPTION: To improve upon these facets, here we present are fined TIP with key extensions. First, it implements a Gaussian mixture model for p-value estimation, increasing target gene identification sensitivity and more accurately capturing the shape of TF binding profile distributions. Second, it enables the incorporation of TF binding-peak data by identifying their locations in significant target gene promoter regions and quantifies their strengths. Finally, for full ease of implementation we have incorporated it into a web server (http://syslab3.nchu.edu.tw/iTAR/) that enables flexibility of input file format, can be used across multiple species and genome assembly versions, and is freely available for public use. The web server additionally performs GO enrichment analysis for the identified target genes to reveal the potential function of the corresponding TF. CONCLUSIONS: The iTAR web server provides a user-friendly interface and supports target gene identification in seven species, ranging from yeast to human. To facilitate investigating the quality of ChIP-seq/ChIP-chip data, the web server generates the chart of the characteristic binding profiles and the density plot of normalized regulatory scores. The iTAR web server is a useful tool in identifying TF target genes from ChIP-seq/ChIP-chip data and discovering biological insights. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2963-0) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4983039 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-49830392016-08-14 iTAR: a web server for identifying target genes of transcription factors using ChIP-seq or ChIP-chip data Yang, Chia-Chun Andrews, Erik H. Chen, Min-Hsuan Wang, Wan-Yu Chen, Jeremy J. W. Gerstein, Mark Liu, Chun-Chi Cheng, Chao BMC Genomics Database BACKGROUND: Chromatin immunoprecipitation followed by massively parallel DNA sequencing (ChIP-seq) or microarray hybridization (ChIP-chip) has been widely used to determine the genomic occupation of transcription factors (TFs). We have previously developed a probabilistic method, called TIP (Target Identification from Profiles), to identify TF target genes using ChIP-seq/ChIP-chip data. To achieve high specificity, TIP applies a conservative method to estimate significance of target genes, with the trade-off being a relatively low sensitivity of target gene identification compared to other methods. Additionally, TIP’s output does not render binding-peak locations or intensity, information highly useful for visualization and general experimental biological use, while the variability of ChIP-seq/ChIP-chip file formats has made input into TIP more difficult than desired. DESCRIPTION: To improve upon these facets, here we present are fined TIP with key extensions. First, it implements a Gaussian mixture model for p-value estimation, increasing target gene identification sensitivity and more accurately capturing the shape of TF binding profile distributions. Second, it enables the incorporation of TF binding-peak data by identifying their locations in significant target gene promoter regions and quantifies their strengths. Finally, for full ease of implementation we have incorporated it into a web server (http://syslab3.nchu.edu.tw/iTAR/) that enables flexibility of input file format, can be used across multiple species and genome assembly versions, and is freely available for public use. The web server additionally performs GO enrichment analysis for the identified target genes to reveal the potential function of the corresponding TF. CONCLUSIONS: The iTAR web server provides a user-friendly interface and supports target gene identification in seven species, ranging from yeast to human. To facilitate investigating the quality of ChIP-seq/ChIP-chip data, the web server generates the chart of the characteristic binding profiles and the density plot of normalized regulatory scores. The iTAR web server is a useful tool in identifying TF target genes from ChIP-seq/ChIP-chip data and discovering biological insights. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2963-0) contains supplementary material, which is available to authorized users. BioMed Central 2016-08-12 /pmc/articles/PMC4983039/ /pubmed/27519564 http://dx.doi.org/10.1186/s12864-016-2963-0 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Database Yang, Chia-Chun Andrews, Erik H. Chen, Min-Hsuan Wang, Wan-Yu Chen, Jeremy J. W. Gerstein, Mark Liu, Chun-Chi Cheng, Chao iTAR: a web server for identifying target genes of transcription factors using ChIP-seq or ChIP-chip data |
title | iTAR: a web server for identifying target genes of transcription factors using ChIP-seq or ChIP-chip data |
title_full | iTAR: a web server for identifying target genes of transcription factors using ChIP-seq or ChIP-chip data |
title_fullStr | iTAR: a web server for identifying target genes of transcription factors using ChIP-seq or ChIP-chip data |
title_full_unstemmed | iTAR: a web server for identifying target genes of transcription factors using ChIP-seq or ChIP-chip data |
title_short | iTAR: a web server for identifying target genes of transcription factors using ChIP-seq or ChIP-chip data |
title_sort | itar: a web server for identifying target genes of transcription factors using chip-seq or chip-chip data |
topic | Database |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4983039/ https://www.ncbi.nlm.nih.gov/pubmed/27519564 http://dx.doi.org/10.1186/s12864-016-2963-0 |
work_keys_str_mv | AT yangchiachun itarawebserverforidentifyingtargetgenesoftranscriptionfactorsusingchipseqorchipchipdata AT andrewserikh itarawebserverforidentifyingtargetgenesoftranscriptionfactorsusingchipseqorchipchipdata AT chenminhsuan itarawebserverforidentifyingtargetgenesoftranscriptionfactorsusingchipseqorchipchipdata AT wangwanyu itarawebserverforidentifyingtargetgenesoftranscriptionfactorsusingchipseqorchipchipdata AT chenjeremyjw itarawebserverforidentifyingtargetgenesoftranscriptionfactorsusingchipseqorchipchipdata AT gersteinmark itarawebserverforidentifyingtargetgenesoftranscriptionfactorsusingchipseqorchipchipdata AT liuchunchi itarawebserverforidentifyingtargetgenesoftranscriptionfactorsusingchipseqorchipchipdata AT chengchao itarawebserverforidentifyingtargetgenesoftranscriptionfactorsusingchipseqorchipchipdata |