Cargando…
A varying threshold method for ChIP peak-calling using multiple sources of information
Motivation: Gene regulation commonly involves interaction among DNA, proteins and biochemical conditions. Using chromatin immunoprecipitation (ChIP) technologies, protein–DNA interactions are routinely detected in the genome scale. Computational methods that detect weak protein-binding signals and s...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2935431/ https://www.ncbi.nlm.nih.gov/pubmed/20823314 http://dx.doi.org/10.1093/bioinformatics/btq379 |
_version_ | 1782186401445445632 |
---|---|
author | Chen, Kuan-Bei Zhang, Yu |
author_facet | Chen, Kuan-Bei Zhang, Yu |
author_sort | Chen, Kuan-Bei |
collection | PubMed |
description | Motivation: Gene regulation commonly involves interaction among DNA, proteins and biochemical conditions. Using chromatin immunoprecipitation (ChIP) technologies, protein–DNA interactions are routinely detected in the genome scale. Computational methods that detect weak protein-binding signals and simultaneously maintain a high specificity yet remain to be challenging. An attractive approach is to incorporate biologically relevant data, such as protein co-occupancy, to improve the power of protein-binding detection. We call the additional data related with the target protein binding as supporting tracks. Results: We propose a novel but rigorous statistical method to identify protein occupancy in ChIP data using multiple supporting tracks (PASS2). We demonstrate that utilizing biologically related information can significantly increase the discovery of true protein-binding sites, while still maintaining a desired level of false positive calls. Applying the method to GATA1 restoration in mouse erythroid cell line, we detected many new GATA1-binding sites using GATA1 co-occupancy data. Availability: http://stat.psu.edu/∼yuzhang/pass2.tar Contact: yuzhang@stat.psu.edu |
format | Text |
id | pubmed-2935431 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-29354312010-09-08 A varying threshold method for ChIP peak-calling using multiple sources of information Chen, Kuan-Bei Zhang, Yu Bioinformatics Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium Motivation: Gene regulation commonly involves interaction among DNA, proteins and biochemical conditions. Using chromatin immunoprecipitation (ChIP) technologies, protein–DNA interactions are routinely detected in the genome scale. Computational methods that detect weak protein-binding signals and simultaneously maintain a high specificity yet remain to be challenging. An attractive approach is to incorporate biologically relevant data, such as protein co-occupancy, to improve the power of protein-binding detection. We call the additional data related with the target protein binding as supporting tracks. Results: We propose a novel but rigorous statistical method to identify protein occupancy in ChIP data using multiple supporting tracks (PASS2). We demonstrate that utilizing biologically related information can significantly increase the discovery of true protein-binding sites, while still maintaining a desired level of false positive calls. Applying the method to GATA1 restoration in mouse erythroid cell line, we detected many new GATA1-binding sites using GATA1 co-occupancy data. Availability: http://stat.psu.edu/∼yuzhang/pass2.tar Contact: yuzhang@stat.psu.edu Oxford University Press 2010-09-15 2010-09-04 /pmc/articles/PMC2935431/ /pubmed/20823314 http://dx.doi.org/10.1093/bioinformatics/btq379 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium Chen, Kuan-Bei Zhang, Yu A varying threshold method for ChIP peak-calling using multiple sources of information |
title | A varying threshold method for ChIP peak-calling using multiple sources of information |
title_full | A varying threshold method for ChIP peak-calling using multiple sources of information |
title_fullStr | A varying threshold method for ChIP peak-calling using multiple sources of information |
title_full_unstemmed | A varying threshold method for ChIP peak-calling using multiple sources of information |
title_short | A varying threshold method for ChIP peak-calling using multiple sources of information |
title_sort | varying threshold method for chip peak-calling using multiple sources of information |
topic | Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2935431/ https://www.ncbi.nlm.nih.gov/pubmed/20823314 http://dx.doi.org/10.1093/bioinformatics/btq379 |
work_keys_str_mv | AT chenkuanbei avaryingthresholdmethodforchippeakcallingusingmultiplesourcesofinformation AT zhangyu avaryingthresholdmethodforchippeakcallingusingmultiplesourcesofinformation AT chenkuanbei varyingthresholdmethodforchippeakcallingusingmultiplesourcesofinformation AT zhangyu varyingthresholdmethodforchippeakcallingusingmultiplesourcesofinformation |