Cargando…
ChIP-PaM: an algorithm to identify protein-DNA interaction using ChIP-Seq data
BACKGROUND: ChIP-Seq is a powerful tool for identifying the interaction between genomic regulators and their bound DNAs, especially for locating transcription factor binding sites. However, high cost and high rate of false discovery of transcription factor binding sites identified from ChIP-Seq data...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2893127/ https://www.ncbi.nlm.nih.gov/pubmed/20525272 http://dx.doi.org/10.1186/1742-4682-7-18 |
_version_ | 1782183008980172800 |
---|---|
author | Wu, Song Wang, Jianmin Zhao, Wei Pounds, Stanley Cheng, Cheng |
author_facet | Wu, Song Wang, Jianmin Zhao, Wei Pounds, Stanley Cheng, Cheng |
author_sort | Wu, Song |
collection | PubMed |
description | BACKGROUND: ChIP-Seq is a powerful tool for identifying the interaction between genomic regulators and their bound DNAs, especially for locating transcription factor binding sites. However, high cost and high rate of false discovery of transcription factor binding sites identified from ChIP-Seq data significantly limit its application. RESULTS: Here we report a new algorithm, ChIP-PaM, for identifying transcription factor target regions in ChIP-Seq datasets. This algorithm makes full use of a protein-DNA binding pattern by capitalizing on three lines of evidence: 1) the tag count modelling at the peak position, 2) pattern matching of a specific tag count distribution, and 3) motif searching along the genome. A novel data-based two-step eFDR procedure is proposed to integrate the three lines of evidence to determine significantly enriched regions. Our algorithm requires no technical controls and efficiently discriminates falsely enriched regions from regions enriched by true transcription factor (TF) binding on the basis of ChIP-Seq data only. An analysis of real genomic data is presented to demonstrate our method. CONCLUSIONS: In a comparison with other existing methods, we found that our algorithm provides more accurate binding site discovery while maintaining comparable statistical power. |
format | Text |
id | pubmed-2893127 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-28931272010-06-29 ChIP-PaM: an algorithm to identify protein-DNA interaction using ChIP-Seq data Wu, Song Wang, Jianmin Zhao, Wei Pounds, Stanley Cheng, Cheng Theor Biol Med Model Research BACKGROUND: ChIP-Seq is a powerful tool for identifying the interaction between genomic regulators and their bound DNAs, especially for locating transcription factor binding sites. However, high cost and high rate of false discovery of transcription factor binding sites identified from ChIP-Seq data significantly limit its application. RESULTS: Here we report a new algorithm, ChIP-PaM, for identifying transcription factor target regions in ChIP-Seq datasets. This algorithm makes full use of a protein-DNA binding pattern by capitalizing on three lines of evidence: 1) the tag count modelling at the peak position, 2) pattern matching of a specific tag count distribution, and 3) motif searching along the genome. A novel data-based two-step eFDR procedure is proposed to integrate the three lines of evidence to determine significantly enriched regions. Our algorithm requires no technical controls and efficiently discriminates falsely enriched regions from regions enriched by true transcription factor (TF) binding on the basis of ChIP-Seq data only. An analysis of real genomic data is presented to demonstrate our method. CONCLUSIONS: In a comparison with other existing methods, we found that our algorithm provides more accurate binding site discovery while maintaining comparable statistical power. BioMed Central 2010-06-03 /pmc/articles/PMC2893127/ /pubmed/20525272 http://dx.doi.org/10.1186/1742-4682-7-18 Text en Copyright ©2010 Wu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Wu, Song Wang, Jianmin Zhao, Wei Pounds, Stanley Cheng, Cheng ChIP-PaM: an algorithm to identify protein-DNA interaction using ChIP-Seq data |
title | ChIP-PaM: an algorithm to identify protein-DNA interaction using ChIP-Seq data |
title_full | ChIP-PaM: an algorithm to identify protein-DNA interaction using ChIP-Seq data |
title_fullStr | ChIP-PaM: an algorithm to identify protein-DNA interaction using ChIP-Seq data |
title_full_unstemmed | ChIP-PaM: an algorithm to identify protein-DNA interaction using ChIP-Seq data |
title_short | ChIP-PaM: an algorithm to identify protein-DNA interaction using ChIP-Seq data |
title_sort | chip-pam: an algorithm to identify protein-dna interaction using chip-seq data |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2893127/ https://www.ncbi.nlm.nih.gov/pubmed/20525272 http://dx.doi.org/10.1186/1742-4682-7-18 |
work_keys_str_mv | AT wusong chippamanalgorithmtoidentifyproteindnainteractionusingchipseqdata AT wangjianmin chippamanalgorithmtoidentifyproteindnainteractionusingchipseqdata AT zhaowei chippamanalgorithmtoidentifyproteindnainteractionusingchipseqdata AT poundsstanley chippamanalgorithmtoidentifyproteindnainteractionusingchipseqdata AT chengcheng chippamanalgorithmtoidentifyproteindnainteractionusingchipseqdata |