Cargando…
Copy number variation detection using next generation sequencing read counts
BACKGROUND: A copy number variation (CNV) is a difference between genotypes in the number of copies of a genomic region. Next generation sequencing (NGS) technologies provide sensitive and accurate tools for detecting genomic variations that include CNVs. However, statistical approaches for CNV iden...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4021345/ https://www.ncbi.nlm.nih.gov/pubmed/24731174 http://dx.doi.org/10.1186/1471-2105-15-109 |
_version_ | 1782316223128666112 |
---|---|
author | Wang, Heng Nettleton, Dan Ying, Kai |
author_facet | Wang, Heng Nettleton, Dan Ying, Kai |
author_sort | Wang, Heng |
collection | PubMed |
description | BACKGROUND: A copy number variation (CNV) is a difference between genotypes in the number of copies of a genomic region. Next generation sequencing (NGS) technologies provide sensitive and accurate tools for detecting genomic variations that include CNVs. However, statistical approaches for CNV identification using NGS are limited. We propose a new methodology for detecting CNVs using NGS data. This method (henceforth denoted by m-HMM) is based on a hidden Markov model with emission probabilities that are governed by mixture distributions. We use the Expectation-Maximization (EM) algorithm to estimate the parameters in the model. RESULTS: A simulation study demonstrates that our proposed m-HMM approach has greater power for detecting copy number gains and losses relative to existing methods. Furthermore, application of our m-HMM to DNA sequencing data from the two maize inbred lines B73 and Mo17 to identify CNVs that may play a role in creating phenotypic differences between these inbred lines provides results concordant with previous array-based efforts to identify CNVs. CONCLUSIONS: The new m-HMM method is a powerful and practical approach for identifying CNVs from NGS data. |
format | Online Article Text |
id | pubmed-4021345 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40213452014-05-28 Copy number variation detection using next generation sequencing read counts Wang, Heng Nettleton, Dan Ying, Kai BMC Bioinformatics Research Article BACKGROUND: A copy number variation (CNV) is a difference between genotypes in the number of copies of a genomic region. Next generation sequencing (NGS) technologies provide sensitive and accurate tools for detecting genomic variations that include CNVs. However, statistical approaches for CNV identification using NGS are limited. We propose a new methodology for detecting CNVs using NGS data. This method (henceforth denoted by m-HMM) is based on a hidden Markov model with emission probabilities that are governed by mixture distributions. We use the Expectation-Maximization (EM) algorithm to estimate the parameters in the model. RESULTS: A simulation study demonstrates that our proposed m-HMM approach has greater power for detecting copy number gains and losses relative to existing methods. Furthermore, application of our m-HMM to DNA sequencing data from the two maize inbred lines B73 and Mo17 to identify CNVs that may play a role in creating phenotypic differences between these inbred lines provides results concordant with previous array-based efforts to identify CNVs. CONCLUSIONS: The new m-HMM method is a powerful and practical approach for identifying CNVs from NGS data. BioMed Central 2014-04-14 /pmc/articles/PMC4021345/ /pubmed/24731174 http://dx.doi.org/10.1186/1471-2105-15-109 Text en Copyright © 2014 Wang et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. |
spellingShingle | Research Article Wang, Heng Nettleton, Dan Ying, Kai Copy number variation detection using next generation sequencing read counts |
title | Copy number variation detection using next generation sequencing read counts |
title_full | Copy number variation detection using next generation sequencing read counts |
title_fullStr | Copy number variation detection using next generation sequencing read counts |
title_full_unstemmed | Copy number variation detection using next generation sequencing read counts |
title_short | Copy number variation detection using next generation sequencing read counts |
title_sort | copy number variation detection using next generation sequencing read counts |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4021345/ https://www.ncbi.nlm.nih.gov/pubmed/24731174 http://dx.doi.org/10.1186/1471-2105-15-109 |
work_keys_str_mv | AT wangheng copynumbervariationdetectionusingnextgenerationsequencingreadcounts AT nettletondan copynumbervariationdetectionusingnextgenerationsequencingreadcounts AT yingkai copynumbervariationdetectionusingnextgenerationsequencingreadcounts |