Cargando…

Copy number variation detection using next generation sequencing read counts

BACKGROUND: A copy number variation (CNV) is a difference between genotypes in the number of copies of a genomic region. Next generation sequencing (NGS) technologies provide sensitive and accurate tools for detecting genomic variations that include CNVs. However, statistical approaches for CNV iden...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Heng, Nettleton, Dan, Ying, Kai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4021345/
https://www.ncbi.nlm.nih.gov/pubmed/24731174
http://dx.doi.org/10.1186/1471-2105-15-109
_version_ 1782316223128666112
author Wang, Heng
Nettleton, Dan
Ying, Kai
author_facet Wang, Heng
Nettleton, Dan
Ying, Kai
author_sort Wang, Heng
collection PubMed
description BACKGROUND: A copy number variation (CNV) is a difference between genotypes in the number of copies of a genomic region. Next generation sequencing (NGS) technologies provide sensitive and accurate tools for detecting genomic variations that include CNVs. However, statistical approaches for CNV identification using NGS are limited. We propose a new methodology for detecting CNVs using NGS data. This method (henceforth denoted by m-HMM) is based on a hidden Markov model with emission probabilities that are governed by mixture distributions. We use the Expectation-Maximization (EM) algorithm to estimate the parameters in the model. RESULTS: A simulation study demonstrates that our proposed m-HMM approach has greater power for detecting copy number gains and losses relative to existing methods. Furthermore, application of our m-HMM to DNA sequencing data from the two maize inbred lines B73 and Mo17 to identify CNVs that may play a role in creating phenotypic differences between these inbred lines provides results concordant with previous array-based efforts to identify CNVs. CONCLUSIONS: The new m-HMM method is a powerful and practical approach for identifying CNVs from NGS data.
format Online
Article
Text
id pubmed-4021345
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40213452014-05-28 Copy number variation detection using next generation sequencing read counts Wang, Heng Nettleton, Dan Ying, Kai BMC Bioinformatics Research Article BACKGROUND: A copy number variation (CNV) is a difference between genotypes in the number of copies of a genomic region. Next generation sequencing (NGS) technologies provide sensitive and accurate tools for detecting genomic variations that include CNVs. However, statistical approaches for CNV identification using NGS are limited. We propose a new methodology for detecting CNVs using NGS data. This method (henceforth denoted by m-HMM) is based on a hidden Markov model with emission probabilities that are governed by mixture distributions. We use the Expectation-Maximization (EM) algorithm to estimate the parameters in the model. RESULTS: A simulation study demonstrates that our proposed m-HMM approach has greater power for detecting copy number gains and losses relative to existing methods. Furthermore, application of our m-HMM to DNA sequencing data from the two maize inbred lines B73 and Mo17 to identify CNVs that may play a role in creating phenotypic differences between these inbred lines provides results concordant with previous array-based efforts to identify CNVs. CONCLUSIONS: The new m-HMM method is a powerful and practical approach for identifying CNVs from NGS data. BioMed Central 2014-04-14 /pmc/articles/PMC4021345/ /pubmed/24731174 http://dx.doi.org/10.1186/1471-2105-15-109 Text en Copyright © 2014 Wang et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.
spellingShingle Research Article
Wang, Heng
Nettleton, Dan
Ying, Kai
Copy number variation detection using next generation sequencing read counts
title Copy number variation detection using next generation sequencing read counts
title_full Copy number variation detection using next generation sequencing read counts
title_fullStr Copy number variation detection using next generation sequencing read counts
title_full_unstemmed Copy number variation detection using next generation sequencing read counts
title_short Copy number variation detection using next generation sequencing read counts
title_sort copy number variation detection using next generation sequencing read counts
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4021345/
https://www.ncbi.nlm.nih.gov/pubmed/24731174
http://dx.doi.org/10.1186/1471-2105-15-109
work_keys_str_mv AT wangheng copynumbervariationdetectionusingnextgenerationsequencingreadcounts
AT nettletondan copynumbervariationdetectionusingnextgenerationsequencingreadcounts
AT yingkai copynumbervariationdetectionusingnextgenerationsequencingreadcounts