Cargando…

A statistical approach to detection of copy number variations in PCR-enriched targeted sequencing data

BACKGROUND: Multiplex polymerase chain reaction (PCR) is a common enrichment technique for targeted massive parallel sequencing (MPS) protocols. MPS is widely used in biomedical research and clinical diagnostics as the fast and accurate tool for the detection of short genetic variations. However, id...

Descripción completa

Detalles Bibliográficos
Autores principales: Demidov, German, Simakova, Tamara, Vnuchkova, Julia, Bragin, Anton
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5075217/
https://www.ncbi.nlm.nih.gov/pubmed/27770783
http://dx.doi.org/10.1186/s12859-016-1272-6
_version_ 1782461824286851072
author Demidov, German
Simakova, Tamara
Vnuchkova, Julia
Bragin, Anton
author_facet Demidov, German
Simakova, Tamara
Vnuchkova, Julia
Bragin, Anton
author_sort Demidov, German
collection PubMed
description BACKGROUND: Multiplex polymerase chain reaction (PCR) is a common enrichment technique for targeted massive parallel sequencing (MPS) protocols. MPS is widely used in biomedical research and clinical diagnostics as the fast and accurate tool for the detection of short genetic variations. However, identification of larger variations such as structure variants and copy number variations (CNV) is still being a challenge for targeted MPS. Some approaches and tools for structural variants detection were proposed, but they have limitations and often require datasets of certain type, size and expected number of amplicons affected by CNVs. In the paper, we describe novel algorithm for high-resolution germinal CNV detection in the PCR-enriched targeted sequencing data and present accompanying tool. RESULTS: We have developed a machine learning algorithm for the detection of large duplications and deletions in the targeted sequencing data generated with PCR-based enrichment step. We have performed verification studies and established the algorithm’s sensitivity and specificity. We have compared developed tool with other available methods applicable for the described data and revealed its higher performance. CONCLUSION: We showed that our method has high specificity and sensitivity for high-resolution copy number detection in targeted sequencing data using large cohort of samples. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1272-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5075217
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-50752172016-10-27 A statistical approach to detection of copy number variations in PCR-enriched targeted sequencing data Demidov, German Simakova, Tamara Vnuchkova, Julia Bragin, Anton BMC Bioinformatics Research Article BACKGROUND: Multiplex polymerase chain reaction (PCR) is a common enrichment technique for targeted massive parallel sequencing (MPS) protocols. MPS is widely used in biomedical research and clinical diagnostics as the fast and accurate tool for the detection of short genetic variations. However, identification of larger variations such as structure variants and copy number variations (CNV) is still being a challenge for targeted MPS. Some approaches and tools for structural variants detection were proposed, but they have limitations and often require datasets of certain type, size and expected number of amplicons affected by CNVs. In the paper, we describe novel algorithm for high-resolution germinal CNV detection in the PCR-enriched targeted sequencing data and present accompanying tool. RESULTS: We have developed a machine learning algorithm for the detection of large duplications and deletions in the targeted sequencing data generated with PCR-based enrichment step. We have performed verification studies and established the algorithm’s sensitivity and specificity. We have compared developed tool with other available methods applicable for the described data and revealed its higher performance. CONCLUSION: We showed that our method has high specificity and sensitivity for high-resolution copy number detection in targeted sequencing data using large cohort of samples. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1272-6) contains supplementary material, which is available to authorized users. BioMed Central 2016-10-22 /pmc/articles/PMC5075217/ /pubmed/27770783 http://dx.doi.org/10.1186/s12859-016-1272-6 Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Demidov, German
Simakova, Tamara
Vnuchkova, Julia
Bragin, Anton
A statistical approach to detection of copy number variations in PCR-enriched targeted sequencing data
title A statistical approach to detection of copy number variations in PCR-enriched targeted sequencing data
title_full A statistical approach to detection of copy number variations in PCR-enriched targeted sequencing data
title_fullStr A statistical approach to detection of copy number variations in PCR-enriched targeted sequencing data
title_full_unstemmed A statistical approach to detection of copy number variations in PCR-enriched targeted sequencing data
title_short A statistical approach to detection of copy number variations in PCR-enriched targeted sequencing data
title_sort statistical approach to detection of copy number variations in pcr-enriched targeted sequencing data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5075217/
https://www.ncbi.nlm.nih.gov/pubmed/27770783
http://dx.doi.org/10.1186/s12859-016-1272-6
work_keys_str_mv AT demidovgerman astatisticalapproachtodetectionofcopynumbervariationsinpcrenrichedtargetedsequencingdata
AT simakovatamara astatisticalapproachtodetectionofcopynumbervariationsinpcrenrichedtargetedsequencingdata
AT vnuchkovajulia astatisticalapproachtodetectionofcopynumbervariationsinpcrenrichedtargetedsequencingdata
AT braginanton astatisticalapproachtodetectionofcopynumbervariationsinpcrenrichedtargetedsequencingdata
AT demidovgerman statisticalapproachtodetectionofcopynumbervariationsinpcrenrichedtargetedsequencingdata
AT simakovatamara statisticalapproachtodetectionofcopynumbervariationsinpcrenrichedtargetedsequencingdata
AT vnuchkovajulia statisticalapproachtodetectionofcopynumbervariationsinpcrenrichedtargetedsequencingdata
AT braginanton statisticalapproachtodetectionofcopynumbervariationsinpcrenrichedtargetedsequencingdata