Cargando…

cnvCapSeq: detecting copy number variation in long-range targeted resequencing data

Targeted resequencing technologies have allowed for efficient and cost-effective detection of genomic variants in specific regions of interest. Although capture sequencing has been primarily used for investigating single nucleotide variants and indels, it has the potential to elucidate a broader spe...

Descripción completa

Detalles Bibliográficos
Autores principales: Bellos, Evangelos, Kumar, Vikrant, Lin, Clarabelle, Maggi, Jordi, Phua, Zai Yang, Cheng, Ching-Yu, Cheung, Chui Ming Gemmy, Hibberd, Martin L., Wong, Tien Yin, Coin, Lachlan J. M., Davila, Sonia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4227763/
https://www.ncbi.nlm.nih.gov/pubmed/25228465
http://dx.doi.org/10.1093/nar/gku849
_version_ 1782343869298376704
author Bellos, Evangelos
Kumar, Vikrant
Lin, Clarabelle
Maggi, Jordi
Phua, Zai Yang
Cheng, Ching-Yu
Cheung, Chui Ming Gemmy
Hibberd, Martin L.
Wong, Tien Yin
Coin, Lachlan J. M.
Davila, Sonia
author_facet Bellos, Evangelos
Kumar, Vikrant
Lin, Clarabelle
Maggi, Jordi
Phua, Zai Yang
Cheng, Ching-Yu
Cheung, Chui Ming Gemmy
Hibberd, Martin L.
Wong, Tien Yin
Coin, Lachlan J. M.
Davila, Sonia
author_sort Bellos, Evangelos
collection PubMed
description Targeted resequencing technologies have allowed for efficient and cost-effective detection of genomic variants in specific regions of interest. Although capture sequencing has been primarily used for investigating single nucleotide variants and indels, it has the potential to elucidate a broader spectrum of genetic variation, including copy number variants (CNVs). Various methods exist for detecting CNV in whole-genome and exome sequencing datasets. However, no algorithms have been specifically designed for contiguous target sequencing, despite its increasing importance in clinical and research applications. We have developed cnvCapSeq, a novel method for accurate and sensitive CNV discovery and genotyping in long-range targeted resequencing. cnvCapSeq was benchmarked using a simulated contiguous capture sequencing dataset comprising 21 genomic loci of various lengths. cnvCapSeq was shown to outperform the best existing exome CNV method by a wide margin both in terms of sensitivity (92.0 versus 48.3%) and specificity (99.8 versus 70.5%). We also applied cnvCapSeq to a real capture sequencing cohort comprising a contiguous 358 kb region that contains the Complement Factor H gene cluster. In this dataset, cnvCapSeq identified 41 samples with CNV, including two with duplications, with a genotyping accuracy of 99%, as ascertained by quantitative real-time PCR.
format Online
Article
Text
id pubmed-4227763
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-42277632014-11-21 cnvCapSeq: detecting copy number variation in long-range targeted resequencing data Bellos, Evangelos Kumar, Vikrant Lin, Clarabelle Maggi, Jordi Phua, Zai Yang Cheng, Ching-Yu Cheung, Chui Ming Gemmy Hibberd, Martin L. Wong, Tien Yin Coin, Lachlan J. M. Davila, Sonia Nucleic Acids Res Methods Online Targeted resequencing technologies have allowed for efficient and cost-effective detection of genomic variants in specific regions of interest. Although capture sequencing has been primarily used for investigating single nucleotide variants and indels, it has the potential to elucidate a broader spectrum of genetic variation, including copy number variants (CNVs). Various methods exist for detecting CNV in whole-genome and exome sequencing datasets. However, no algorithms have been specifically designed for contiguous target sequencing, despite its increasing importance in clinical and research applications. We have developed cnvCapSeq, a novel method for accurate and sensitive CNV discovery and genotyping in long-range targeted resequencing. cnvCapSeq was benchmarked using a simulated contiguous capture sequencing dataset comprising 21 genomic loci of various lengths. cnvCapSeq was shown to outperform the best existing exome CNV method by a wide margin both in terms of sensitivity (92.0 versus 48.3%) and specificity (99.8 versus 70.5%). We also applied cnvCapSeq to a real capture sequencing cohort comprising a contiguous 358 kb region that contains the Complement Factor H gene cluster. In this dataset, cnvCapSeq identified 41 samples with CNV, including two with duplications, with a genotyping accuracy of 99%, as ascertained by quantitative real-time PCR. Oxford University Press 2014-11-10 2014-09-16 /pmc/articles/PMC4227763/ /pubmed/25228465 http://dx.doi.org/10.1093/nar/gku849 Text en © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Bellos, Evangelos
Kumar, Vikrant
Lin, Clarabelle
Maggi, Jordi
Phua, Zai Yang
Cheng, Ching-Yu
Cheung, Chui Ming Gemmy
Hibberd, Martin L.
Wong, Tien Yin
Coin, Lachlan J. M.
Davila, Sonia
cnvCapSeq: detecting copy number variation in long-range targeted resequencing data
title cnvCapSeq: detecting copy number variation in long-range targeted resequencing data
title_full cnvCapSeq: detecting copy number variation in long-range targeted resequencing data
title_fullStr cnvCapSeq: detecting copy number variation in long-range targeted resequencing data
title_full_unstemmed cnvCapSeq: detecting copy number variation in long-range targeted resequencing data
title_short cnvCapSeq: detecting copy number variation in long-range targeted resequencing data
title_sort cnvcapseq: detecting copy number variation in long-range targeted resequencing data
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4227763/
https://www.ncbi.nlm.nih.gov/pubmed/25228465
http://dx.doi.org/10.1093/nar/gku849
work_keys_str_mv AT bellosevangelos cnvcapseqdetectingcopynumbervariationinlongrangetargetedresequencingdata
AT kumarvikrant cnvcapseqdetectingcopynumbervariationinlongrangetargetedresequencingdata
AT linclarabelle cnvcapseqdetectingcopynumbervariationinlongrangetargetedresequencingdata
AT maggijordi cnvcapseqdetectingcopynumbervariationinlongrangetargetedresequencingdata
AT phuazaiyang cnvcapseqdetectingcopynumbervariationinlongrangetargetedresequencingdata
AT chengchingyu cnvcapseqdetectingcopynumbervariationinlongrangetargetedresequencingdata
AT cheungchuiminggemmy cnvcapseqdetectingcopynumbervariationinlongrangetargetedresequencingdata
AT hibberdmartinl cnvcapseqdetectingcopynumbervariationinlongrangetargetedresequencingdata
AT wongtienyin cnvcapseqdetectingcopynumbervariationinlongrangetargetedresequencingdata
AT coinlachlanjm cnvcapseqdetectingcopynumbervariationinlongrangetargetedresequencingdata
AT davilasonia cnvcapseqdetectingcopynumbervariationinlongrangetargetedresequencingdata