Cargando…
cnvCapSeq: detecting copy number variation in long-range targeted resequencing data
Targeted resequencing technologies have allowed for efficient and cost-effective detection of genomic variants in specific regions of interest. Although capture sequencing has been primarily used for investigating single nucleotide variants and indels, it has the potential to elucidate a broader spe...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4227763/ https://www.ncbi.nlm.nih.gov/pubmed/25228465 http://dx.doi.org/10.1093/nar/gku849 |
_version_ | 1782343869298376704 |
---|---|
author | Bellos, Evangelos Kumar, Vikrant Lin, Clarabelle Maggi, Jordi Phua, Zai Yang Cheng, Ching-Yu Cheung, Chui Ming Gemmy Hibberd, Martin L. Wong, Tien Yin Coin, Lachlan J. M. Davila, Sonia |
author_facet | Bellos, Evangelos Kumar, Vikrant Lin, Clarabelle Maggi, Jordi Phua, Zai Yang Cheng, Ching-Yu Cheung, Chui Ming Gemmy Hibberd, Martin L. Wong, Tien Yin Coin, Lachlan J. M. Davila, Sonia |
author_sort | Bellos, Evangelos |
collection | PubMed |
description | Targeted resequencing technologies have allowed for efficient and cost-effective detection of genomic variants in specific regions of interest. Although capture sequencing has been primarily used for investigating single nucleotide variants and indels, it has the potential to elucidate a broader spectrum of genetic variation, including copy number variants (CNVs). Various methods exist for detecting CNV in whole-genome and exome sequencing datasets. However, no algorithms have been specifically designed for contiguous target sequencing, despite its increasing importance in clinical and research applications. We have developed cnvCapSeq, a novel method for accurate and sensitive CNV discovery and genotyping in long-range targeted resequencing. cnvCapSeq was benchmarked using a simulated contiguous capture sequencing dataset comprising 21 genomic loci of various lengths. cnvCapSeq was shown to outperform the best existing exome CNV method by a wide margin both in terms of sensitivity (92.0 versus 48.3%) and specificity (99.8 versus 70.5%). We also applied cnvCapSeq to a real capture sequencing cohort comprising a contiguous 358 kb region that contains the Complement Factor H gene cluster. In this dataset, cnvCapSeq identified 41 samples with CNV, including two with duplications, with a genotyping accuracy of 99%, as ascertained by quantitative real-time PCR. |
format | Online Article Text |
id | pubmed-4227763 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-42277632014-11-21 cnvCapSeq: detecting copy number variation in long-range targeted resequencing data Bellos, Evangelos Kumar, Vikrant Lin, Clarabelle Maggi, Jordi Phua, Zai Yang Cheng, Ching-Yu Cheung, Chui Ming Gemmy Hibberd, Martin L. Wong, Tien Yin Coin, Lachlan J. M. Davila, Sonia Nucleic Acids Res Methods Online Targeted resequencing technologies have allowed for efficient and cost-effective detection of genomic variants in specific regions of interest. Although capture sequencing has been primarily used for investigating single nucleotide variants and indels, it has the potential to elucidate a broader spectrum of genetic variation, including copy number variants (CNVs). Various methods exist for detecting CNV in whole-genome and exome sequencing datasets. However, no algorithms have been specifically designed for contiguous target sequencing, despite its increasing importance in clinical and research applications. We have developed cnvCapSeq, a novel method for accurate and sensitive CNV discovery and genotyping in long-range targeted resequencing. cnvCapSeq was benchmarked using a simulated contiguous capture sequencing dataset comprising 21 genomic loci of various lengths. cnvCapSeq was shown to outperform the best existing exome CNV method by a wide margin both in terms of sensitivity (92.0 versus 48.3%) and specificity (99.8 versus 70.5%). We also applied cnvCapSeq to a real capture sequencing cohort comprising a contiguous 358 kb region that contains the Complement Factor H gene cluster. In this dataset, cnvCapSeq identified 41 samples with CNV, including two with duplications, with a genotyping accuracy of 99%, as ascertained by quantitative real-time PCR. Oxford University Press 2014-11-10 2014-09-16 /pmc/articles/PMC4227763/ /pubmed/25228465 http://dx.doi.org/10.1093/nar/gku849 Text en © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Bellos, Evangelos Kumar, Vikrant Lin, Clarabelle Maggi, Jordi Phua, Zai Yang Cheng, Ching-Yu Cheung, Chui Ming Gemmy Hibberd, Martin L. Wong, Tien Yin Coin, Lachlan J. M. Davila, Sonia cnvCapSeq: detecting copy number variation in long-range targeted resequencing data |
title | cnvCapSeq: detecting copy number variation in long-range targeted resequencing data |
title_full | cnvCapSeq: detecting copy number variation in long-range targeted resequencing data |
title_fullStr | cnvCapSeq: detecting copy number variation in long-range targeted resequencing data |
title_full_unstemmed | cnvCapSeq: detecting copy number variation in long-range targeted resequencing data |
title_short | cnvCapSeq: detecting copy number variation in long-range targeted resequencing data |
title_sort | cnvcapseq: detecting copy number variation in long-range targeted resequencing data |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4227763/ https://www.ncbi.nlm.nih.gov/pubmed/25228465 http://dx.doi.org/10.1093/nar/gku849 |
work_keys_str_mv | AT bellosevangelos cnvcapseqdetectingcopynumbervariationinlongrangetargetedresequencingdata AT kumarvikrant cnvcapseqdetectingcopynumbervariationinlongrangetargetedresequencingdata AT linclarabelle cnvcapseqdetectingcopynumbervariationinlongrangetargetedresequencingdata AT maggijordi cnvcapseqdetectingcopynumbervariationinlongrangetargetedresequencingdata AT phuazaiyang cnvcapseqdetectingcopynumbervariationinlongrangetargetedresequencingdata AT chengchingyu cnvcapseqdetectingcopynumbervariationinlongrangetargetedresequencingdata AT cheungchuiminggemmy cnvcapseqdetectingcopynumbervariationinlongrangetargetedresequencingdata AT hibberdmartinl cnvcapseqdetectingcopynumbervariationinlongrangetargetedresequencingdata AT wongtienyin cnvcapseqdetectingcopynumbervariationinlongrangetargetedresequencingdata AT coinlachlanjm cnvcapseqdetectingcopynumbervariationinlongrangetargetedresequencingdata AT davilasonia cnvcapseqdetectingcopynumbervariationinlongrangetargetedresequencingdata |