Cargando…

CONTRA: copy number analysis for targeted resequencing

Motivation: In light of the increasing adoption of targeted resequencing (TR) as a cost-effective strategy to identify disease-causing variants, a robust method for copy number variation (CNV) analysis is needed to maximize the value of this promising technology. Results: We present a method for CNV...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Jason, Lupat, Richard, Amarasinghe, Kaushalya C., Thompson, Ella R., Doyle, Maria A., Ryland, Georgina L., Tothill, Richard W., Halgamuge, Saman K., Campbell, Ian G., Gorringe, Kylie L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3348560/
https://www.ncbi.nlm.nih.gov/pubmed/22474122
http://dx.doi.org/10.1093/bioinformatics/bts146
_version_ 1782232398803501056
author Li, Jason
Lupat, Richard
Amarasinghe, Kaushalya C.
Thompson, Ella R.
Doyle, Maria A.
Ryland, Georgina L.
Tothill, Richard W.
Halgamuge, Saman K.
Campbell, Ian G.
Gorringe, Kylie L.
author_facet Li, Jason
Lupat, Richard
Amarasinghe, Kaushalya C.
Thompson, Ella R.
Doyle, Maria A.
Ryland, Georgina L.
Tothill, Richard W.
Halgamuge, Saman K.
Campbell, Ian G.
Gorringe, Kylie L.
author_sort Li, Jason
collection PubMed
description Motivation: In light of the increasing adoption of targeted resequencing (TR) as a cost-effective strategy to identify disease-causing variants, a robust method for copy number variation (CNV) analysis is needed to maximize the value of this promising technology. Results: We present a method for CNV detection for TR data, including whole-exome capture data. Our method calls copy number gains and losses for each target region based on normalized depth of coverage. Our key strategies include the use of base-level log-ratios to remove GC-content bias, correction for an imbalanced library size effect on log-ratios, and the estimation of log-ratio variations via binning and interpolation. Our methods are made available via CONTRA (COpy Number Targeted Resequencing Analysis), a software package that takes standard alignment formats (BAM/SAM) and outputs in variant call format (VCF4.0), for easy integration with other next-generation sequencing analysis packages. We assessed our methods using samples from seven different target enrichment assays, and evaluated our results using simulated data and real germline data with known CNV genotypes. Availability and implementation: Source code and sample data are freely available under GNU license (GPLv3) at http://contra-cnv.sourceforge.net/ Contact: Jason.Li@petermac.org Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3348560
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-33485602012-05-09 CONTRA: copy number analysis for targeted resequencing Li, Jason Lupat, Richard Amarasinghe, Kaushalya C. Thompson, Ella R. Doyle, Maria A. Ryland, Georgina L. Tothill, Richard W. Halgamuge, Saman K. Campbell, Ian G. Gorringe, Kylie L. Bioinformatics Original Papers Motivation: In light of the increasing adoption of targeted resequencing (TR) as a cost-effective strategy to identify disease-causing variants, a robust method for copy number variation (CNV) analysis is needed to maximize the value of this promising technology. Results: We present a method for CNV detection for TR data, including whole-exome capture data. Our method calls copy number gains and losses for each target region based on normalized depth of coverage. Our key strategies include the use of base-level log-ratios to remove GC-content bias, correction for an imbalanced library size effect on log-ratios, and the estimation of log-ratio variations via binning and interpolation. Our methods are made available via CONTRA (COpy Number Targeted Resequencing Analysis), a software package that takes standard alignment formats (BAM/SAM) and outputs in variant call format (VCF4.0), for easy integration with other next-generation sequencing analysis packages. We assessed our methods using samples from seven different target enrichment assays, and evaluated our results using simulated data and real germline data with known CNV genotypes. Availability and implementation: Source code and sample data are freely available under GNU license (GPLv3) at http://contra-cnv.sourceforge.net/ Contact: Jason.Li@petermac.org Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2012-05-15 2012-04-02 /pmc/articles/PMC3348560/ /pubmed/22474122 http://dx.doi.org/10.1093/bioinformatics/bts146 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Li, Jason
Lupat, Richard
Amarasinghe, Kaushalya C.
Thompson, Ella R.
Doyle, Maria A.
Ryland, Georgina L.
Tothill, Richard W.
Halgamuge, Saman K.
Campbell, Ian G.
Gorringe, Kylie L.
CONTRA: copy number analysis for targeted resequencing
title CONTRA: copy number analysis for targeted resequencing
title_full CONTRA: copy number analysis for targeted resequencing
title_fullStr CONTRA: copy number analysis for targeted resequencing
title_full_unstemmed CONTRA: copy number analysis for targeted resequencing
title_short CONTRA: copy number analysis for targeted resequencing
title_sort contra: copy number analysis for targeted resequencing
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3348560/
https://www.ncbi.nlm.nih.gov/pubmed/22474122
http://dx.doi.org/10.1093/bioinformatics/bts146
work_keys_str_mv AT lijason contracopynumberanalysisfortargetedresequencing
AT lupatrichard contracopynumberanalysisfortargetedresequencing
AT amarasinghekaushalyac contracopynumberanalysisfortargetedresequencing
AT thompsonellar contracopynumberanalysisfortargetedresequencing
AT doylemariaa contracopynumberanalysisfortargetedresequencing
AT rylandgeorginal contracopynumberanalysisfortargetedresequencing
AT tothillrichardw contracopynumberanalysisfortargetedresequencing
AT halgamugesamank contracopynumberanalysisfortargetedresequencing
AT campbelliang contracopynumberanalysisfortargetedresequencing
AT gorringekyliel contracopynumberanalysisfortargetedresequencing