Cargando…

CoNVEX: copy number variation estimation in exome sequencing data using HMM

BACKGROUND: One of the main types of genetic variations in cancer is Copy Number Variations (CNV). Whole exome sequenicng (WES) is a popular alternative to whole genome sequencing (WGS) to study disease specific genomic variations. However, finding CNV in Cancer samples using WES data has not been f...

Descripción completa

Detalles Bibliográficos
Autores principales: Amarasinghe, Kaushalya C, Li, Jason, Halgamuge, Saman K
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3549847/
https://www.ncbi.nlm.nih.gov/pubmed/23368785
http://dx.doi.org/10.1186/1471-2105-14-S2-S2
_version_ 1782256483911598080
author Amarasinghe, Kaushalya C
Li, Jason
Halgamuge, Saman K
author_facet Amarasinghe, Kaushalya C
Li, Jason
Halgamuge, Saman K
author_sort Amarasinghe, Kaushalya C
collection PubMed
description BACKGROUND: One of the main types of genetic variations in cancer is Copy Number Variations (CNV). Whole exome sequenicng (WES) is a popular alternative to whole genome sequencing (WGS) to study disease specific genomic variations. However, finding CNV in Cancer samples using WES data has not been fully explored. RESULTS: We present a new method, called CoNVEX, to estimate copy number variation in whole exome sequencing data. It uses ratio of tumour and matched normal average read depths at each exonic region, to predict the copy gain or loss. The useful signal produced by WES data will be hindered by the intrinsic noise present in the data itself. This limits its capacity to be used as a highly reliable CNV detection source. Here, we propose a method that consists of discrete wavelet transform (DWT) to reduce noise. The identification of copy number gains/losses of each targeted region is performed by a Hidden Markov Model (HMM). CONCLUSION: HMM is frequently used to identify CNV in data produced by various technologies including Array Comparative Genomic Hybridization (aCGH) and WGS. Here, we propose an HMM to detect CNV in cancer exome data. We used modified data from 1000 Genomes project to evaluate the performance of the proposed method. Using these data we have shown that CoNVEX outperforms the existing methods significantly in terms of precision. Overall, CoNVEX achieved a sensitivity of more than 92% and a precision of more than 50%.
format Online
Article
Text
id pubmed-3549847
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35498472013-01-23 CoNVEX: copy number variation estimation in exome sequencing data using HMM Amarasinghe, Kaushalya C Li, Jason Halgamuge, Saman K BMC Bioinformatics Proceedings BACKGROUND: One of the main types of genetic variations in cancer is Copy Number Variations (CNV). Whole exome sequenicng (WES) is a popular alternative to whole genome sequencing (WGS) to study disease specific genomic variations. However, finding CNV in Cancer samples using WES data has not been fully explored. RESULTS: We present a new method, called CoNVEX, to estimate copy number variation in whole exome sequencing data. It uses ratio of tumour and matched normal average read depths at each exonic region, to predict the copy gain or loss. The useful signal produced by WES data will be hindered by the intrinsic noise present in the data itself. This limits its capacity to be used as a highly reliable CNV detection source. Here, we propose a method that consists of discrete wavelet transform (DWT) to reduce noise. The identification of copy number gains/losses of each targeted region is performed by a Hidden Markov Model (HMM). CONCLUSION: HMM is frequently used to identify CNV in data produced by various technologies including Array Comparative Genomic Hybridization (aCGH) and WGS. Here, we propose an HMM to detect CNV in cancer exome data. We used modified data from 1000 Genomes project to evaluate the performance of the proposed method. Using these data we have shown that CoNVEX outperforms the existing methods significantly in terms of precision. Overall, CoNVEX achieved a sensitivity of more than 92% and a precision of more than 50%. BioMed Central 2013-01-21 /pmc/articles/PMC3549847/ /pubmed/23368785 http://dx.doi.org/10.1186/1471-2105-14-S2-S2 Text en Copyright ©2013 Amarasinghe et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Amarasinghe, Kaushalya C
Li, Jason
Halgamuge, Saman K
CoNVEX: copy number variation estimation in exome sequencing data using HMM
title CoNVEX: copy number variation estimation in exome sequencing data using HMM
title_full CoNVEX: copy number variation estimation in exome sequencing data using HMM
title_fullStr CoNVEX: copy number variation estimation in exome sequencing data using HMM
title_full_unstemmed CoNVEX: copy number variation estimation in exome sequencing data using HMM
title_short CoNVEX: copy number variation estimation in exome sequencing data using HMM
title_sort convex: copy number variation estimation in exome sequencing data using hmm
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3549847/
https://www.ncbi.nlm.nih.gov/pubmed/23368785
http://dx.doi.org/10.1186/1471-2105-14-S2-S2
work_keys_str_mv AT amarasinghekaushalyac convexcopynumbervariationestimationinexomesequencingdatausinghmm
AT lijason convexcopynumbervariationestimationinexomesequencingdatausinghmm
AT halgamugesamank convexcopynumbervariationestimationinexomesequencingdatausinghmm