Cargando…

An evaluation of copy number variation detection tools for cancer using whole exome sequencing data

BACKGROUND: Recently copy number variation (CNV) has gained considerable interest as a type of genomic/genetic variation that plays an important role in disease susceptibility. Advances in sequencing technology have created an opportunity for detecting CNVs more accurately. Recently whole exome sequ...

Descripción completa

Detalles Bibliográficos
Autores principales: Zare, Fatima, Dow, Michelle, Monteleone, Nicholas, Hosny, Abdelrahman, Nabavi, Sheida
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5452530/
https://www.ncbi.nlm.nih.gov/pubmed/28569140
http://dx.doi.org/10.1186/s12859-017-1705-x
_version_ 1783240451114926080
author Zare, Fatima
Dow, Michelle
Monteleone, Nicholas
Hosny, Abdelrahman
Nabavi, Sheida
author_facet Zare, Fatima
Dow, Michelle
Monteleone, Nicholas
Hosny, Abdelrahman
Nabavi, Sheida
author_sort Zare, Fatima
collection PubMed
description BACKGROUND: Recently copy number variation (CNV) has gained considerable interest as a type of genomic/genetic variation that plays an important role in disease susceptibility. Advances in sequencing technology have created an opportunity for detecting CNVs more accurately. Recently whole exome sequencing (WES) has become primary strategy for sequencing patient samples and study their genomics aberrations. However, compared to whole genome sequencing, WES introduces more biases and noise that make CNV detection very challenging. Additionally, tumors’ complexity makes the detection of cancer specific CNVs even more difficult. Although many CNV detection tools have been developed since introducing NGS data, there are few tools for somatic CNV detection for WES data in cancer. RESULTS: In this study, we evaluated the performance of the most recent and commonly used CNV detection tools for WES data in cancer to address their limitations and provide guidelines for developing new ones. We focused on the tools that have been designed or have the ability to detect cancer somatic aberrations. We compared the performance of the tools in terms of sensitivity and false discovery rate (FDR) using real data and simulated data. Comparative analysis of the results of the tools showed that there is a low consensus among the tools in calling CNVs. Using real data, tools show moderate sensitivity (~50% - ~80%), fair specificity (~70% - ~94%) and poor FDRs (~27% - ~60%). Also, using simulated data we observed that increasing the coverage more than 10× in exonic regions does not improve the detection power of the tools significantly. CONCLUSIONS: The limited performance of the current CNV detection tools for WES data in cancer indicates the need for developing more efficient and precise CNV detection methods. Due to the complexity of tumors and high level of noise and biases in WES data, employing advanced novel segmentation, normalization and de-noising techniques that are designed specifically for cancer data is necessary. Also, CNV detection development suffers from the lack of a gold standard for performance evaluation. Finally, developing tools with user-friendly user interfaces and visualization features can enhance CNV studies for a broader range of users. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1705-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5452530
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-54525302017-06-02 An evaluation of copy number variation detection tools for cancer using whole exome sequencing data Zare, Fatima Dow, Michelle Monteleone, Nicholas Hosny, Abdelrahman Nabavi, Sheida BMC Bioinformatics Research Article BACKGROUND: Recently copy number variation (CNV) has gained considerable interest as a type of genomic/genetic variation that plays an important role in disease susceptibility. Advances in sequencing technology have created an opportunity for detecting CNVs more accurately. Recently whole exome sequencing (WES) has become primary strategy for sequencing patient samples and study their genomics aberrations. However, compared to whole genome sequencing, WES introduces more biases and noise that make CNV detection very challenging. Additionally, tumors’ complexity makes the detection of cancer specific CNVs even more difficult. Although many CNV detection tools have been developed since introducing NGS data, there are few tools for somatic CNV detection for WES data in cancer. RESULTS: In this study, we evaluated the performance of the most recent and commonly used CNV detection tools for WES data in cancer to address their limitations and provide guidelines for developing new ones. We focused on the tools that have been designed or have the ability to detect cancer somatic aberrations. We compared the performance of the tools in terms of sensitivity and false discovery rate (FDR) using real data and simulated data. Comparative analysis of the results of the tools showed that there is a low consensus among the tools in calling CNVs. Using real data, tools show moderate sensitivity (~50% - ~80%), fair specificity (~70% - ~94%) and poor FDRs (~27% - ~60%). Also, using simulated data we observed that increasing the coverage more than 10× in exonic regions does not improve the detection power of the tools significantly. CONCLUSIONS: The limited performance of the current CNV detection tools for WES data in cancer indicates the need for developing more efficient and precise CNV detection methods. Due to the complexity of tumors and high level of noise and biases in WES data, employing advanced novel segmentation, normalization and de-noising techniques that are designed specifically for cancer data is necessary. Also, CNV detection development suffers from the lack of a gold standard for performance evaluation. Finally, developing tools with user-friendly user interfaces and visualization features can enhance CNV studies for a broader range of users. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1705-x) contains supplementary material, which is available to authorized users. BioMed Central 2017-05-31 /pmc/articles/PMC5452530/ /pubmed/28569140 http://dx.doi.org/10.1186/s12859-017-1705-x Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Zare, Fatima
Dow, Michelle
Monteleone, Nicholas
Hosny, Abdelrahman
Nabavi, Sheida
An evaluation of copy number variation detection tools for cancer using whole exome sequencing data
title An evaluation of copy number variation detection tools for cancer using whole exome sequencing data
title_full An evaluation of copy number variation detection tools for cancer using whole exome sequencing data
title_fullStr An evaluation of copy number variation detection tools for cancer using whole exome sequencing data
title_full_unstemmed An evaluation of copy number variation detection tools for cancer using whole exome sequencing data
title_short An evaluation of copy number variation detection tools for cancer using whole exome sequencing data
title_sort evaluation of copy number variation detection tools for cancer using whole exome sequencing data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5452530/
https://www.ncbi.nlm.nih.gov/pubmed/28569140
http://dx.doi.org/10.1186/s12859-017-1705-x
work_keys_str_mv AT zarefatima anevaluationofcopynumbervariationdetectiontoolsforcancerusingwholeexomesequencingdata
AT dowmichelle anevaluationofcopynumbervariationdetectiontoolsforcancerusingwholeexomesequencingdata
AT monteleonenicholas anevaluationofcopynumbervariationdetectiontoolsforcancerusingwholeexomesequencingdata
AT hosnyabdelrahman anevaluationofcopynumbervariationdetectiontoolsforcancerusingwholeexomesequencingdata
AT nabavisheida anevaluationofcopynumbervariationdetectiontoolsforcancerusingwholeexomesequencingdata
AT zarefatima evaluationofcopynumbervariationdetectiontoolsforcancerusingwholeexomesequencingdata
AT dowmichelle evaluationofcopynumbervariationdetectiontoolsforcancerusingwholeexomesequencingdata
AT monteleonenicholas evaluationofcopynumbervariationdetectiontoolsforcancerusingwholeexomesequencingdata
AT hosnyabdelrahman evaluationofcopynumbervariationdetectiontoolsforcancerusingwholeexomesequencingdata
AT nabavisheida evaluationofcopynumbervariationdetectiontoolsforcancerusingwholeexomesequencingdata