Cargando…
An evaluation of copy number variation detection tools for cancer using whole exome sequencing data
BACKGROUND: Recently copy number variation (CNV) has gained considerable interest as a type of genomic/genetic variation that plays an important role in disease susceptibility. Advances in sequencing technology have created an opportunity for detecting CNVs more accurately. Recently whole exome sequ...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5452530/ https://www.ncbi.nlm.nih.gov/pubmed/28569140 http://dx.doi.org/10.1186/s12859-017-1705-x |
_version_ | 1783240451114926080 |
---|---|
author | Zare, Fatima Dow, Michelle Monteleone, Nicholas Hosny, Abdelrahman Nabavi, Sheida |
author_facet | Zare, Fatima Dow, Michelle Monteleone, Nicholas Hosny, Abdelrahman Nabavi, Sheida |
author_sort | Zare, Fatima |
collection | PubMed |
description | BACKGROUND: Recently copy number variation (CNV) has gained considerable interest as a type of genomic/genetic variation that plays an important role in disease susceptibility. Advances in sequencing technology have created an opportunity for detecting CNVs more accurately. Recently whole exome sequencing (WES) has become primary strategy for sequencing patient samples and study their genomics aberrations. However, compared to whole genome sequencing, WES introduces more biases and noise that make CNV detection very challenging. Additionally, tumors’ complexity makes the detection of cancer specific CNVs even more difficult. Although many CNV detection tools have been developed since introducing NGS data, there are few tools for somatic CNV detection for WES data in cancer. RESULTS: In this study, we evaluated the performance of the most recent and commonly used CNV detection tools for WES data in cancer to address their limitations and provide guidelines for developing new ones. We focused on the tools that have been designed or have the ability to detect cancer somatic aberrations. We compared the performance of the tools in terms of sensitivity and false discovery rate (FDR) using real data and simulated data. Comparative analysis of the results of the tools showed that there is a low consensus among the tools in calling CNVs. Using real data, tools show moderate sensitivity (~50% - ~80%), fair specificity (~70% - ~94%) and poor FDRs (~27% - ~60%). Also, using simulated data we observed that increasing the coverage more than 10× in exonic regions does not improve the detection power of the tools significantly. CONCLUSIONS: The limited performance of the current CNV detection tools for WES data in cancer indicates the need for developing more efficient and precise CNV detection methods. Due to the complexity of tumors and high level of noise and biases in WES data, employing advanced novel segmentation, normalization and de-noising techniques that are designed specifically for cancer data is necessary. Also, CNV detection development suffers from the lack of a gold standard for performance evaluation. Finally, developing tools with user-friendly user interfaces and visualization features can enhance CNV studies for a broader range of users. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1705-x) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5452530 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-54525302017-06-02 An evaluation of copy number variation detection tools for cancer using whole exome sequencing data Zare, Fatima Dow, Michelle Monteleone, Nicholas Hosny, Abdelrahman Nabavi, Sheida BMC Bioinformatics Research Article BACKGROUND: Recently copy number variation (CNV) has gained considerable interest as a type of genomic/genetic variation that plays an important role in disease susceptibility. Advances in sequencing technology have created an opportunity for detecting CNVs more accurately. Recently whole exome sequencing (WES) has become primary strategy for sequencing patient samples and study their genomics aberrations. However, compared to whole genome sequencing, WES introduces more biases and noise that make CNV detection very challenging. Additionally, tumors’ complexity makes the detection of cancer specific CNVs even more difficult. Although many CNV detection tools have been developed since introducing NGS data, there are few tools for somatic CNV detection for WES data in cancer. RESULTS: In this study, we evaluated the performance of the most recent and commonly used CNV detection tools for WES data in cancer to address their limitations and provide guidelines for developing new ones. We focused on the tools that have been designed or have the ability to detect cancer somatic aberrations. We compared the performance of the tools in terms of sensitivity and false discovery rate (FDR) using real data and simulated data. Comparative analysis of the results of the tools showed that there is a low consensus among the tools in calling CNVs. Using real data, tools show moderate sensitivity (~50% - ~80%), fair specificity (~70% - ~94%) and poor FDRs (~27% - ~60%). Also, using simulated data we observed that increasing the coverage more than 10× in exonic regions does not improve the detection power of the tools significantly. CONCLUSIONS: The limited performance of the current CNV detection tools for WES data in cancer indicates the need for developing more efficient and precise CNV detection methods. Due to the complexity of tumors and high level of noise and biases in WES data, employing advanced novel segmentation, normalization and de-noising techniques that are designed specifically for cancer data is necessary. Also, CNV detection development suffers from the lack of a gold standard for performance evaluation. Finally, developing tools with user-friendly user interfaces and visualization features can enhance CNV studies for a broader range of users. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1705-x) contains supplementary material, which is available to authorized users. BioMed Central 2017-05-31 /pmc/articles/PMC5452530/ /pubmed/28569140 http://dx.doi.org/10.1186/s12859-017-1705-x Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Zare, Fatima Dow, Michelle Monteleone, Nicholas Hosny, Abdelrahman Nabavi, Sheida An evaluation of copy number variation detection tools for cancer using whole exome sequencing data |
title | An evaluation of copy number variation detection tools for cancer using whole exome sequencing data |
title_full | An evaluation of copy number variation detection tools for cancer using whole exome sequencing data |
title_fullStr | An evaluation of copy number variation detection tools for cancer using whole exome sequencing data |
title_full_unstemmed | An evaluation of copy number variation detection tools for cancer using whole exome sequencing data |
title_short | An evaluation of copy number variation detection tools for cancer using whole exome sequencing data |
title_sort | evaluation of copy number variation detection tools for cancer using whole exome sequencing data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5452530/ https://www.ncbi.nlm.nih.gov/pubmed/28569140 http://dx.doi.org/10.1186/s12859-017-1705-x |
work_keys_str_mv | AT zarefatima anevaluationofcopynumbervariationdetectiontoolsforcancerusingwholeexomesequencingdata AT dowmichelle anevaluationofcopynumbervariationdetectiontoolsforcancerusingwholeexomesequencingdata AT monteleonenicholas anevaluationofcopynumbervariationdetectiontoolsforcancerusingwholeexomesequencingdata AT hosnyabdelrahman anevaluationofcopynumbervariationdetectiontoolsforcancerusingwholeexomesequencingdata AT nabavisheida anevaluationofcopynumbervariationdetectiontoolsforcancerusingwholeexomesequencingdata AT zarefatima evaluationofcopynumbervariationdetectiontoolsforcancerusingwholeexomesequencingdata AT dowmichelle evaluationofcopynumbervariationdetectiontoolsforcancerusingwholeexomesequencingdata AT monteleonenicholas evaluationofcopynumbervariationdetectiontoolsforcancerusingwholeexomesequencingdata AT hosnyabdelrahman evaluationofcopynumbervariationdetectiontoolsforcancerusingwholeexomesequencingdata AT nabavisheida evaluationofcopynumbervariationdetectiontoolsforcancerusingwholeexomesequencingdata |