Cargando…
Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data
BACKGROUND: With advances in technologies, huge amounts of multiple types of high-throughput genomics data are available. These data have tremendous potential to identify new and clinically valuable biomarkers to guide the diagnosis, assessment of prognosis, and treatment of complex diseases, such a...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4986197/ https://www.ncbi.nlm.nih.gov/pubmed/27526849 http://dx.doi.org/10.1186/s12864-016-2942-5 |
_version_ | 1782448164762025984 |
---|---|
author | Nabavi, Sheida |
author_facet | Nabavi, Sheida |
author_sort | Nabavi, Sheida |
collection | PubMed |
description | BACKGROUND: With advances in technologies, huge amounts of multiple types of high-throughput genomics data are available. These data have tremendous potential to identify new and clinically valuable biomarkers to guide the diagnosis, assessment of prognosis, and treatment of complex diseases, such as cancer. Integrating, analyzing, and interpreting big and noisy genomics data to obtain biologically meaningful results, however, remains highly challenging. Mining genomics datasets by utilizing advanced computational methods can help to address these issues. RESULTS: To facilitate the identification of a short list of biologically meaningful genes as candidate drivers of anti-cancer drug resistance from an enormous amount of heterogeneous data, we employed statistical machine-learning techniques and integrated genomics datasets. We developed a computational method that integrates gene expression, somatic mutation, and copy number aberration data of sensitive and resistant tumors. In this method, an integrative method based on module network analysis is applied to identify potential driver genes. This is followed by cross-validation and a comparison of the results of sensitive and resistance groups to obtain the final list of candidate biomarkers. We applied this method to the ovarian cancer data from the cancer genome atlas. The final result contains biologically relevant genes, such as COL11A1, which has been reported as a cis-platinum resistant biomarker for epithelial ovarian carcinoma in several recent studies. CONCLUSIONS: The described method yields a short list of aberrant genes that also control the expression of their co-regulated genes. The results suggest that the unbiased data driven computational method can identify biologically relevant candidate biomarkers. It can be utilized in a wide range of applications that compare two conditions with highly heterogeneous datasets. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2942-5) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4986197 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-49861972016-08-17 Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data Nabavi, Sheida BMC Genomics Methodology Article BACKGROUND: With advances in technologies, huge amounts of multiple types of high-throughput genomics data are available. These data have tremendous potential to identify new and clinically valuable biomarkers to guide the diagnosis, assessment of prognosis, and treatment of complex diseases, such as cancer. Integrating, analyzing, and interpreting big and noisy genomics data to obtain biologically meaningful results, however, remains highly challenging. Mining genomics datasets by utilizing advanced computational methods can help to address these issues. RESULTS: To facilitate the identification of a short list of biologically meaningful genes as candidate drivers of anti-cancer drug resistance from an enormous amount of heterogeneous data, we employed statistical machine-learning techniques and integrated genomics datasets. We developed a computational method that integrates gene expression, somatic mutation, and copy number aberration data of sensitive and resistant tumors. In this method, an integrative method based on module network analysis is applied to identify potential driver genes. This is followed by cross-validation and a comparison of the results of sensitive and resistance groups to obtain the final list of candidate biomarkers. We applied this method to the ovarian cancer data from the cancer genome atlas. The final result contains biologically relevant genes, such as COL11A1, which has been reported as a cis-platinum resistant biomarker for epithelial ovarian carcinoma in several recent studies. CONCLUSIONS: The described method yields a short list of aberrant genes that also control the expression of their co-regulated genes. The results suggest that the unbiased data driven computational method can identify biologically relevant candidate biomarkers. It can be utilized in a wide range of applications that compare two conditions with highly heterogeneous datasets. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2942-5) contains supplementary material, which is available to authorized users. BioMed Central 2016-08-15 /pmc/articles/PMC4986197/ /pubmed/27526849 http://dx.doi.org/10.1186/s12864-016-2942-5 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Nabavi, Sheida Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data |
title | Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data |
title_full | Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data |
title_fullStr | Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data |
title_full_unstemmed | Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data |
title_short | Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data |
title_sort | identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4986197/ https://www.ncbi.nlm.nih.gov/pubmed/27526849 http://dx.doi.org/10.1186/s12864-016-2942-5 |
work_keys_str_mv | AT nabavisheida identifyingcandidatedriversofdrugresponseinheterogeneouscancerbymininghighthroughputgenomicsdata |