Cargando…

Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data

BACKGROUND: With advances in technologies, huge amounts of multiple types of high-throughput genomics data are available. These data have tremendous potential to identify new and clinically valuable biomarkers to guide the diagnosis, assessment of prognosis, and treatment of complex diseases, such a...

Descripción completa

Detalles Bibliográficos
Autor principal: Nabavi, Sheida
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4986197/
https://www.ncbi.nlm.nih.gov/pubmed/27526849
http://dx.doi.org/10.1186/s12864-016-2942-5
_version_ 1782448164762025984
author Nabavi, Sheida
author_facet Nabavi, Sheida
author_sort Nabavi, Sheida
collection PubMed
description BACKGROUND: With advances in technologies, huge amounts of multiple types of high-throughput genomics data are available. These data have tremendous potential to identify new and clinically valuable biomarkers to guide the diagnosis, assessment of prognosis, and treatment of complex diseases, such as cancer. Integrating, analyzing, and interpreting big and noisy genomics data to obtain biologically meaningful results, however, remains highly challenging. Mining genomics datasets by utilizing advanced computational methods can help to address these issues. RESULTS: To facilitate the identification of a short list of biologically meaningful genes as candidate drivers of anti-cancer drug resistance from an enormous amount of heterogeneous data, we employed statistical machine-learning techniques and integrated genomics datasets. We developed a computational method that integrates gene expression, somatic mutation, and copy number aberration data of sensitive and resistant tumors. In this method, an integrative method based on module network analysis is applied to identify potential driver genes. This is followed by cross-validation and a comparison of the results of sensitive and resistance groups to obtain the final list of candidate biomarkers. We applied this method to the ovarian cancer data from the cancer genome atlas. The final result contains biologically relevant genes, such as COL11A1, which has been reported as a cis-platinum resistant biomarker for epithelial ovarian carcinoma in several recent studies. CONCLUSIONS: The described method yields a short list of aberrant genes that also control the expression of their co-regulated genes. The results suggest that the unbiased data driven computational method can identify biologically relevant candidate biomarkers. It can be utilized in a wide range of applications that compare two conditions with highly heterogeneous datasets. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2942-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4986197
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49861972016-08-17 Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data Nabavi, Sheida BMC Genomics Methodology Article BACKGROUND: With advances in technologies, huge amounts of multiple types of high-throughput genomics data are available. These data have tremendous potential to identify new and clinically valuable biomarkers to guide the diagnosis, assessment of prognosis, and treatment of complex diseases, such as cancer. Integrating, analyzing, and interpreting big and noisy genomics data to obtain biologically meaningful results, however, remains highly challenging. Mining genomics datasets by utilizing advanced computational methods can help to address these issues. RESULTS: To facilitate the identification of a short list of biologically meaningful genes as candidate drivers of anti-cancer drug resistance from an enormous amount of heterogeneous data, we employed statistical machine-learning techniques and integrated genomics datasets. We developed a computational method that integrates gene expression, somatic mutation, and copy number aberration data of sensitive and resistant tumors. In this method, an integrative method based on module network analysis is applied to identify potential driver genes. This is followed by cross-validation and a comparison of the results of sensitive and resistance groups to obtain the final list of candidate biomarkers. We applied this method to the ovarian cancer data from the cancer genome atlas. The final result contains biologically relevant genes, such as COL11A1, which has been reported as a cis-platinum resistant biomarker for epithelial ovarian carcinoma in several recent studies. CONCLUSIONS: The described method yields a short list of aberrant genes that also control the expression of their co-regulated genes. The results suggest that the unbiased data driven computational method can identify biologically relevant candidate biomarkers. It can be utilized in a wide range of applications that compare two conditions with highly heterogeneous datasets. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2942-5) contains supplementary material, which is available to authorized users. BioMed Central 2016-08-15 /pmc/articles/PMC4986197/ /pubmed/27526849 http://dx.doi.org/10.1186/s12864-016-2942-5 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Nabavi, Sheida
Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data
title Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data
title_full Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data
title_fullStr Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data
title_full_unstemmed Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data
title_short Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data
title_sort identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4986197/
https://www.ncbi.nlm.nih.gov/pubmed/27526849
http://dx.doi.org/10.1186/s12864-016-2942-5
work_keys_str_mv AT nabavisheida identifyingcandidatedriversofdrugresponseinheterogeneouscancerbymininghighthroughputgenomicsdata