Cargando…

Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data

BACKGROUND: With advances in technologies, huge amounts of multiple types of high-throughput genomics data are available. These data have tremendous potential to identify new and clinically valuable biomarkers to guide the diagnosis, assessment of prognosis, and treatment of complex diseases, such a...

Descripción completa

Detalles Bibliográficos
Autor principal:	Nabavi, Sheida
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2016
Materias:	Methodology Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4986197/ https://www.ncbi.nlm.nih.gov/pubmed/27526849 http://dx.doi.org/10.1186/s12864-016-2942-5

_version_	1782448164762025984
author	Nabavi, Sheida
author_facet	Nabavi, Sheida
author_sort	Nabavi, Sheida
collection	PubMed
description	BACKGROUND: With advances in technologies, huge amounts of multiple types of high-throughput genomics data are available. These data have tremendous potential to identify new and clinically valuable biomarkers to guide the diagnosis, assessment of prognosis, and treatment of complex diseases, such as cancer. Integrating, analyzing, and interpreting big and noisy genomics data to obtain biologically meaningful results, however, remains highly challenging. Mining genomics datasets by utilizing advanced computational methods can help to address these issues. RESULTS: To facilitate the identification of a short list of biologically meaningful genes as candidate drivers of anti-cancer drug resistance from an enormous amount of heterogeneous data, we employed statistical machine-learning techniques and integrated genomics datasets. We developed a computational method that integrates gene expression, somatic mutation, and copy number aberration data of sensitive and resistant tumors. In this method, an integrative method based on module network analysis is applied to identify potential driver genes. This is followed by cross-validation and a comparison of the results of sensitive and resistance groups to obtain the final list of candidate biomarkers. We applied this method to the ovarian cancer data from the cancer genome atlas. The final result contains biologically relevant genes, such as COL11A1, which has been reported as a cis-platinum resistant biomarker for epithelial ovarian carcinoma in several recent studies. CONCLUSIONS: The described method yields a short list of aberrant genes that also control the expression of their co-regulated genes. The results suggest that the unbiased data driven computational method can identify biologically relevant candidate biomarkers. It can be utilized in a wide range of applications that compare two conditions with highly heterogeneous datasets. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2942-5) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-4986197
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-49861972016-08-17 Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data Nabavi, Sheida BMC Genomics Methodology Article BACKGROUND: With advances in technologies, huge amounts of multiple types of high-throughput genomics data are available. These data have tremendous potential to identify new and clinically valuable biomarkers to guide the diagnosis, assessment of prognosis, and treatment of complex diseases, such as cancer. Integrating, analyzing, and interpreting big and noisy genomics data to obtain biologically meaningful results, however, remains highly challenging. Mining genomics datasets by utilizing advanced computational methods can help to address these issues. RESULTS: To facilitate the identification of a short list of biologically meaningful genes as candidate drivers of anti-cancer drug resistance from an enormous amount of heterogeneous data, we employed statistical machine-learning techniques and integrated genomics datasets. We developed a computational method that integrates gene expression, somatic mutation, and copy number aberration data of sensitive and resistant tumors. In this method, an integrative method based on module network analysis is applied to identify potential driver genes. This is followed by cross-validation and a comparison of the results of sensitive and resistance groups to obtain the final list of candidate biomarkers. We applied this method to the ovarian cancer data from the cancer genome atlas. The final result contains biologically relevant genes, such as COL11A1, which has been reported as a cis-platinum resistant biomarker for epithelial ovarian carcinoma in several recent studies. CONCLUSIONS: The described method yields a short list of aberrant genes that also control the expression of their co-regulated genes. The results suggest that the unbiased data driven computational method can identify biologically relevant candidate biomarkers. It can be utilized in a wide range of applications that compare two conditions with highly heterogeneous datasets. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2942-5) contains supplementary material, which is available to authorized users. BioMed Central 2016-08-15 /pmc/articles/PMC4986197/ /pubmed/27526849 http://dx.doi.org/10.1186/s12864-016-2942-5 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Methodology Article Nabavi, Sheida Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data
title	Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data
title_full	Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data
title_fullStr	Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data
title_full_unstemmed	Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data
title_short	Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data
title_sort	identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data
topic	Methodology Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4986197/ https://www.ncbi.nlm.nih.gov/pubmed/27526849 http://dx.doi.org/10.1186/s12864-016-2942-5
work_keys_str_mv	AT nabavisheida identifyingcandidatedriversofdrugresponseinheterogeneouscancerbymininghighthroughputgenomicsdata

Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data

Ejemplares similares