Cargando…

MiningABs: mining associated biomarkers across multi-connected gene expression datasets

BACKGROUND: Human disease often arises as a consequence of alterations in a set of associated genes rather than alterations to a set of unassociated individual genes. Most previous microarray-based meta-analyses identified disease-associated genes or biomarkers independent of genetic interactions. T...

Descripción completa

Detalles Bibliográficos
Autores principales:	Cheng, Chun-Pei, DeBoever, Christopher, Frazer, Kelly A, Liu, Yu-Cheng, Tseng, Vincent S
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2014
Materias:	Methodology Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4068973/ https://www.ncbi.nlm.nih.gov/pubmed/24909518 http://dx.doi.org/10.1186/1471-2105-15-173

_version_	1782322482280136704
author	Cheng, Chun-Pei DeBoever, Christopher Frazer, Kelly A Liu, Yu-Cheng Tseng, Vincent S
author_facet	Cheng, Chun-Pei DeBoever, Christopher Frazer, Kelly A Liu, Yu-Cheng Tseng, Vincent S
author_sort	Cheng, Chun-Pei
collection	PubMed
description	BACKGROUND: Human disease often arises as a consequence of alterations in a set of associated genes rather than alterations to a set of unassociated individual genes. Most previous microarray-based meta-analyses identified disease-associated genes or biomarkers independent of genetic interactions. Therefore, in this study, we present the first meta-analysis method capable of taking gene combination effects into account to efficiently identify associated biomarkers (ABs) across different microarray platforms. RESULTS: We propose a new meta-analysis approach called MiningABs to mine ABs across different array-based datasets. The similarity between paired probe sequences is quantified as a bridge to connect these datasets together. The ABs can be subsequently identified from an “improved” common logit model (c-LM) by combining several sibling-like LMs in a heuristic genetic algorithm selection process. Our approach is evaluated with two sets of gene expression datasets: i) 4 esophageal squamous cell carcinoma and ii) 3 hepatocellular carcinoma datasets. Based on an unbiased reciprocal test, we demonstrate that each gene in a group of ABs is required to maintain high cancer sample classification accuracy, and we observe that ABs are not limited to genes common to all platforms. Investigating the ABs using Gene Ontology (GO) enrichment, literature survey, and network analyses indicated that our ABs are not only strongly related to cancer development but also highly connected in a diverse network of biological interactions. CONCLUSIONS: The proposed meta-analysis method called MiningABs is able to efficiently identify ABs from different independently performed array-based datasets, and we show its validity in cancer biology via GO enrichment, literature survey and network analyses. We postulate that the ABs may facilitate novel target and drug discovery, leading to improved clinical treatment. Java source code, tutorial, example and related materials are available at “http://sourceforge.net/projects/miningabs/”.
format	Online Article Text
id	pubmed-4068973
institution	National Center for Biotechnology Information
language	English
publishDate	2014
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-40689732014-06-27 MiningABs: mining associated biomarkers across multi-connected gene expression datasets Cheng, Chun-Pei DeBoever, Christopher Frazer, Kelly A Liu, Yu-Cheng Tseng, Vincent S BMC Bioinformatics Methodology Article BACKGROUND: Human disease often arises as a consequence of alterations in a set of associated genes rather than alterations to a set of unassociated individual genes. Most previous microarray-based meta-analyses identified disease-associated genes or biomarkers independent of genetic interactions. Therefore, in this study, we present the first meta-analysis method capable of taking gene combination effects into account to efficiently identify associated biomarkers (ABs) across different microarray platforms. RESULTS: We propose a new meta-analysis approach called MiningABs to mine ABs across different array-based datasets. The similarity between paired probe sequences is quantified as a bridge to connect these datasets together. The ABs can be subsequently identified from an “improved” common logit model (c-LM) by combining several sibling-like LMs in a heuristic genetic algorithm selection process. Our approach is evaluated with two sets of gene expression datasets: i) 4 esophageal squamous cell carcinoma and ii) 3 hepatocellular carcinoma datasets. Based on an unbiased reciprocal test, we demonstrate that each gene in a group of ABs is required to maintain high cancer sample classification accuracy, and we observe that ABs are not limited to genes common to all platforms. Investigating the ABs using Gene Ontology (GO) enrichment, literature survey, and network analyses indicated that our ABs are not only strongly related to cancer development but also highly connected in a diverse network of biological interactions. CONCLUSIONS: The proposed meta-analysis method called MiningABs is able to efficiently identify ABs from different independently performed array-based datasets, and we show its validity in cancer biology via GO enrichment, literature survey and network analyses. We postulate that the ABs may facilitate novel target and drug discovery, leading to improved clinical treatment. Java source code, tutorial, example and related materials are available at “http://sourceforge.net/projects/miningabs/”. BioMed Central 2014-06-08 /pmc/articles/PMC4068973/ /pubmed/24909518 http://dx.doi.org/10.1186/1471-2105-15-173 Text en Copyright © 2014 Cheng et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Methodology Article Cheng, Chun-Pei DeBoever, Christopher Frazer, Kelly A Liu, Yu-Cheng Tseng, Vincent S MiningABs: mining associated biomarkers across multi-connected gene expression datasets
title	MiningABs: mining associated biomarkers across multi-connected gene expression datasets
title_full	MiningABs: mining associated biomarkers across multi-connected gene expression datasets
title_fullStr	MiningABs: mining associated biomarkers across multi-connected gene expression datasets
title_full_unstemmed	MiningABs: mining associated biomarkers across multi-connected gene expression datasets
title_short	MiningABs: mining associated biomarkers across multi-connected gene expression datasets
title_sort	miningabs: mining associated biomarkers across multi-connected gene expression datasets
topic	Methodology Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4068973/ https://www.ncbi.nlm.nih.gov/pubmed/24909518 http://dx.doi.org/10.1186/1471-2105-15-173
work_keys_str_mv	AT chengchunpei miningabsminingassociatedbiomarkersacrossmulticonnectedgeneexpressiondatasets AT deboeverchristopher miningabsminingassociatedbiomarkersacrossmulticonnectedgeneexpressiondatasets AT frazerkellya miningabsminingassociatedbiomarkersacrossmulticonnectedgeneexpressiondatasets AT liuyucheng miningabsminingassociatedbiomarkersacrossmulticonnectedgeneexpressiondatasets AT tsengvincents miningabsminingassociatedbiomarkersacrossmulticonnectedgeneexpressiondatasets

MiningABs: mining associated biomarkers across multi-connected gene expression datasets

Ejemplares similares