Cargando…

Mining differential top-k co-expression patterns from time course comparative gene expression datasets

BACKGROUND: Frequent pattern mining analysis applied on microarray dataset appears to be a promising strategy for identifying relationships between gene expression levels. Unfortunately, too many itemsets (co-expressed genes) are identified by this analysis method since it does not consider the impo...

Descripción completa

Detalles Bibliográficos
Autores principales:	Liu, Yu-Cheng, Cheng, Chun-Pei, Tseng, Vincent S
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2013
Materias:	Methodology Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3751367/ https://www.ncbi.nlm.nih.gov/pubmed/23870110 http://dx.doi.org/10.1186/1471-2105-14-230

_version_	1782281585673895936
author	Liu, Yu-Cheng Cheng, Chun-Pei Tseng, Vincent S
author_facet	Liu, Yu-Cheng Cheng, Chun-Pei Tseng, Vincent S
author_sort	Liu, Yu-Cheng
collection	PubMed
description	BACKGROUND: Frequent pattern mining analysis applied on microarray dataset appears to be a promising strategy for identifying relationships between gene expression levels. Unfortunately, too many itemsets (co-expressed genes) are identified by this analysis method since it does not consider the importance of each gene within biological processes to a cellular response and does not take into account temporal properties under biological treatment-control matched conditions in a microarray dataset. RESULTS: We propose a method termed TIIM (Top-k Impactful Itemsets Miner), which only requires specifying a user-defined number k to explore the top k itemsets with the most significantly differentially co-expressed genes between 2 conditions in a time course. To give genes different weights, a table with impact degrees for each gene was constructed based on the number of neighboring genes that are differently expressed in the dataset within gene regulatory networks. Finally, the resulting top-k impactful itemsets were manually evaluated using previous literature and analyzed by a Gene Ontology enrichment method. CONCLUSIONS: In this study, the proposed method was evaluated in 2 publicly available time course microarray datasets with 2 different experimental conditions. Both datasets identified potential itemsets with co-expressed genes evaluated from the literature and showed higher accuracies compared to the 2 corresponding control methods: i) performing TIIM without considering the gene expression differentiation between 2 different experimental conditions and impact degrees, and ii) performing TIIM with a constant impact degree for each gene. Our proposed method found that several new gene regulations involved in these itemsets were useful for biologists and provided further insights into the mechanisms underpinning biological processes. The Java source code and other related materials used in this study are available at “http://websystem.csie.ncku.edu.tw/TIIM_Program.rar”.
format	Online Article Text
id	pubmed-3751367
institution	National Center for Biotechnology Information
language	English
publishDate	2013
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-37513672013-08-28 Mining differential top-k co-expression patterns from time course comparative gene expression datasets Liu, Yu-Cheng Cheng, Chun-Pei Tseng, Vincent S BMC Bioinformatics Methodology Article BACKGROUND: Frequent pattern mining analysis applied on microarray dataset appears to be a promising strategy for identifying relationships between gene expression levels. Unfortunately, too many itemsets (co-expressed genes) are identified by this analysis method since it does not consider the importance of each gene within biological processes to a cellular response and does not take into account temporal properties under biological treatment-control matched conditions in a microarray dataset. RESULTS: We propose a method termed TIIM (Top-k Impactful Itemsets Miner), which only requires specifying a user-defined number k to explore the top k itemsets with the most significantly differentially co-expressed genes between 2 conditions in a time course. To give genes different weights, a table with impact degrees for each gene was constructed based on the number of neighboring genes that are differently expressed in the dataset within gene regulatory networks. Finally, the resulting top-k impactful itemsets were manually evaluated using previous literature and analyzed by a Gene Ontology enrichment method. CONCLUSIONS: In this study, the proposed method was evaluated in 2 publicly available time course microarray datasets with 2 different experimental conditions. Both datasets identified potential itemsets with co-expressed genes evaluated from the literature and showed higher accuracies compared to the 2 corresponding control methods: i) performing TIIM without considering the gene expression differentiation between 2 different experimental conditions and impact degrees, and ii) performing TIIM with a constant impact degree for each gene. Our proposed method found that several new gene regulations involved in these itemsets were useful for biologists and provided further insights into the mechanisms underpinning biological processes. The Java source code and other related materials used in this study are available at “http://websystem.csie.ncku.edu.tw/TIIM_Program.rar”. BioMed Central 2013-07-21 /pmc/articles/PMC3751367/ /pubmed/23870110 http://dx.doi.org/10.1186/1471-2105-14-230 Text en Copyright © 2013 Liu et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Methodology Article Liu, Yu-Cheng Cheng, Chun-Pei Tseng, Vincent S Mining differential top-k co-expression patterns from time course comparative gene expression datasets
title	Mining differential top-k co-expression patterns from time course comparative gene expression datasets
title_full	Mining differential top-k co-expression patterns from time course comparative gene expression datasets
title_fullStr	Mining differential top-k co-expression patterns from time course comparative gene expression datasets
title_full_unstemmed	Mining differential top-k co-expression patterns from time course comparative gene expression datasets
title_short	Mining differential top-k co-expression patterns from time course comparative gene expression datasets
title_sort	mining differential top-k co-expression patterns from time course comparative gene expression datasets
topic	Methodology Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3751367/ https://www.ncbi.nlm.nih.gov/pubmed/23870110 http://dx.doi.org/10.1186/1471-2105-14-230
work_keys_str_mv	AT liuyucheng miningdifferentialtopkcoexpressionpatternsfromtimecoursecomparativegeneexpressiondatasets AT chengchunpei miningdifferentialtopkcoexpressionpatternsfromtimecoursecomparativegeneexpressiondatasets AT tsengvincents miningdifferentialtopkcoexpressionpatternsfromtimecoursecomparativegeneexpressiondatasets

Mining differential top-k co-expression patterns from time course comparative gene expression datasets

Ejemplares similares