
MultiFacTV: module detection from higher-order time series biological data

BACKGROUND: Identifying modules from time series biological data helps us understand biological functionalities of a group of proteins/genes interacting together and how responses of these proteins/genes dynamically change with respect to time. With rapid acquisition of time series biological data f...


Bibliographic Details
Main authors: Li, Xutao; Ye, Yunming; Ng, Michael; Wu, Qingyao
Format: Online Article Text
Language: English
Published: BioMed Central 2013
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3856496/
https://www.ncbi.nlm.nih.gov/pubmed/24268038
http://dx.doi.org/10.1186/1471-2164-14-S4-S2
_version_ 1782295072493010944
author Li, Xutao
Ye, Yunming
Ng, Michael
Wu, Qingyao
author_facet Li, Xutao
Ye, Yunming
Ng, Michael
Wu, Qingyao
author_sort Li, Xutao
collection PubMed
description BACKGROUND: Identifying modules from time series biological data helps us understand the biological functions of a group of interacting proteins/genes and how the responses of these proteins/genes change over time. With the rapid acquisition of time series biological data from different laboratories and databases, the identification task faces new challenges, and powerful methods that can detect modules through integrative analysis are urgently needed. To accomplish such integrative analysis, we assemble multiple time series biological datasets into a higher-order form, e.g., a gene × condition × time tensor. It is interesting and useful to develop methods to identify modules from this tensor. RESULTS: In this paper, we present MultiFacTV, a new method to find modules from higher-order time series biological data. This method employs a tensor factorization objective function that incorporates a time-related total variation regularization term. Based on the factorization results, MultiFacTV extracts modules that are composed of some genes, conditions and time-points. We have run MultiFacTV on synthetic datasets, and the results show that MultiFacTV outperforms the existing methods EDISA and Metafac. Moreover, we have applied MultiFacTV to an Arabidopsis thaliana root (shoot) tissue dataset represented as a gene × condition × time tensor of size 2395 × 9 × 6 (3454 × 8 × 6), and to a yeast dataset and a Homo sapiens dataset represented as tensors of sizes 4425 × 6 × 6 and 2920 × 14 × 9, respectively. The results show that MultiFacTV indeed identifies some interesting modules in these datasets, which have been validated and explained by Gene Ontology analysis with DAVID or other analysis. CONCLUSION: Experimental results on both synthetic and real datasets show that the proposed MultiFacTV is effective in identifying modules in higher-order time series biological data. Compared to traditional non-integrative analysis methods, it provides a more comprehensive and better view of biological processes, since modules composed of more than two types of biological variables can be identified and analyzed.
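The description above mentions two ingredients: stacking several time series datasets into a gene × condition × time tensor, and a factorization objective with a total variation penalty along the time axis. The sketch below illustrates both in NumPy under stated assumptions. This record does not give the exact MultiFacTV objective, so the rank-R CP-style reconstruction, the squared-error loss, the weight `lam`, and all function names here are hypothetical illustrations, not the authors' implementation.

```python
import numpy as np

def stack_time_series(matrices):
    """Stack per-condition (gene x time) matrices into a gene x condition x time tensor."""
    return np.stack(matrices, axis=1)

def cp_reconstruct(G, C, T):
    """Rebuild a tensor from rank-R factors: X[i, j, k] = sum_r G[i, r] * C[j, r] * T[k, r]."""
    return np.einsum("ir,jr,kr->ijk", G, C, T)

def total_variation(T):
    """Sum of absolute differences between consecutive time points, over all components."""
    return np.abs(np.diff(T, axis=0)).sum()

def objective(X, G, C, T, lam=0.1):
    """Assumed objective: squared reconstruction error plus a TV penalty on the time factor."""
    residual = X - cp_reconstruct(G, C, T)
    return 0.5 * np.sum(residual ** 2) + lam * total_variation(T)

# Toy example: 5 genes, 3 conditions, 6 time points, rank 2 (random placeholder data).
rng = np.random.default_rng(0)
X = stack_time_series([rng.normal(size=(5, 6)) for _ in range(3)])
G, C, T = rng.normal(size=(5, 2)), rng.normal(size=(3, 2)), rng.normal(size=(6, 2))
print(X.shape, objective(X, G, C, T))
```

The total variation term is small when each time factor is piecewise constant, so a penalty of this kind favors components that switch on or off over contiguous time windows rather than fluctuating at every time point.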
format Online
Article
Text
id pubmed-3856496
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38564962013-12-16 MultiFacTV: module detection from higher-order time series biological data Li, Xutao Ye, Yunming Ng, Michael Wu, Qingyao BMC Genomics Research BioMed Central 2013-10-01 /pmc/articles/PMC3856496/ /pubmed/24268038 http://dx.doi.org/10.1186/1471-2164-14-S4-S2 Text en Copyright © 2013 Li et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Li, Xutao
Ye, Yunming
Ng, Michael
Wu, Qingyao
MultiFacTV: module detection from higher-order time series biological data
title MultiFacTV: module detection from higher-order time series biological data
title_full MultiFacTV: module detection from higher-order time series biological data
title_fullStr MultiFacTV: module detection from higher-order time series biological data
title_full_unstemmed MultiFacTV: module detection from higher-order time series biological data
title_short MultiFacTV: module detection from higher-order time series biological data
title_sort multifactv: module detection from higher-order time series biological data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3856496/
https://www.ncbi.nlm.nih.gov/pubmed/24268038
http://dx.doi.org/10.1186/1471-2164-14-S4-S2
work_keys_str_mv AT lixutao multifactvmoduledetectionfromhigherordertimeseriesbiologicaldata
AT yeyunming multifactvmoduledetectionfromhigherordertimeseriesbiologicaldata
AT ngmichael multifactvmoduledetectionfromhigherordertimeseriesbiologicaldata
AT wuqingyao multifactvmoduledetectionfromhigherordertimeseriesbiologicaldata