Cargando…

Integrative random forest for gene regulatory network inference

Motivation: Gene regulatory network (GRN) inference based on genomic data is one of the most actively pursued computational biological problems. Because different types of biological data usually provide complementary information regarding the underlying GRN, a model that integrates big data of dive...

Descripción completa

Detalles Bibliográficos
Autores principales: Petralia, Francesca, Wang, Pei, Yang, Jialiang, Tu, Zhidong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4542785/
https://www.ncbi.nlm.nih.gov/pubmed/26072483
http://dx.doi.org/10.1093/bioinformatics/btv268
_version_ 1782386562752839680
author Petralia, Francesca
Wang, Pei
Yang, Jialiang
Tu, Zhidong
author_facet Petralia, Francesca
Wang, Pei
Yang, Jialiang
Tu, Zhidong
author_sort Petralia, Francesca
collection PubMed
description Motivation: Gene regulatory network (GRN) inference based on genomic data is one of the most actively pursued computational biological problems. Because different types of biological data usually provide complementary information regarding the underlying GRN, a model that integrates big data of diverse types is expected to increase both the power and accuracy of GRN inference. Towards this goal, we propose a novel algorithm named iRafNet: integrative random forest for gene regulatory network inference. Results: iRafNet is a flexible, unified integrative framework that allows information from heterogeneous data, such as protein–protein interactions, transcription factor (TF)-DNA-binding, gene knock-down, to be jointly considered for GRN inference. Using test data from the DREAM4 and DREAM5 challenges, we demonstrate that iRafNet outperforms the original random forest based network inference algorithm (GENIE3), and is highly comparable to the community learning approach. We apply iRafNet to construct GRN in Saccharomyces cerevisiae and demonstrate that it improves the performance in predicting TF-target gene regulations and provides additional functional insights to the predicted gene regulations. Availability and implementation: The R code of iRafNet implementation and a tutorial are available at: http://research.mssm.edu/tulab/software/irafnet.html Contact: zhidong.tu@mssm.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4542785
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-45427852015-08-25 Integrative random forest for gene regulatory network inference Petralia, Francesca Wang, Pei Yang, Jialiang Tu, Zhidong Bioinformatics Ismb/Eccb 2015 Proceedings Papers Committee July 10 to July 14, 2015, Dublin, Ireland Motivation: Gene regulatory network (GRN) inference based on genomic data is one of the most actively pursued computational biological problems. Because different types of biological data usually provide complementary information regarding the underlying GRN, a model that integrates big data of diverse types is expected to increase both the power and accuracy of GRN inference. Towards this goal, we propose a novel algorithm named iRafNet: integrative random forest for gene regulatory network inference. Results: iRafNet is a flexible, unified integrative framework that allows information from heterogeneous data, such as protein–protein interactions, transcription factor (TF)-DNA-binding, gene knock-down, to be jointly considered for GRN inference. Using test data from the DREAM4 and DREAM5 challenges, we demonstrate that iRafNet outperforms the original random forest based network inference algorithm (GENIE3), and is highly comparable to the community learning approach. We apply iRafNet to construct GRN in Saccharomyces cerevisiae and demonstrate that it improves the performance in predicting TF-target gene regulations and provides additional functional insights to the predicted gene regulations. Availability and implementation: The R code of iRafNet implementation and a tutorial are available at: http://research.mssm.edu/tulab/software/irafnet.html Contact: zhidong.tu@mssm.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2015-06-15 2015-06-10 /pmc/articles/PMC4542785/ /pubmed/26072483 http://dx.doi.org/10.1093/bioinformatics/btv268 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License(http://creativecommons.org/licenses/by-nc/3.0/),which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb/Eccb 2015 Proceedings Papers Committee July 10 to July 14, 2015, Dublin, Ireland
Petralia, Francesca
Wang, Pei
Yang, Jialiang
Tu, Zhidong
Integrative random forest for gene regulatory network inference
title Integrative random forest for gene regulatory network inference
title_full Integrative random forest for gene regulatory network inference
title_fullStr Integrative random forest for gene regulatory network inference
title_full_unstemmed Integrative random forest for gene regulatory network inference
title_short Integrative random forest for gene regulatory network inference
title_sort integrative random forest for gene regulatory network inference
topic Ismb/Eccb 2015 Proceedings Papers Committee July 10 to July 14, 2015, Dublin, Ireland
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4542785/
https://www.ncbi.nlm.nih.gov/pubmed/26072483
http://dx.doi.org/10.1093/bioinformatics/btv268
work_keys_str_mv AT petraliafrancesca integrativerandomforestforgeneregulatorynetworkinference
AT wangpei integrativerandomforestforgeneregulatorynetworkinference
AT yangjialiang integrativerandomforestforgeneregulatorynetworkinference
AT tuzhidong integrativerandomforestforgeneregulatorynetworkinference