Cargando…

A group LASSO-based method for robustly inferring gene regulatory networks from multiple time-course datasets

BACKGROUND: As an abstract mapping of the gene regulations in the cell, gene regulatory network is important to both biological research study and practical applications. The reverse engineering of gene regulatory networks from microarray gene expression data is a challenging research problem in sys...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Li-Zhi, Wu, Fang-Xiang, Zhang, Wen-Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4243122/
https://www.ncbi.nlm.nih.gov/pubmed/25350697
http://dx.doi.org/10.1186/1752-0509-8-S3-S1
_version_ 1782346064151445504
author Liu, Li-Zhi
Wu, Fang-Xiang
Zhang, Wen-Jun
author_facet Liu, Li-Zhi
Wu, Fang-Xiang
Zhang, Wen-Jun
author_sort Liu, Li-Zhi
collection PubMed
description BACKGROUND: As an abstract mapping of the gene regulations in the cell, gene regulatory network is important to both biological research study and practical applications. The reverse engineering of gene regulatory networks from microarray gene expression data is a challenging research problem in systems biology. With the development of biological technologies, multiple time-course gene expression datasets might be collected for a specific gene network under different circumstances. The inference of a gene regulatory network can be improved by integrating these multiple datasets. It is also known that gene expression data may be contaminated with large errors or outliers, which may affect the inference results. RESULTS: A novel method, Huber group LASSO, is proposed to infer the same underlying network topology from multiple time-course gene expression datasets as well as to take the robustness to large error or outliers into account. To solve the optimization problem involved in the proposed method, an efficient algorithm which combines the ideas of auxiliary function minimization and block descent is developed. A stability selection method is adapted to our method to find a network topology consisting of edges with scores. The proposed method is applied to both simulation datasets and real experimental datasets. It shows that Huber group LASSO outperforms the group LASSO in terms of both areas under receiver operating characteristic curves and areas under the precision-recall curves. CONCLUSIONS: The convergence analysis of the algorithm theoretically shows that the sequence generated from the algorithm converges to the optimal solution of the problem. The simulation and real data examples demonstrate the effectiveness of the Huber group LASSO in integrating multiple time-course gene expression datasets and improving the resistance to large errors or outliers.
format Online
Article
Text
id pubmed-4243122
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42431222014-11-26 A group LASSO-based method for robustly inferring gene regulatory networks from multiple time-course datasets Liu, Li-Zhi Wu, Fang-Xiang Zhang, Wen-Jun BMC Syst Biol Research BACKGROUND: As an abstract mapping of the gene regulations in the cell, gene regulatory network is important to both biological research study and practical applications. The reverse engineering of gene regulatory networks from microarray gene expression data is a challenging research problem in systems biology. With the development of biological technologies, multiple time-course gene expression datasets might be collected for a specific gene network under different circumstances. The inference of a gene regulatory network can be improved by integrating these multiple datasets. It is also known that gene expression data may be contaminated with large errors or outliers, which may affect the inference results. RESULTS: A novel method, Huber group LASSO, is proposed to infer the same underlying network topology from multiple time-course gene expression datasets as well as to take the robustness to large error or outliers into account. To solve the optimization problem involved in the proposed method, an efficient algorithm which combines the ideas of auxiliary function minimization and block descent is developed. A stability selection method is adapted to our method to find a network topology consisting of edges with scores. The proposed method is applied to both simulation datasets and real experimental datasets. It shows that Huber group LASSO outperforms the group LASSO in terms of both areas under receiver operating characteristic curves and areas under the precision-recall curves. CONCLUSIONS: The convergence analysis of the algorithm theoretically shows that the sequence generated from the algorithm converges to the optimal solution of the problem. The simulation and real data examples demonstrate the effectiveness of the Huber group LASSO in integrating multiple time-course gene expression datasets and improving the resistance to large errors or outliers. BioMed Central 2014-10-22 /pmc/articles/PMC4243122/ /pubmed/25350697 http://dx.doi.org/10.1186/1752-0509-8-S3-S1 Text en Copyright © 2014 Liu et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Liu, Li-Zhi
Wu, Fang-Xiang
Zhang, Wen-Jun
A group LASSO-based method for robustly inferring gene regulatory networks from multiple time-course datasets
title A group LASSO-based method for robustly inferring gene regulatory networks from multiple time-course datasets
title_full A group LASSO-based method for robustly inferring gene regulatory networks from multiple time-course datasets
title_fullStr A group LASSO-based method for robustly inferring gene regulatory networks from multiple time-course datasets
title_full_unstemmed A group LASSO-based method for robustly inferring gene regulatory networks from multiple time-course datasets
title_short A group LASSO-based method for robustly inferring gene regulatory networks from multiple time-course datasets
title_sort group lasso-based method for robustly inferring gene regulatory networks from multiple time-course datasets
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4243122/
https://www.ncbi.nlm.nih.gov/pubmed/25350697
http://dx.doi.org/10.1186/1752-0509-8-S3-S1
work_keys_str_mv AT liulizhi agrouplassobasedmethodforrobustlyinferringgeneregulatorynetworksfrommultipletimecoursedatasets
AT wufangxiang agrouplassobasedmethodforrobustlyinferringgeneregulatorynetworksfrommultipletimecoursedatasets
AT zhangwenjun agrouplassobasedmethodforrobustlyinferringgeneregulatorynetworksfrommultipletimecoursedatasets
AT liulizhi grouplassobasedmethodforrobustlyinferringgeneregulatorynetworksfrommultipletimecoursedatasets
AT wufangxiang grouplassobasedmethodforrobustlyinferringgeneregulatorynetworksfrommultipletimecoursedatasets
AT zhangwenjun grouplassobasedmethodforrobustlyinferringgeneregulatorynetworksfrommultipletimecoursedatasets