Cargando…
A group LASSO-based method for robustly inferring gene regulatory networks from multiple time-course datasets
BACKGROUND: As an abstract mapping of the gene regulations in the cell, gene regulatory network is important to both biological research study and practical applications. The reverse engineering of gene regulatory networks from microarray gene expression data is a challenging research problem in sys...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4243122/ https://www.ncbi.nlm.nih.gov/pubmed/25350697 http://dx.doi.org/10.1186/1752-0509-8-S3-S1 |
_version_ | 1782346064151445504 |
---|---|
author | Liu, Li-Zhi Wu, Fang-Xiang Zhang, Wen-Jun |
author_facet | Liu, Li-Zhi Wu, Fang-Xiang Zhang, Wen-Jun |
author_sort | Liu, Li-Zhi |
collection | PubMed |
description | BACKGROUND: As an abstract mapping of the gene regulations in the cell, gene regulatory network is important to both biological research study and practical applications. The reverse engineering of gene regulatory networks from microarray gene expression data is a challenging research problem in systems biology. With the development of biological technologies, multiple time-course gene expression datasets might be collected for a specific gene network under different circumstances. The inference of a gene regulatory network can be improved by integrating these multiple datasets. It is also known that gene expression data may be contaminated with large errors or outliers, which may affect the inference results. RESULTS: A novel method, Huber group LASSO, is proposed to infer the same underlying network topology from multiple time-course gene expression datasets as well as to take the robustness to large error or outliers into account. To solve the optimization problem involved in the proposed method, an efficient algorithm which combines the ideas of auxiliary function minimization and block descent is developed. A stability selection method is adapted to our method to find a network topology consisting of edges with scores. The proposed method is applied to both simulation datasets and real experimental datasets. It shows that Huber group LASSO outperforms the group LASSO in terms of both areas under receiver operating characteristic curves and areas under the precision-recall curves. CONCLUSIONS: The convergence analysis of the algorithm theoretically shows that the sequence generated from the algorithm converges to the optimal solution of the problem. The simulation and real data examples demonstrate the effectiveness of the Huber group LASSO in integrating multiple time-course gene expression datasets and improving the resistance to large errors or outliers. |
format | Online Article Text |
id | pubmed-4243122 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42431222014-11-26 A group LASSO-based method for robustly inferring gene regulatory networks from multiple time-course datasets Liu, Li-Zhi Wu, Fang-Xiang Zhang, Wen-Jun BMC Syst Biol Research BACKGROUND: As an abstract mapping of the gene regulations in the cell, gene regulatory network is important to both biological research study and practical applications. The reverse engineering of gene regulatory networks from microarray gene expression data is a challenging research problem in systems biology. With the development of biological technologies, multiple time-course gene expression datasets might be collected for a specific gene network under different circumstances. The inference of a gene regulatory network can be improved by integrating these multiple datasets. It is also known that gene expression data may be contaminated with large errors or outliers, which may affect the inference results. RESULTS: A novel method, Huber group LASSO, is proposed to infer the same underlying network topology from multiple time-course gene expression datasets as well as to take the robustness to large error or outliers into account. To solve the optimization problem involved in the proposed method, an efficient algorithm which combines the ideas of auxiliary function minimization and block descent is developed. A stability selection method is adapted to our method to find a network topology consisting of edges with scores. The proposed method is applied to both simulation datasets and real experimental datasets. It shows that Huber group LASSO outperforms the group LASSO in terms of both areas under receiver operating characteristic curves and areas under the precision-recall curves. CONCLUSIONS: The convergence analysis of the algorithm theoretically shows that the sequence generated from the algorithm converges to the optimal solution of the problem. The simulation and real data examples demonstrate the effectiveness of the Huber group LASSO in integrating multiple time-course gene expression datasets and improving the resistance to large errors or outliers. BioMed Central 2014-10-22 /pmc/articles/PMC4243122/ /pubmed/25350697 http://dx.doi.org/10.1186/1752-0509-8-S3-S1 Text en Copyright © 2014 Liu et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Liu, Li-Zhi Wu, Fang-Xiang Zhang, Wen-Jun A group LASSO-based method for robustly inferring gene regulatory networks from multiple time-course datasets |
title | A group LASSO-based method for robustly inferring gene regulatory networks from multiple time-course datasets |
title_full | A group LASSO-based method for robustly inferring gene regulatory networks from multiple time-course datasets |
title_fullStr | A group LASSO-based method for robustly inferring gene regulatory networks from multiple time-course datasets |
title_full_unstemmed | A group LASSO-based method for robustly inferring gene regulatory networks from multiple time-course datasets |
title_short | A group LASSO-based method for robustly inferring gene regulatory networks from multiple time-course datasets |
title_sort | group lasso-based method for robustly inferring gene regulatory networks from multiple time-course datasets |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4243122/ https://www.ncbi.nlm.nih.gov/pubmed/25350697 http://dx.doi.org/10.1186/1752-0509-8-S3-S1 |
work_keys_str_mv | AT liulizhi agrouplassobasedmethodforrobustlyinferringgeneregulatorynetworksfrommultipletimecoursedatasets AT wufangxiang agrouplassobasedmethodforrobustlyinferringgeneregulatorynetworksfrommultipletimecoursedatasets AT zhangwenjun agrouplassobasedmethodforrobustlyinferringgeneregulatorynetworksfrommultipletimecoursedatasets AT liulizhi grouplassobasedmethodforrobustlyinferringgeneregulatorynetworksfrommultipletimecoursedatasets AT wufangxiang grouplassobasedmethodforrobustlyinferringgeneregulatorynetworksfrommultipletimecoursedatasets AT zhangwenjun grouplassobasedmethodforrobustlyinferringgeneregulatorynetworksfrommultipletimecoursedatasets |