Cargando…

KBoost: a new method to infer gene regulatory networks from gene expression data

Reconstructing gene regulatory networks is crucial to understand biological processes and holds potential for developing personalized treatment. Yet, it is still an open problem as state-of-the-art algorithms are often not able to process large amounts of data within reasonable time. Furthermore, ma...

Descripción completa

Detalles Bibliográficos
Autores principales: Iglesias-Martinez, Luis F., De Kegel, Barbara, Kolch, Walter
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8322418/
https://www.ncbi.nlm.nih.gov/pubmed/34326402
http://dx.doi.org/10.1038/s41598-021-94919-6
_version_ 1783731043851829248
author Iglesias-Martinez, Luis F.
De Kegel, Barbara
Kolch, Walter
author_facet Iglesias-Martinez, Luis F.
De Kegel, Barbara
Kolch, Walter
author_sort Iglesias-Martinez, Luis F.
collection PubMed
description Reconstructing gene regulatory networks is crucial to understand biological processes and holds potential for developing personalized treatment. Yet, it is still an open problem as state-of-the-art algorithms are often not able to process large amounts of data within reasonable time. Furthermore, many of the existing methods predict numerous false positives and have limited capabilities to integrate other sources of information, such as previously known interactions. Here we introduce KBoost, an algorithm that uses kernel PCA regression, boosting and Bayesian model averaging for fast and accurate reconstruction of gene regulatory networks. We have benchmarked KBoost against other high performing algorithms using three different datasets. The results show that our method compares favorably to other methods across datasets. We have also applied KBoost to a large cohort of close to 2000 breast cancer patients and 24,000 genes in less than 2 h on standard hardware. Our results show that molecularly defined breast cancer subtypes also feature differences in their GRNs. An implementation of KBoost in the form of an R package is available at: https://github.com/Luisiglm/KBoost and as a Bioconductor software package.
format Online
Article
Text
id pubmed-8322418
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-83224182021-07-30 KBoost: a new method to infer gene regulatory networks from gene expression data Iglesias-Martinez, Luis F. De Kegel, Barbara Kolch, Walter Sci Rep Article Reconstructing gene regulatory networks is crucial to understand biological processes and holds potential for developing personalized treatment. Yet, it is still an open problem as state-of-the-art algorithms are often not able to process large amounts of data within reasonable time. Furthermore, many of the existing methods predict numerous false positives and have limited capabilities to integrate other sources of information, such as previously known interactions. Here we introduce KBoost, an algorithm that uses kernel PCA regression, boosting and Bayesian model averaging for fast and accurate reconstruction of gene regulatory networks. We have benchmarked KBoost against other high performing algorithms using three different datasets. The results show that our method compares favorably to other methods across datasets. We have also applied KBoost to a large cohort of close to 2000 breast cancer patients and 24,000 genes in less than 2 h on standard hardware. Our results show that molecularly defined breast cancer subtypes also feature differences in their GRNs. An implementation of KBoost in the form of an R package is available at: https://github.com/Luisiglm/KBoost and as a Bioconductor software package. Nature Publishing Group UK 2021-07-29 /pmc/articles/PMC8322418/ /pubmed/34326402 http://dx.doi.org/10.1038/s41598-021-94919-6 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Iglesias-Martinez, Luis F.
De Kegel, Barbara
Kolch, Walter
KBoost: a new method to infer gene regulatory networks from gene expression data
title KBoost: a new method to infer gene regulatory networks from gene expression data
title_full KBoost: a new method to infer gene regulatory networks from gene expression data
title_fullStr KBoost: a new method to infer gene regulatory networks from gene expression data
title_full_unstemmed KBoost: a new method to infer gene regulatory networks from gene expression data
title_short KBoost: a new method to infer gene regulatory networks from gene expression data
title_sort kboost: a new method to infer gene regulatory networks from gene expression data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8322418/
https://www.ncbi.nlm.nih.gov/pubmed/34326402
http://dx.doi.org/10.1038/s41598-021-94919-6
work_keys_str_mv AT iglesiasmartinezluisf kboostanewmethodtoinfergeneregulatorynetworksfromgeneexpressiondata
AT dekegelbarbara kboostanewmethodtoinfergeneregulatorynetworksfromgeneexpressiondata
AT kolchwalter kboostanewmethodtoinfergeneregulatorynetworksfromgeneexpressiondata