Cargando…

RGBM: regularized gradient boosting machines for identification of the transcriptional regulators of discrete glioma subtypes

We propose a generic framework for gene regulatory network (GRN) inference approached as a feature selection problem. GRNs obtained using Machine Learning techniques are often dense, whereas real GRNs are rather sparse. We use a Tikonov regularization inspired optimal L-curve criterion that utilizes...

Descripción completa

Detalles Bibliográficos
Autores principales: Mall, Raghvendra, Cerulo, Luigi, Garofano, Luciano, Frattini, Veronique, Kunji, Khalid, Bensmail, Halima, Sabedot, Thais S, Noushmehr, Houtan, Lasorella, Anna, Iavarone, Antonio, Ceccarelli, Michele
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6283452/
https://www.ncbi.nlm.nih.gov/pubmed/29361062
http://dx.doi.org/10.1093/nar/gky015
_version_ 1783379165246914560
author Mall, Raghvendra
Cerulo, Luigi
Garofano, Luciano
Frattini, Veronique
Kunji, Khalid
Bensmail, Halima
Sabedot, Thais S
Noushmehr, Houtan
Lasorella, Anna
Iavarone, Antonio
Ceccarelli, Michele
author_facet Mall, Raghvendra
Cerulo, Luigi
Garofano, Luciano
Frattini, Veronique
Kunji, Khalid
Bensmail, Halima
Sabedot, Thais S
Noushmehr, Houtan
Lasorella, Anna
Iavarone, Antonio
Ceccarelli, Michele
author_sort Mall, Raghvendra
collection PubMed
description We propose a generic framework for gene regulatory network (GRN) inference approached as a feature selection problem. GRNs obtained using Machine Learning techniques are often dense, whereas real GRNs are rather sparse. We use a Tikonov regularization inspired optimal L-curve criterion that utilizes the edge weight distribution for a given target gene to determine the optimal set of TFs associated with it. Our proposed framework allows to incorporate a mechanistic active biding network based on cis-regulatory motif analysis. We evaluate our regularization framework in conjunction with two non-linear ML techniques, namely gradient boosting machines (GBM) and random-forests (GENIE), resulting in a regularized feature selection based method specifically called RGBM and RGENIE respectively. RGBM has been used to identify the main transcription factors that are causally involved as master regulators of the gene expression signature activated in the FGFR3-TACC3-positive glioblastoma. Here, we illustrate that RGBM identifies the main regulators of the molecular subtypes of brain tumors. Our analysis reveals the identity and corresponding biological activities of the master regulators characterizing the difference between G-CIMP-high and G-CIMP-low subtypes and between PA-like and LGm6-GBM, thus providing a clue to the yet undetermined nature of the transcriptional events among these subtypes.
format Online
Article
Text
id pubmed-6283452
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-62834522018-12-11 RGBM: regularized gradient boosting machines for identification of the transcriptional regulators of discrete glioma subtypes Mall, Raghvendra Cerulo, Luigi Garofano, Luciano Frattini, Veronique Kunji, Khalid Bensmail, Halima Sabedot, Thais S Noushmehr, Houtan Lasorella, Anna Iavarone, Antonio Ceccarelli, Michele Nucleic Acids Res Methods Online We propose a generic framework for gene regulatory network (GRN) inference approached as a feature selection problem. GRNs obtained using Machine Learning techniques are often dense, whereas real GRNs are rather sparse. We use a Tikonov regularization inspired optimal L-curve criterion that utilizes the edge weight distribution for a given target gene to determine the optimal set of TFs associated with it. Our proposed framework allows to incorporate a mechanistic active biding network based on cis-regulatory motif analysis. We evaluate our regularization framework in conjunction with two non-linear ML techniques, namely gradient boosting machines (GBM) and random-forests (GENIE), resulting in a regularized feature selection based method specifically called RGBM and RGENIE respectively. RGBM has been used to identify the main transcription factors that are causally involved as master regulators of the gene expression signature activated in the FGFR3-TACC3-positive glioblastoma. Here, we illustrate that RGBM identifies the main regulators of the molecular subtypes of brain tumors. Our analysis reveals the identity and corresponding biological activities of the master regulators characterizing the difference between G-CIMP-high and G-CIMP-low subtypes and between PA-like and LGm6-GBM, thus providing a clue to the yet undetermined nature of the transcriptional events among these subtypes. Oxford University Press 2018-04-20 2018-01-19 /pmc/articles/PMC6283452/ /pubmed/29361062 http://dx.doi.org/10.1093/nar/gky015 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Mall, Raghvendra
Cerulo, Luigi
Garofano, Luciano
Frattini, Veronique
Kunji, Khalid
Bensmail, Halima
Sabedot, Thais S
Noushmehr, Houtan
Lasorella, Anna
Iavarone, Antonio
Ceccarelli, Michele
RGBM: regularized gradient boosting machines for identification of the transcriptional regulators of discrete glioma subtypes
title RGBM: regularized gradient boosting machines for identification of the transcriptional regulators of discrete glioma subtypes
title_full RGBM: regularized gradient boosting machines for identification of the transcriptional regulators of discrete glioma subtypes
title_fullStr RGBM: regularized gradient boosting machines for identification of the transcriptional regulators of discrete glioma subtypes
title_full_unstemmed RGBM: regularized gradient boosting machines for identification of the transcriptional regulators of discrete glioma subtypes
title_short RGBM: regularized gradient boosting machines for identification of the transcriptional regulators of discrete glioma subtypes
title_sort rgbm: regularized gradient boosting machines for identification of the transcriptional regulators of discrete glioma subtypes
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6283452/
https://www.ncbi.nlm.nih.gov/pubmed/29361062
http://dx.doi.org/10.1093/nar/gky015
work_keys_str_mv AT mallraghvendra rgbmregularizedgradientboostingmachinesforidentificationofthetranscriptionalregulatorsofdiscretegliomasubtypes
AT ceruloluigi rgbmregularizedgradientboostingmachinesforidentificationofthetranscriptionalregulatorsofdiscretegliomasubtypes
AT garofanoluciano rgbmregularizedgradientboostingmachinesforidentificationofthetranscriptionalregulatorsofdiscretegliomasubtypes
AT frattiniveronique rgbmregularizedgradientboostingmachinesforidentificationofthetranscriptionalregulatorsofdiscretegliomasubtypes
AT kunjikhalid rgbmregularizedgradientboostingmachinesforidentificationofthetranscriptionalregulatorsofdiscretegliomasubtypes
AT bensmailhalima rgbmregularizedgradientboostingmachinesforidentificationofthetranscriptionalregulatorsofdiscretegliomasubtypes
AT sabedotthaiss rgbmregularizedgradientboostingmachinesforidentificationofthetranscriptionalregulatorsofdiscretegliomasubtypes
AT noushmehrhoutan rgbmregularizedgradientboostingmachinesforidentificationofthetranscriptionalregulatorsofdiscretegliomasubtypes
AT lasorellaanna rgbmregularizedgradientboostingmachinesforidentificationofthetranscriptionalregulatorsofdiscretegliomasubtypes
AT iavaroneantonio rgbmregularizedgradientboostingmachinesforidentificationofthetranscriptionalregulatorsofdiscretegliomasubtypes
AT ceccarellimichele rgbmregularizedgradientboostingmachinesforidentificationofthetranscriptionalregulatorsofdiscretegliomasubtypes