Cargando…

GReNaDIne: A Data-Driven Python Library to Infer Gene Regulatory Networks from Gene Expression Data

Context: Inferring gene regulatory networks (GRN) from high-throughput gene expression data is a challenging task for which different strategies have been developed. Nevertheless, no ever-winning method exists, and each method has its advantages, intrinsic biases, and application domains. Thus, in o...

Descripción completa

Detalles Bibliográficos
Autores principales: Schmitt, Pauline, Sorin, Baptiste, Frouté, Timothée, Parisot, Nicolas, Calevro, Federica, Peignier, Sergio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9957546/
https://www.ncbi.nlm.nih.gov/pubmed/36833196
http://dx.doi.org/10.3390/genes14020269
_version_ 1784894851990421504
author Schmitt, Pauline
Sorin, Baptiste
Frouté, Timothée
Parisot, Nicolas
Calevro, Federica
Peignier, Sergio
author_facet Schmitt, Pauline
Sorin, Baptiste
Frouté, Timothée
Parisot, Nicolas
Calevro, Federica
Peignier, Sergio
author_sort Schmitt, Pauline
collection PubMed
description Context: Inferring gene regulatory networks (GRN) from high-throughput gene expression data is a challenging task for which different strategies have been developed. Nevertheless, no ever-winning method exists, and each method has its advantages, intrinsic biases, and application domains. Thus, in order to analyze a dataset, users should be able to test different techniques and choose the most appropriate one. This step can be particularly difficult and time consuming, since most methods’ implementations are made available independently, possibly in different programming languages. The implementation of an open-source library containing different inference methods within a common framework is expected to be a valuable toolkit for the systems biology community. Results: In this work, we introduce GReNaDIne (Gene Regulatory Network Data-driven Inference), a Python package that implements 18 machine learning data-driven gene regulatory network inference methods. It also includes eight generalist preprocessing techniques, suitable for both RNA-seq and microarray dataset analysis, as well as four normalization techniques dedicated to RNA-seq. In addition, this package implements the possibility to combine the results of different inference tools to form robust and efficient ensembles. This package has been successfully assessed under the DREAM5 challenge benchmark dataset. The open-source GReNaDIne Python package is made freely available in a dedicated GitLab repository, as well as in the official third-party software repository PyPI Python Package Index. The latest documentation on the GReNaDIne library is also available at Read the Docs, an open-source software documentation hosting platform. Contribution: The GReNaDIne tool represents a technological contribution to the field of systems biology. This package can be used to infer gene regulatory networks from high-throughput gene expression data using different algorithms within the same framework. In order to analyze their datasets, users can apply a battery of preprocessing and postprocessing tools and choose the most adapted inference method from the GReNaDIne library and even combine the output of different methods to obtain more robust results. The results format provided by GReNaDIne is compatible with well-known complementary refinement tools such as PYSCENIC.
format Online
Article
Text
id pubmed-9957546
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-99575462023-02-25 GReNaDIne: A Data-Driven Python Library to Infer Gene Regulatory Networks from Gene Expression Data Schmitt, Pauline Sorin, Baptiste Frouté, Timothée Parisot, Nicolas Calevro, Federica Peignier, Sergio Genes (Basel) Article Context: Inferring gene regulatory networks (GRN) from high-throughput gene expression data is a challenging task for which different strategies have been developed. Nevertheless, no ever-winning method exists, and each method has its advantages, intrinsic biases, and application domains. Thus, in order to analyze a dataset, users should be able to test different techniques and choose the most appropriate one. This step can be particularly difficult and time consuming, since most methods’ implementations are made available independently, possibly in different programming languages. The implementation of an open-source library containing different inference methods within a common framework is expected to be a valuable toolkit for the systems biology community. Results: In this work, we introduce GReNaDIne (Gene Regulatory Network Data-driven Inference), a Python package that implements 18 machine learning data-driven gene regulatory network inference methods. It also includes eight generalist preprocessing techniques, suitable for both RNA-seq and microarray dataset analysis, as well as four normalization techniques dedicated to RNA-seq. In addition, this package implements the possibility to combine the results of different inference tools to form robust and efficient ensembles. This package has been successfully assessed under the DREAM5 challenge benchmark dataset. The open-source GReNaDIne Python package is made freely available in a dedicated GitLab repository, as well as in the official third-party software repository PyPI Python Package Index. The latest documentation on the GReNaDIne library is also available at Read the Docs, an open-source software documentation hosting platform. Contribution: The GReNaDIne tool represents a technological contribution to the field of systems biology. This package can be used to infer gene regulatory networks from high-throughput gene expression data using different algorithms within the same framework. In order to analyze their datasets, users can apply a battery of preprocessing and postprocessing tools and choose the most adapted inference method from the GReNaDIne library and even combine the output of different methods to obtain more robust results. The results format provided by GReNaDIne is compatible with well-known complementary refinement tools such as PYSCENIC. MDPI 2023-01-20 /pmc/articles/PMC9957546/ /pubmed/36833196 http://dx.doi.org/10.3390/genes14020269 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Schmitt, Pauline
Sorin, Baptiste
Frouté, Timothée
Parisot, Nicolas
Calevro, Federica
Peignier, Sergio
GReNaDIne: A Data-Driven Python Library to Infer Gene Regulatory Networks from Gene Expression Data
title GReNaDIne: A Data-Driven Python Library to Infer Gene Regulatory Networks from Gene Expression Data
title_full GReNaDIne: A Data-Driven Python Library to Infer Gene Regulatory Networks from Gene Expression Data
title_fullStr GReNaDIne: A Data-Driven Python Library to Infer Gene Regulatory Networks from Gene Expression Data
title_full_unstemmed GReNaDIne: A Data-Driven Python Library to Infer Gene Regulatory Networks from Gene Expression Data
title_short GReNaDIne: A Data-Driven Python Library to Infer Gene Regulatory Networks from Gene Expression Data
title_sort grenadine: a data-driven python library to infer gene regulatory networks from gene expression data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9957546/
https://www.ncbi.nlm.nih.gov/pubmed/36833196
http://dx.doi.org/10.3390/genes14020269
work_keys_str_mv AT schmittpauline grenadineadatadrivenpythonlibrarytoinfergeneregulatorynetworksfromgeneexpressiondata
AT sorinbaptiste grenadineadatadrivenpythonlibrarytoinfergeneregulatorynetworksfromgeneexpressiondata
AT froutetimothee grenadineadatadrivenpythonlibrarytoinfergeneregulatorynetworksfromgeneexpressiondata
AT parisotnicolas grenadineadatadrivenpythonlibrarytoinfergeneregulatorynetworksfromgeneexpressiondata
AT calevrofederica grenadineadatadrivenpythonlibrarytoinfergeneregulatorynetworksfromgeneexpressiondata
AT peigniersergio grenadineadatadrivenpythonlibrarytoinfergeneregulatorynetworksfromgeneexpressiondata