Cargando…

NetProphet 3: a machine learning framework for transcription factor network mapping and multi-omics integration

MOTIVATION: Many methods have been proposed for mapping the targets of transcription factors (TFs) from gene expression data. It is known that combining outputs from multiple methods can improve performance. To date, outputs have been combined by using either simplistic formulae, such as geometric m...

Descripción completa

Detalles Bibliográficos
Autores principales: Abid, Dhoha, Brent, Michael R
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9912366/
https://www.ncbi.nlm.nih.gov/pubmed/36692138
http://dx.doi.org/10.1093/bioinformatics/btad038
_version_ 1784885191387381760
author Abid, Dhoha
Brent, Michael R
author_facet Abid, Dhoha
Brent, Michael R
author_sort Abid, Dhoha
collection PubMed
description MOTIVATION: Many methods have been proposed for mapping the targets of transcription factors (TFs) from gene expression data. It is known that combining outputs from multiple methods can improve performance. To date, outputs have been combined by using either simplistic formulae, such as geometric mean, or carefully hand-tuned formulae that may not generalize well to new inputs. Finally, the evaluation of accuracy has been challenging due to the lack of genome-scale, ground-truth networks. RESULTS: We developed NetProphet3, which combines scores from multiple analyses automatically, using a tree boosting algorithm trained on TF binding location data. We also developed three independent, genome-scale evaluation metrics. By these metrics, NetProphet3 is more accurate than other commonly used packages, including NetProphet 2.0, when gene expression data from direct TF perturbations are available. Furthermore, its integration mode can forge a consensus network from gene expression data and TF binding location data. AVAILABILITY AND IMPLEMENTATION: All data and code are available at https://zenodo.org/record/7504131#.Y7Wu3i-B2x8. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9912366
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-99123662023-02-13 NetProphet 3: a machine learning framework for transcription factor network mapping and multi-omics integration Abid, Dhoha Brent, Michael R Bioinformatics Original Paper MOTIVATION: Many methods have been proposed for mapping the targets of transcription factors (TFs) from gene expression data. It is known that combining outputs from multiple methods can improve performance. To date, outputs have been combined by using either simplistic formulae, such as geometric mean, or carefully hand-tuned formulae that may not generalize well to new inputs. Finally, the evaluation of accuracy has been challenging due to the lack of genome-scale, ground-truth networks. RESULTS: We developed NetProphet3, which combines scores from multiple analyses automatically, using a tree boosting algorithm trained on TF binding location data. We also developed three independent, genome-scale evaluation metrics. By these metrics, NetProphet3 is more accurate than other commonly used packages, including NetProphet 2.0, when gene expression data from direct TF perturbations are available. Furthermore, its integration mode can forge a consensus network from gene expression data and TF binding location data. AVAILABILITY AND IMPLEMENTATION: All data and code are available at https://zenodo.org/record/7504131#.Y7Wu3i-B2x8. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2023-01-24 /pmc/articles/PMC9912366/ /pubmed/36692138 http://dx.doi.org/10.1093/bioinformatics/btad038 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Abid, Dhoha
Brent, Michael R
NetProphet 3: a machine learning framework for transcription factor network mapping and multi-omics integration
title NetProphet 3: a machine learning framework for transcription factor network mapping and multi-omics integration
title_full NetProphet 3: a machine learning framework for transcription factor network mapping and multi-omics integration
title_fullStr NetProphet 3: a machine learning framework for transcription factor network mapping and multi-omics integration
title_full_unstemmed NetProphet 3: a machine learning framework for transcription factor network mapping and multi-omics integration
title_short NetProphet 3: a machine learning framework for transcription factor network mapping and multi-omics integration
title_sort netprophet 3: a machine learning framework for transcription factor network mapping and multi-omics integration
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9912366/
https://www.ncbi.nlm.nih.gov/pubmed/36692138
http://dx.doi.org/10.1093/bioinformatics/btad038
work_keys_str_mv AT abiddhoha netprophet3amachinelearningframeworkfortranscriptionfactornetworkmappingandmultiomicsintegration
AT brentmichaelr netprophet3amachinelearningframeworkfortranscriptionfactornetworkmappingandmultiomicsintegration