Cargando…

Prioritizing and characterizing functionally relevant genes across human tissues

Knowledge of genes that are critical to a tissue’s function remains difficult to ascertain and presents a major bottleneck toward a mechanistic understanding of genotype-phenotype links. Here, we present the first machine learning model–FUGUE–combining transcriptional and network features, to predic...

Descripción completa

Detalles Bibliográficos
Autores principales: Somepalli, Gowthami, Sahoo, Sarthak, Singh, Arashdeep, Hannenhalli, Sridhar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8284802/
https://www.ncbi.nlm.nih.gov/pubmed/34270548
http://dx.doi.org/10.1371/journal.pcbi.1009194
_version_ 1783723462432391168
author Somepalli, Gowthami
Sahoo, Sarthak
Singh, Arashdeep
Hannenhalli, Sridhar
author_facet Somepalli, Gowthami
Sahoo, Sarthak
Singh, Arashdeep
Hannenhalli, Sridhar
author_sort Somepalli, Gowthami
collection PubMed
description Knowledge of genes that are critical to a tissue’s function remains difficult to ascertain and presents a major bottleneck toward a mechanistic understanding of genotype-phenotype links. Here, we present the first machine learning model–FUGUE–combining transcriptional and network features, to predict tissue-relevant genes across 30 human tissues. FUGUE achieves an average cross-validation auROC of 0.86 and auPRC of 0.50 (expected 0.09). In independent datasets, FUGUE accurately distinguishes tissue or cell type-specific genes, significantly outperforming the conventional metric based on tissue-specific expression alone. Comparison of tissue-relevant transcription factors across tissue recapitulate their developmental relationships. Interestingly, the tissue-relevant genes cluster on the genome within topologically associated domains and furthermore, are highly enriched for differentially expressed genes in the corresponding cancer type. We provide the prioritized gene lists in 30 human tissues and an open-source software to prioritize genes in a novel context given multi-sample transcriptomic data.
format Online
Article
Text
id pubmed-8284802
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-82848022021-07-28 Prioritizing and characterizing functionally relevant genes across human tissues Somepalli, Gowthami Sahoo, Sarthak Singh, Arashdeep Hannenhalli, Sridhar PLoS Comput Biol Research Article Knowledge of genes that are critical to a tissue’s function remains difficult to ascertain and presents a major bottleneck toward a mechanistic understanding of genotype-phenotype links. Here, we present the first machine learning model–FUGUE–combining transcriptional and network features, to predict tissue-relevant genes across 30 human tissues. FUGUE achieves an average cross-validation auROC of 0.86 and auPRC of 0.50 (expected 0.09). In independent datasets, FUGUE accurately distinguishes tissue or cell type-specific genes, significantly outperforming the conventional metric based on tissue-specific expression alone. Comparison of tissue-relevant transcription factors across tissue recapitulate their developmental relationships. Interestingly, the tissue-relevant genes cluster on the genome within topologically associated domains and furthermore, are highly enriched for differentially expressed genes in the corresponding cancer type. We provide the prioritized gene lists in 30 human tissues and an open-source software to prioritize genes in a novel context given multi-sample transcriptomic data. Public Library of Science 2021-07-16 /pmc/articles/PMC8284802/ /pubmed/34270548 http://dx.doi.org/10.1371/journal.pcbi.1009194 Text en https://creativecommons.org/publicdomain/zero/1.0/This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Somepalli, Gowthami
Sahoo, Sarthak
Singh, Arashdeep
Hannenhalli, Sridhar
Prioritizing and characterizing functionally relevant genes across human tissues
title Prioritizing and characterizing functionally relevant genes across human tissues
title_full Prioritizing and characterizing functionally relevant genes across human tissues
title_fullStr Prioritizing and characterizing functionally relevant genes across human tissues
title_full_unstemmed Prioritizing and characterizing functionally relevant genes across human tissues
title_short Prioritizing and characterizing functionally relevant genes across human tissues
title_sort prioritizing and characterizing functionally relevant genes across human tissues
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8284802/
https://www.ncbi.nlm.nih.gov/pubmed/34270548
http://dx.doi.org/10.1371/journal.pcbi.1009194
work_keys_str_mv AT somepalligowthami prioritizingandcharacterizingfunctionallyrelevantgenesacrosshumantissues
AT sahoosarthak prioritizingandcharacterizingfunctionallyrelevantgenesacrosshumantissues
AT singharashdeep prioritizingandcharacterizingfunctionallyrelevantgenesacrosshumantissues
AT hannenhallisridhar prioritizingandcharacterizingfunctionallyrelevantgenesacrosshumantissues