Cargando…
Prioritizing and characterizing functionally relevant genes across human tissues
Knowledge of genes that are critical to a tissue’s function remains difficult to ascertain and presents a major bottleneck toward a mechanistic understanding of genotype-phenotype links. Here, we present the first machine learning model–FUGUE–combining transcriptional and network features, to predic...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8284802/ https://www.ncbi.nlm.nih.gov/pubmed/34270548 http://dx.doi.org/10.1371/journal.pcbi.1009194 |
_version_ | 1783723462432391168 |
---|---|
author | Somepalli, Gowthami Sahoo, Sarthak Singh, Arashdeep Hannenhalli, Sridhar |
author_facet | Somepalli, Gowthami Sahoo, Sarthak Singh, Arashdeep Hannenhalli, Sridhar |
author_sort | Somepalli, Gowthami |
collection | PubMed |
description | Knowledge of genes that are critical to a tissue’s function remains difficult to ascertain and presents a major bottleneck toward a mechanistic understanding of genotype-phenotype links. Here, we present the first machine learning model–FUGUE–combining transcriptional and network features, to predict tissue-relevant genes across 30 human tissues. FUGUE achieves an average cross-validation auROC of 0.86 and auPRC of 0.50 (expected 0.09). In independent datasets, FUGUE accurately distinguishes tissue or cell type-specific genes, significantly outperforming the conventional metric based on tissue-specific expression alone. Comparison of tissue-relevant transcription factors across tissue recapitulate their developmental relationships. Interestingly, the tissue-relevant genes cluster on the genome within topologically associated domains and furthermore, are highly enriched for differentially expressed genes in the corresponding cancer type. We provide the prioritized gene lists in 30 human tissues and an open-source software to prioritize genes in a novel context given multi-sample transcriptomic data. |
format | Online Article Text |
id | pubmed-8284802 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-82848022021-07-28 Prioritizing and characterizing functionally relevant genes across human tissues Somepalli, Gowthami Sahoo, Sarthak Singh, Arashdeep Hannenhalli, Sridhar PLoS Comput Biol Research Article Knowledge of genes that are critical to a tissue’s function remains difficult to ascertain and presents a major bottleneck toward a mechanistic understanding of genotype-phenotype links. Here, we present the first machine learning model–FUGUE–combining transcriptional and network features, to predict tissue-relevant genes across 30 human tissues. FUGUE achieves an average cross-validation auROC of 0.86 and auPRC of 0.50 (expected 0.09). In independent datasets, FUGUE accurately distinguishes tissue or cell type-specific genes, significantly outperforming the conventional metric based on tissue-specific expression alone. Comparison of tissue-relevant transcription factors across tissue recapitulate their developmental relationships. Interestingly, the tissue-relevant genes cluster on the genome within topologically associated domains and furthermore, are highly enriched for differentially expressed genes in the corresponding cancer type. We provide the prioritized gene lists in 30 human tissues and an open-source software to prioritize genes in a novel context given multi-sample transcriptomic data. Public Library of Science 2021-07-16 /pmc/articles/PMC8284802/ /pubmed/34270548 http://dx.doi.org/10.1371/journal.pcbi.1009194 Text en https://creativecommons.org/publicdomain/zero/1.0/This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication. |
spellingShingle | Research Article Somepalli, Gowthami Sahoo, Sarthak Singh, Arashdeep Hannenhalli, Sridhar Prioritizing and characterizing functionally relevant genes across human tissues |
title | Prioritizing and characterizing functionally relevant genes across human tissues |
title_full | Prioritizing and characterizing functionally relevant genes across human tissues |
title_fullStr | Prioritizing and characterizing functionally relevant genes across human tissues |
title_full_unstemmed | Prioritizing and characterizing functionally relevant genes across human tissues |
title_short | Prioritizing and characterizing functionally relevant genes across human tissues |
title_sort | prioritizing and characterizing functionally relevant genes across human tissues |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8284802/ https://www.ncbi.nlm.nih.gov/pubmed/34270548 http://dx.doi.org/10.1371/journal.pcbi.1009194 |
work_keys_str_mv | AT somepalligowthami prioritizingandcharacterizingfunctionallyrelevantgenesacrosshumantissues AT sahoosarthak prioritizingandcharacterizingfunctionallyrelevantgenesacrosshumantissues AT singharashdeep prioritizingandcharacterizingfunctionallyrelevantgenesacrosshumantissues AT hannenhallisridhar prioritizingandcharacterizingfunctionallyrelevantgenesacrosshumantissues |