Cargando…

Leveraging omic features with F3UTER enables identification of unannotated 3’UTRs for synaptic genes

There is growing evidence for the importance of 3’ untranslated region (3’UTR) dependent regulatory processes. However, our current human 3’UTR catalogue is incomplete. Here, we develop a machine learning-based framework, leveraging both genomic and tissue-specific transcriptomic features to predict...

Descripción completa

Detalles Bibliográficos
Autores principales: Sethi, Siddharth, Zhang, David, Guelfi, Sebastian, Chen, Zhongbo, Garcia-Ruiz, Sonia, Olagbaju, Emmanuel O., Ryten, Mina, Saini, Harpreet, Botia, Juan A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9046390/
https://www.ncbi.nlm.nih.gov/pubmed/35477703
http://dx.doi.org/10.1038/s41467-022-30017-z
_version_ 1784695513484886016
author Sethi, Siddharth
Zhang, David
Guelfi, Sebastian
Chen, Zhongbo
Garcia-Ruiz, Sonia
Olagbaju, Emmanuel O.
Ryten, Mina
Saini, Harpreet
Botia, Juan A.
author_facet Sethi, Siddharth
Zhang, David
Guelfi, Sebastian
Chen, Zhongbo
Garcia-Ruiz, Sonia
Olagbaju, Emmanuel O.
Ryten, Mina
Saini, Harpreet
Botia, Juan A.
author_sort Sethi, Siddharth
collection PubMed
description There is growing evidence for the importance of 3’ untranslated region (3’UTR) dependent regulatory processes. However, our current human 3’UTR catalogue is incomplete. Here, we develop a machine learning-based framework, leveraging both genomic and tissue-specific transcriptomic features to predict previously unannotated 3’UTRs. We identify unannotated 3’UTRs associated with 1,563 genes across 39 human tissues, with the greatest abundance found in the brain. These unannotated 3’UTRs are significantly enriched for RNA binding protein (RBP) motifs and exhibit high human lineage-specificity. We find that brain-specific unannotated 3’UTRs are enriched for the binding motifs of important neuronal RBPs such as TARDBP and RBFOX1, and their associated genes are involved in synaptic function. Our data is shared through an online resource F3UTER (https://astx.shinyapps.io/F3UTER/). Overall, our data improves 3’UTR annotation and provides additional insights into the mRNA-RBP interactome in the human brain, with implications for our understanding of neurological and neurodevelopmental diseases.
format Online
Article
Text
id pubmed-9046390
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-90463902022-04-29 Leveraging omic features with F3UTER enables identification of unannotated 3’UTRs for synaptic genes Sethi, Siddharth Zhang, David Guelfi, Sebastian Chen, Zhongbo Garcia-Ruiz, Sonia Olagbaju, Emmanuel O. Ryten, Mina Saini, Harpreet Botia, Juan A. Nat Commun Article There is growing evidence for the importance of 3’ untranslated region (3’UTR) dependent regulatory processes. However, our current human 3’UTR catalogue is incomplete. Here, we develop a machine learning-based framework, leveraging both genomic and tissue-specific transcriptomic features to predict previously unannotated 3’UTRs. We identify unannotated 3’UTRs associated with 1,563 genes across 39 human tissues, with the greatest abundance found in the brain. These unannotated 3’UTRs are significantly enriched for RNA binding protein (RBP) motifs and exhibit high human lineage-specificity. We find that brain-specific unannotated 3’UTRs are enriched for the binding motifs of important neuronal RBPs such as TARDBP and RBFOX1, and their associated genes are involved in synaptic function. Our data is shared through an online resource F3UTER (https://astx.shinyapps.io/F3UTER/). Overall, our data improves 3’UTR annotation and provides additional insights into the mRNA-RBP interactome in the human brain, with implications for our understanding of neurological and neurodevelopmental diseases. Nature Publishing Group UK 2022-04-27 /pmc/articles/PMC9046390/ /pubmed/35477703 http://dx.doi.org/10.1038/s41467-022-30017-z Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Sethi, Siddharth
Zhang, David
Guelfi, Sebastian
Chen, Zhongbo
Garcia-Ruiz, Sonia
Olagbaju, Emmanuel O.
Ryten, Mina
Saini, Harpreet
Botia, Juan A.
Leveraging omic features with F3UTER enables identification of unannotated 3’UTRs for synaptic genes
title Leveraging omic features with F3UTER enables identification of unannotated 3’UTRs for synaptic genes
title_full Leveraging omic features with F3UTER enables identification of unannotated 3’UTRs for synaptic genes
title_fullStr Leveraging omic features with F3UTER enables identification of unannotated 3’UTRs for synaptic genes
title_full_unstemmed Leveraging omic features with F3UTER enables identification of unannotated 3’UTRs for synaptic genes
title_short Leveraging omic features with F3UTER enables identification of unannotated 3’UTRs for synaptic genes
title_sort leveraging omic features with f3uter enables identification of unannotated 3’utrs for synaptic genes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9046390/
https://www.ncbi.nlm.nih.gov/pubmed/35477703
http://dx.doi.org/10.1038/s41467-022-30017-z
work_keys_str_mv AT sethisiddharth leveragingomicfeatureswithf3uterenablesidentificationofunannotated3utrsforsynapticgenes
AT zhangdavid leveragingomicfeatureswithf3uterenablesidentificationofunannotated3utrsforsynapticgenes
AT guelfisebastian leveragingomicfeatureswithf3uterenablesidentificationofunannotated3utrsforsynapticgenes
AT chenzhongbo leveragingomicfeatureswithf3uterenablesidentificationofunannotated3utrsforsynapticgenes
AT garciaruizsonia leveragingomicfeatureswithf3uterenablesidentificationofunannotated3utrsforsynapticgenes
AT olagbajuemmanuelo leveragingomicfeatureswithf3uterenablesidentificationofunannotated3utrsforsynapticgenes
AT rytenmina leveragingomicfeatureswithf3uterenablesidentificationofunannotated3utrsforsynapticgenes
AT sainiharpreet leveragingomicfeatureswithf3uterenablesidentificationofunannotated3utrsforsynapticgenes
AT botiajuana leveragingomicfeatureswithf3uterenablesidentificationofunannotated3utrsforsynapticgenes