Cargando…

Functionathon: a manual data mining workflow to generate functional hypotheses for uncharacterized human proteins and its application by undergraduate students

About 10% of human proteins have no annotated function in protein knowledge bases. A workflow to generate hypotheses for the function of these uncharacterized proteins has been developed, based on predicted and experimental information on protein properties, interactions, tissular expression, subcel...

Descripción completa

Detalles Bibliográficos
Autores principales: Duek, Paula, Mary, Camille, Zahn-Zabal, Monique, Bairoch, Amos, Lane, Lydie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8317215/
https://www.ncbi.nlm.nih.gov/pubmed/34318869
http://dx.doi.org/10.1093/database/baab046
_version_ 1783730027542609920
author Duek, Paula
Mary, Camille
Zahn-Zabal, Monique
Bairoch, Amos
Lane, Lydie
author_facet Duek, Paula
Mary, Camille
Zahn-Zabal, Monique
Bairoch, Amos
Lane, Lydie
author_sort Duek, Paula
collection PubMed
description About 10% of human proteins have no annotated function in protein knowledge bases. A workflow to generate hypotheses for the function of these uncharacterized proteins has been developed, based on predicted and experimental information on protein properties, interactions, tissular expression, subcellular localization, conservation in other organisms, as well as phenotypic data in mutant model organisms. This workflow has been applied to seven uncharacterized human proteins (C6orf118, C7orf25, CXorf58, RSRP1, SMLR1, TMEM53 and TMEM232) in the frame of a course-based undergraduate research experience named Functionathon organized at the University of Geneva to teach undergraduate students how to use biological databases and bioinformatics tools and interpret the results. C6orf118, CXorf58 and TMEM232 were proposed to be involved in cilia-related functions; TMEM53 and SMLR1 were proposed to be involved in lipid metabolism and C7orf25 and RSRP1 were proposed to be involved in RNA metabolism and gene expression. Experimental strategies to test these hypotheses were also discussed. The results of this manual data mining study may contribute to the project recently launched by the Human Proteome Organization (HUPO) Human Proteome Project aiming to fill gaps in the functional annotation of human proteins. Database URL: http://www.nextprot.org
format Online
Article
Text
id pubmed-8317215
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-83172152021-07-29 Functionathon: a manual data mining workflow to generate functional hypotheses for uncharacterized human proteins and its application by undergraduate students Duek, Paula Mary, Camille Zahn-Zabal, Monique Bairoch, Amos Lane, Lydie Database (Oxford) Original Article About 10% of human proteins have no annotated function in protein knowledge bases. A workflow to generate hypotheses for the function of these uncharacterized proteins has been developed, based on predicted and experimental information on protein properties, interactions, tissular expression, subcellular localization, conservation in other organisms, as well as phenotypic data in mutant model organisms. This workflow has been applied to seven uncharacterized human proteins (C6orf118, C7orf25, CXorf58, RSRP1, SMLR1, TMEM53 and TMEM232) in the frame of a course-based undergraduate research experience named Functionathon organized at the University of Geneva to teach undergraduate students how to use biological databases and bioinformatics tools and interpret the results. C6orf118, CXorf58 and TMEM232 were proposed to be involved in cilia-related functions; TMEM53 and SMLR1 were proposed to be involved in lipid metabolism and C7orf25 and RSRP1 were proposed to be involved in RNA metabolism and gene expression. Experimental strategies to test these hypotheses were also discussed. The results of this manual data mining study may contribute to the project recently launched by the Human Proteome Organization (HUPO) Human Proteome Project aiming to fill gaps in the functional annotation of human proteins. Database URL: http://www.nextprot.org Oxford University Press 2021-07-28 /pmc/articles/PMC8317215/ /pubmed/34318869 http://dx.doi.org/10.1093/database/baab046 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Duek, Paula
Mary, Camille
Zahn-Zabal, Monique
Bairoch, Amos
Lane, Lydie
Functionathon: a manual data mining workflow to generate functional hypotheses for uncharacterized human proteins and its application by undergraduate students
title Functionathon: a manual data mining workflow to generate functional hypotheses for uncharacterized human proteins and its application by undergraduate students
title_full Functionathon: a manual data mining workflow to generate functional hypotheses for uncharacterized human proteins and its application by undergraduate students
title_fullStr Functionathon: a manual data mining workflow to generate functional hypotheses for uncharacterized human proteins and its application by undergraduate students
title_full_unstemmed Functionathon: a manual data mining workflow to generate functional hypotheses for uncharacterized human proteins and its application by undergraduate students
title_short Functionathon: a manual data mining workflow to generate functional hypotheses for uncharacterized human proteins and its application by undergraduate students
title_sort functionathon: a manual data mining workflow to generate functional hypotheses for uncharacterized human proteins and its application by undergraduate students
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8317215/
https://www.ncbi.nlm.nih.gov/pubmed/34318869
http://dx.doi.org/10.1093/database/baab046
work_keys_str_mv AT duekpaula functionathonamanualdataminingworkflowtogeneratefunctionalhypothesesforuncharacterizedhumanproteinsanditsapplicationbyundergraduatestudents
AT marycamille functionathonamanualdataminingworkflowtogeneratefunctionalhypothesesforuncharacterizedhumanproteinsanditsapplicationbyundergraduatestudents
AT zahnzabalmonique functionathonamanualdataminingworkflowtogeneratefunctionalhypothesesforuncharacterizedhumanproteinsanditsapplicationbyundergraduatestudents
AT bairochamos functionathonamanualdataminingworkflowtogeneratefunctionalhypothesesforuncharacterizedhumanproteinsanditsapplicationbyundergraduatestudents
AT lanelydie functionathonamanualdataminingworkflowtogeneratefunctionalhypothesesforuncharacterizedhumanproteinsanditsapplicationbyundergraduatestudents