Cargando…

Elucidating the functional roles of prokaryotic proteins using big data and artificial intelligence

Annotating protein sequences according to their biological functions is one of the key steps in understanding microbial diversity, metabolic potentials, and evolutionary histories. However, even in the best-studied prokaryotic genomes, not all proteins can be characterized by classical in vivo, in v...

Descripción completa

Detalles Bibliográficos
Autores principales: Ardern, Zachary, Chakraborty, Sagarika, Lenk, Florian, Kaster, Anne-Kristin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9960493/
https://www.ncbi.nlm.nih.gov/pubmed/36725215
http://dx.doi.org/10.1093/femsre/fuad003
_version_ 1784895526298189824
author Ardern, Zachary
Chakraborty, Sagarika
Lenk, Florian
Kaster, Anne-Kristin
author_facet Ardern, Zachary
Chakraborty, Sagarika
Lenk, Florian
Kaster, Anne-Kristin
author_sort Ardern, Zachary
collection PubMed
description Annotating protein sequences according to their biological functions is one of the key steps in understanding microbial diversity, metabolic potentials, and evolutionary histories. However, even in the best-studied prokaryotic genomes, not all proteins can be characterized by classical in vivo, in vitro, and/or in silico methods—a challenge rapidly growing alongside the advent of next-generation sequencing technologies and their enormous extension of ‘omics’ data in public databases. These so-called hypothetical proteins (HPs) represent a huge knowledge gap and hidden potential for biotechnological applications. Opportunities for leveraging the available ‘Big Data’ have recently proliferated with the use of artificial intelligence (AI). Here, we review the aims and methods of protein annotation and explain the different principles behind machine and deep learning algorithms including recent research examples, in order to assist both biologists wishing to apply AI tools in developing comprehensive genome annotations and computer scientists who want to contribute to this leading edge of biological research.
format Online
Article
Text
id pubmed-9960493
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-99604932023-02-26 Elucidating the functional roles of prokaryotic proteins using big data and artificial intelligence Ardern, Zachary Chakraborty, Sagarika Lenk, Florian Kaster, Anne-Kristin FEMS Microbiol Rev Review Article Annotating protein sequences according to their biological functions is one of the key steps in understanding microbial diversity, metabolic potentials, and evolutionary histories. However, even in the best-studied prokaryotic genomes, not all proteins can be characterized by classical in vivo, in vitro, and/or in silico methods—a challenge rapidly growing alongside the advent of next-generation sequencing technologies and their enormous extension of ‘omics’ data in public databases. These so-called hypothetical proteins (HPs) represent a huge knowledge gap and hidden potential for biotechnological applications. Opportunities for leveraging the available ‘Big Data’ have recently proliferated with the use of artificial intelligence (AI). Here, we review the aims and methods of protein annotation and explain the different principles behind machine and deep learning algorithms including recent research examples, in order to assist both biologists wishing to apply AI tools in developing comprehensive genome annotations and computer scientists who want to contribute to this leading edge of biological research. Oxford University Press 2023-02-01 /pmc/articles/PMC9960493/ /pubmed/36725215 http://dx.doi.org/10.1093/femsre/fuad003 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of FEMS. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs licence (https://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Review Article
Ardern, Zachary
Chakraborty, Sagarika
Lenk, Florian
Kaster, Anne-Kristin
Elucidating the functional roles of prokaryotic proteins using big data and artificial intelligence
title Elucidating the functional roles of prokaryotic proteins using big data and artificial intelligence
title_full Elucidating the functional roles of prokaryotic proteins using big data and artificial intelligence
title_fullStr Elucidating the functional roles of prokaryotic proteins using big data and artificial intelligence
title_full_unstemmed Elucidating the functional roles of prokaryotic proteins using big data and artificial intelligence
title_short Elucidating the functional roles of prokaryotic proteins using big data and artificial intelligence
title_sort elucidating the functional roles of prokaryotic proteins using big data and artificial intelligence
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9960493/
https://www.ncbi.nlm.nih.gov/pubmed/36725215
http://dx.doi.org/10.1093/femsre/fuad003
work_keys_str_mv AT ardernzachary elucidatingthefunctionalrolesofprokaryoticproteinsusingbigdataandartificialintelligence
AT chakrabortysagarika elucidatingthefunctionalrolesofprokaryoticproteinsusingbigdataandartificialintelligence
AT lenkflorian elucidatingthefunctionalrolesofprokaryoticproteinsusingbigdataandartificialintelligence
AT kasterannekristin elucidatingthefunctionalrolesofprokaryoticproteinsusingbigdataandartificialintelligence