Cargando…

Functional annotation of enzyme-encoding genes using deep learning with transformer layers

Functional annotation of open reading frames in microbial genomes remains substantially incomplete. Enzymes constitute the most prevalent functional gene class in microbial genomes and can be described by their specific catalytic functions using the Enzyme Commission (EC) number. Consequently, the a...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Gi Bae, Kim, Ji Yeon, Lee, Jong An, Norsigian, Charles J., Palsson, Bernhard O., Lee, Sang Yup
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10645960/
https://www.ncbi.nlm.nih.gov/pubmed/37963869
http://dx.doi.org/10.1038/s41467-023-43216-z
_version_ 1785147422104616960
author Kim, Gi Bae
Kim, Ji Yeon
Lee, Jong An
Norsigian, Charles J.
Palsson, Bernhard O.
Lee, Sang Yup
author_facet Kim, Gi Bae
Kim, Ji Yeon
Lee, Jong An
Norsigian, Charles J.
Palsson, Bernhard O.
Lee, Sang Yup
author_sort Kim, Gi Bae
collection PubMed
description Functional annotation of open reading frames in microbial genomes remains substantially incomplete. Enzymes constitute the most prevalent functional gene class in microbial genomes and can be described by their specific catalytic functions using the Enzyme Commission (EC) number. Consequently, the ability to predict EC numbers could substantially reduce the number of un-annotated genes. Here we present a deep learning model, DeepECtransformer, which utilizes transformer layers as a neural network architecture to predict EC numbers. Using the extensively studied Escherichia coli K-12 MG1655 genome, DeepECtransformer predicted EC numbers for 464 un-annotated genes. We experimentally validated the enzymatic activities predicted for three proteins (YgfF, YciO, and YjdM). Further examination of the neural network’s reasoning process revealed that the trained neural network relies on functional motifs of enzymes to predict EC numbers. Thus, DeepECtransformer is a method that facilitates the functional annotation of uncharacterized genes.
format Online
Article
Text
id pubmed-10645960
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-106459602023-11-14 Functional annotation of enzyme-encoding genes using deep learning with transformer layers Kim, Gi Bae Kim, Ji Yeon Lee, Jong An Norsigian, Charles J. Palsson, Bernhard O. Lee, Sang Yup Nat Commun Article Functional annotation of open reading frames in microbial genomes remains substantially incomplete. Enzymes constitute the most prevalent functional gene class in microbial genomes and can be described by their specific catalytic functions using the Enzyme Commission (EC) number. Consequently, the ability to predict EC numbers could substantially reduce the number of un-annotated genes. Here we present a deep learning model, DeepECtransformer, which utilizes transformer layers as a neural network architecture to predict EC numbers. Using the extensively studied Escherichia coli K-12 MG1655 genome, DeepECtransformer predicted EC numbers for 464 un-annotated genes. We experimentally validated the enzymatic activities predicted for three proteins (YgfF, YciO, and YjdM). Further examination of the neural network’s reasoning process revealed that the trained neural network relies on functional motifs of enzymes to predict EC numbers. Thus, DeepECtransformer is a method that facilitates the functional annotation of uncharacterized genes. Nature Publishing Group UK 2023-11-14 /pmc/articles/PMC10645960/ /pubmed/37963869 http://dx.doi.org/10.1038/s41467-023-43216-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Kim, Gi Bae
Kim, Ji Yeon
Lee, Jong An
Norsigian, Charles J.
Palsson, Bernhard O.
Lee, Sang Yup
Functional annotation of enzyme-encoding genes using deep learning with transformer layers
title Functional annotation of enzyme-encoding genes using deep learning with transformer layers
title_full Functional annotation of enzyme-encoding genes using deep learning with transformer layers
title_fullStr Functional annotation of enzyme-encoding genes using deep learning with transformer layers
title_full_unstemmed Functional annotation of enzyme-encoding genes using deep learning with transformer layers
title_short Functional annotation of enzyme-encoding genes using deep learning with transformer layers
title_sort functional annotation of enzyme-encoding genes using deep learning with transformer layers
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10645960/
https://www.ncbi.nlm.nih.gov/pubmed/37963869
http://dx.doi.org/10.1038/s41467-023-43216-z
work_keys_str_mv AT kimgibae functionalannotationofenzymeencodinggenesusingdeeplearningwithtransformerlayers
AT kimjiyeon functionalannotationofenzymeencodinggenesusingdeeplearningwithtransformerlayers
AT leejongan functionalannotationofenzymeencodinggenesusingdeeplearningwithtransformerlayers
AT norsigiancharlesj functionalannotationofenzymeencodinggenesusingdeeplearningwithtransformerlayers
AT palssonbernhardo functionalannotationofenzymeencodinggenesusingdeeplearningwithtransformerlayers
AT leesangyup functionalannotationofenzymeencodinggenesusingdeeplearningwithtransformerlayers