Cargando…

ENZYMAP: Exploiting Protein Annotation for Modeling and Predicting EC Number Changes in UniProt/Swiss-Prot

The volume and diversity of biological data are increasing at very high rates. Vast amounts of protein sequences and structures, protein and genetic interactions and phenotype studies have been produced. The majority of data generated by high-throughput devices is automatically annotated because man...

Descripción completa

Detalles Bibliográficos
Autores principales: Silveira, Sabrina de Azevedo, de Melo-Minardi, Raquel Cardoso, da Silveira, Carlos Henrique, Santoro, Marcelo Matos, Meira Jr, Wagner
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3929618/
https://www.ncbi.nlm.nih.gov/pubmed/24586563
http://dx.doi.org/10.1371/journal.pone.0089162
_version_ 1782304417824899072
author Silveira, Sabrina de Azevedo
de Melo-Minardi, Raquel Cardoso
da Silveira, Carlos Henrique
Santoro, Marcelo Matos
Meira Jr, Wagner
author_facet Silveira, Sabrina de Azevedo
de Melo-Minardi, Raquel Cardoso
da Silveira, Carlos Henrique
Santoro, Marcelo Matos
Meira Jr, Wagner
author_sort Silveira, Sabrina de Azevedo
collection PubMed
description The volume and diversity of biological data are increasing at very high rates. Vast amounts of protein sequences and structures, protein and genetic interactions and phenotype studies have been produced. The majority of data generated by high-throughput devices is automatically annotated because manually annotating them is not possible. Thus, efficient and precise automatic annotation methods are required to ensure the quality and reliability of both the biological data and associated annotations. We proposed ENZYMatic Annotation Predictor (ENZYMAP), a technique to characterize and predict EC number changes based on annotations from UniProt/Swiss-Prot using a supervised learning approach. We evaluated ENZYMAP experimentally, using test data sets from both UniProt/Swiss-Prot and UniProt/TrEMBL, and showed that predicting EC changes using selected types of annotation is possible. Finally, we compared ENZYMAP and DETECT with respect to their predictions and checked both against the UniProt/Swiss-Prot annotations. ENZYMAP was shown to be more accurate than DETECT, coming closer to the actual changes in UniProt/Swiss-Prot. Our proposal is intended to be an automatic complementary method (that can be used together with other techniques like the ones based on protein sequence and structure) that helps to improve the quality and reliability of enzyme annotations over time, suggesting possible corrections, anticipating annotation changes and propagating the implicit knowledge for the whole dataset.
format Online
Article
Text
id pubmed-3929618
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-39296182014-02-25 ENZYMAP: Exploiting Protein Annotation for Modeling and Predicting EC Number Changes in UniProt/Swiss-Prot Silveira, Sabrina de Azevedo de Melo-Minardi, Raquel Cardoso da Silveira, Carlos Henrique Santoro, Marcelo Matos Meira Jr, Wagner PLoS One Research Article The volume and diversity of biological data are increasing at very high rates. Vast amounts of protein sequences and structures, protein and genetic interactions and phenotype studies have been produced. The majority of data generated by high-throughput devices is automatically annotated because manually annotating them is not possible. Thus, efficient and precise automatic annotation methods are required to ensure the quality and reliability of both the biological data and associated annotations. We proposed ENZYMatic Annotation Predictor (ENZYMAP), a technique to characterize and predict EC number changes based on annotations from UniProt/Swiss-Prot using a supervised learning approach. We evaluated ENZYMAP experimentally, using test data sets from both UniProt/Swiss-Prot and UniProt/TrEMBL, and showed that predicting EC changes using selected types of annotation is possible. Finally, we compared ENZYMAP and DETECT with respect to their predictions and checked both against the UniProt/Swiss-Prot annotations. ENZYMAP was shown to be more accurate than DETECT, coming closer to the actual changes in UniProt/Swiss-Prot. Our proposal is intended to be an automatic complementary method (that can be used together with other techniques like the ones based on protein sequence and structure) that helps to improve the quality and reliability of enzyme annotations over time, suggesting possible corrections, anticipating annotation changes and propagating the implicit knowledge for the whole dataset. Public Library of Science 2014-02-19 /pmc/articles/PMC3929618/ /pubmed/24586563 http://dx.doi.org/10.1371/journal.pone.0089162 Text en © 2014 Silveira et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Silveira, Sabrina de Azevedo
de Melo-Minardi, Raquel Cardoso
da Silveira, Carlos Henrique
Santoro, Marcelo Matos
Meira Jr, Wagner
ENZYMAP: Exploiting Protein Annotation for Modeling and Predicting EC Number Changes in UniProt/Swiss-Prot
title ENZYMAP: Exploiting Protein Annotation for Modeling and Predicting EC Number Changes in UniProt/Swiss-Prot
title_full ENZYMAP: Exploiting Protein Annotation for Modeling and Predicting EC Number Changes in UniProt/Swiss-Prot
title_fullStr ENZYMAP: Exploiting Protein Annotation for Modeling and Predicting EC Number Changes in UniProt/Swiss-Prot
title_full_unstemmed ENZYMAP: Exploiting Protein Annotation for Modeling and Predicting EC Number Changes in UniProt/Swiss-Prot
title_short ENZYMAP: Exploiting Protein Annotation for Modeling and Predicting EC Number Changes in UniProt/Swiss-Prot
title_sort enzymap: exploiting protein annotation for modeling and predicting ec number changes in uniprot/swiss-prot
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3929618/
https://www.ncbi.nlm.nih.gov/pubmed/24586563
http://dx.doi.org/10.1371/journal.pone.0089162
work_keys_str_mv AT silveirasabrinadeazevedo enzymapexploitingproteinannotationformodelingandpredictingecnumberchangesinuniprotswissprot
AT demelominardiraquelcardoso enzymapexploitingproteinannotationformodelingandpredictingecnumberchangesinuniprotswissprot
AT dasilveiracarloshenrique enzymapexploitingproteinannotationformodelingandpredictingecnumberchangesinuniprotswissprot
AT santoromarcelomatos enzymapexploitingproteinannotationformodelingandpredictingecnumberchangesinuniprotswissprot
AT meirajrwagner enzymapexploitingproteinannotationformodelingandpredictingecnumberchangesinuniprotswissprot