Cargando…
ENZYMAP: Exploiting Protein Annotation for Modeling and Predicting EC Number Changes in UniProt/Swiss-Prot
The volume and diversity of biological data are increasing at very high rates. Vast amounts of protein sequences and structures, protein and genetic interactions and phenotype studies have been produced. The majority of data generated by high-throughput devices is automatically annotated because man...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3929618/ https://www.ncbi.nlm.nih.gov/pubmed/24586563 http://dx.doi.org/10.1371/journal.pone.0089162 |
_version_ | 1782304417824899072 |
---|---|
author | Silveira, Sabrina de Azevedo de Melo-Minardi, Raquel Cardoso da Silveira, Carlos Henrique Santoro, Marcelo Matos Meira Jr, Wagner |
author_facet | Silveira, Sabrina de Azevedo de Melo-Minardi, Raquel Cardoso da Silveira, Carlos Henrique Santoro, Marcelo Matos Meira Jr, Wagner |
author_sort | Silveira, Sabrina de Azevedo |
collection | PubMed |
description | The volume and diversity of biological data are increasing at very high rates. Vast amounts of protein sequences and structures, protein and genetic interactions and phenotype studies have been produced. The majority of data generated by high-throughput devices is automatically annotated because manually annotating them is not possible. Thus, efficient and precise automatic annotation methods are required to ensure the quality and reliability of both the biological data and associated annotations. We proposed ENZYMatic Annotation Predictor (ENZYMAP), a technique to characterize and predict EC number changes based on annotations from UniProt/Swiss-Prot using a supervised learning approach. We evaluated ENZYMAP experimentally, using test data sets from both UniProt/Swiss-Prot and UniProt/TrEMBL, and showed that predicting EC changes using selected types of annotation is possible. Finally, we compared ENZYMAP and DETECT with respect to their predictions and checked both against the UniProt/Swiss-Prot annotations. ENZYMAP was shown to be more accurate than DETECT, coming closer to the actual changes in UniProt/Swiss-Prot. Our proposal is intended to be an automatic complementary method (that can be used together with other techniques like the ones based on protein sequence and structure) that helps to improve the quality and reliability of enzyme annotations over time, suggesting possible corrections, anticipating annotation changes and propagating the implicit knowledge for the whole dataset. |
format | Online Article Text |
id | pubmed-3929618 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-39296182014-02-25 ENZYMAP: Exploiting Protein Annotation for Modeling and Predicting EC Number Changes in UniProt/Swiss-Prot Silveira, Sabrina de Azevedo de Melo-Minardi, Raquel Cardoso da Silveira, Carlos Henrique Santoro, Marcelo Matos Meira Jr, Wagner PLoS One Research Article The volume and diversity of biological data are increasing at very high rates. Vast amounts of protein sequences and structures, protein and genetic interactions and phenotype studies have been produced. The majority of data generated by high-throughput devices is automatically annotated because manually annotating them is not possible. Thus, efficient and precise automatic annotation methods are required to ensure the quality and reliability of both the biological data and associated annotations. We proposed ENZYMatic Annotation Predictor (ENZYMAP), a technique to characterize and predict EC number changes based on annotations from UniProt/Swiss-Prot using a supervised learning approach. We evaluated ENZYMAP experimentally, using test data sets from both UniProt/Swiss-Prot and UniProt/TrEMBL, and showed that predicting EC changes using selected types of annotation is possible. Finally, we compared ENZYMAP and DETECT with respect to their predictions and checked both against the UniProt/Swiss-Prot annotations. ENZYMAP was shown to be more accurate than DETECT, coming closer to the actual changes in UniProt/Swiss-Prot. Our proposal is intended to be an automatic complementary method (that can be used together with other techniques like the ones based on protein sequence and structure) that helps to improve the quality and reliability of enzyme annotations over time, suggesting possible corrections, anticipating annotation changes and propagating the implicit knowledge for the whole dataset. Public Library of Science 2014-02-19 /pmc/articles/PMC3929618/ /pubmed/24586563 http://dx.doi.org/10.1371/journal.pone.0089162 Text en © 2014 Silveira et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Silveira, Sabrina de Azevedo de Melo-Minardi, Raquel Cardoso da Silveira, Carlos Henrique Santoro, Marcelo Matos Meira Jr, Wagner ENZYMAP: Exploiting Protein Annotation for Modeling and Predicting EC Number Changes in UniProt/Swiss-Prot |
title | ENZYMAP: Exploiting Protein Annotation for Modeling and Predicting EC Number Changes in UniProt/Swiss-Prot |
title_full | ENZYMAP: Exploiting Protein Annotation for Modeling and Predicting EC Number Changes in UniProt/Swiss-Prot |
title_fullStr | ENZYMAP: Exploiting Protein Annotation for Modeling and Predicting EC Number Changes in UniProt/Swiss-Prot |
title_full_unstemmed | ENZYMAP: Exploiting Protein Annotation for Modeling and Predicting EC Number Changes in UniProt/Swiss-Prot |
title_short | ENZYMAP: Exploiting Protein Annotation for Modeling and Predicting EC Number Changes in UniProt/Swiss-Prot |
title_sort | enzymap: exploiting protein annotation for modeling and predicting ec number changes in uniprot/swiss-prot |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3929618/ https://www.ncbi.nlm.nih.gov/pubmed/24586563 http://dx.doi.org/10.1371/journal.pone.0089162 |
work_keys_str_mv | AT silveirasabrinadeazevedo enzymapexploitingproteinannotationformodelingandpredictingecnumberchangesinuniprotswissprot AT demelominardiraquelcardoso enzymapexploitingproteinannotationformodelingandpredictingecnumberchangesinuniprotswissprot AT dasilveiracarloshenrique enzymapexploitingproteinannotationformodelingandpredictingecnumberchangesinuniprotswissprot AT santoromarcelomatos enzymapexploitingproteinannotationformodelingandpredictingecnumberchangesinuniprotswissprot AT meirajrwagner enzymapexploitingproteinannotationformodelingandpredictingecnumberchangesinuniprotswissprot |