
EnzyMiner: automatic identification of protein level mutations and their impact on target enzymes from PubMed abstracts

BACKGROUND: A better understanding of the mechanisms of an enzyme's functionality and stability, as well as knowledge of mutations and their impact, is crucial for researchers working with enzymes. Although several enzyme databases are currently available, the scientific literature still rema...


Bibliographic Details
Main Authors: Yeniterzi, Süveyda; Sezerman, Uğur
Format: Text
Language: English
Published: BioMed Central 2009
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2745584/
https://www.ncbi.nlm.nih.gov/pubmed/19758466
http://dx.doi.org/10.1186/1471-2105-10-S8-S2
_version_ 1782171977805463552
author Yeniterzi, Süveyda
Sezerman, Uğur
collection PubMed
description BACKGROUND: A better understanding of the mechanisms of an enzyme's functionality and stability, as well as knowledge of mutations and their impact, is crucial for researchers working with enzymes. Although several enzyme databases are currently available, the scientific literature still remains the main up-to-date source for learning the effects of a mutation on an enzyme. However, going through vast amounts of scientific documents to extract information on a desired mutation has always been a time-consuming process. In this paper, therefore, we describe a unique method, termed EnzyMiner, which automatically identifies the PubMed abstracts that contain information on the impact of a protein-level mutation on the stability and/or activity of a given enzyme. RESULTS: We present an automated system that identifies the abstracts containing an amino-acid-level mutation and then classifies them according to the mutation's effect on the enzyme. For mutation identification, MuGeX, an automated mutation-gene extraction system, has an accuracy of 93.1% with an F-measure of 91.5. For impact analysis, document classification is performed to identify the abstracts that report a change in the enzyme's stability or activity resulting from the mutation. The system was trained on lipases and tested on amylases, with an accuracy of 85%. CONCLUSION: EnzyMiner identifies the abstracts that contain a protein mutation for a given enzyme and, with the help of information extraction and machine learning techniques, checks whether each abstract is related to a disease. For disease-related abstracts, the mutation list and direct links to the abstracts are retrieved from the system and displayed on the Web. For abstracts that are not disease-related, the system provides the mutation list and also categorizes the abstracts into two groups, according to whether the mutation affects the enzyme's stability or its functionality; these groups are likewise displayed on the Web.
format Text
id pubmed-2745584
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-2745584 2009-09-18 EnzyMiner: automatic identification of protein level mutations and their impact on target enzymes from PubMed abstracts Yeniterzi, Süveyda Sezerman, Uğur BMC Bioinformatics Research BioMed Central 2009-08-27 /pmc/articles/PMC2745584/ /pubmed/19758466 http://dx.doi.org/10.1186/1471-2105-10-S8-S2 Text en Copyright © 2009 Yeniterzi and Sezerman; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title EnzyMiner: automatic identification of protein level mutations and their impact on target enzymes from PubMed abstracts
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2745584/
https://www.ncbi.nlm.nih.gov/pubmed/19758466
http://dx.doi.org/10.1186/1471-2105-10-S8-S2