Cargando…

Extraction of Protein Interaction Data: A Comparative Analysis of Methods in Use

Several natural language processing tools, both commercial and freely available, are used to extract protein interactions from publications. Methods used by these tools include pattern matching to dynamic programming with individual recall and precision rates. A methodical survey of these tools, kee...

Descripción completa

Detalles Bibliográficos
Autores principales: Jose, Hena, Vadivukarasi, Thangavel, Devakumar, Jyothi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3171344/
https://www.ncbi.nlm.nih.gov/pubmed/18274648
http://dx.doi.org/10.1155/2007/53096
_version_ 1782211741319430144
author Jose, Hena
Vadivukarasi, Thangavel
Devakumar, Jyothi
author_facet Jose, Hena
Vadivukarasi, Thangavel
Devakumar, Jyothi
author_sort Jose, Hena
collection PubMed
description Several natural language processing tools, both commercial and freely available, are used to extract protein interactions from publications. Methods used by these tools include pattern matching to dynamic programming with individual recall and precision rates. A methodical survey of these tools, keeping in mind the minimum interaction information a researcher would need, in comparison to manual analysis has not been carried out. We compared data generated using some of the selected NLP tools with manually curated protein interaction data (PathArt and IMaps) to comparatively determine the recall and precision rate. The rates were found to be lower than the published scores when a normalized definition for interaction is considered. Each data point captured wrongly or not picked up by the tool was analyzed. Our evaluation brings forth critical failures of NLP tools and provides pointers for the development of an ideal NLP tool.
format Online
Article
Text
id pubmed-3171344
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Springer
record_format MEDLINE/PubMed
spelling pubmed-31713442011-09-13 Extraction of Protein Interaction Data: A Comparative Analysis of Methods in Use Jose, Hena Vadivukarasi, Thangavel Devakumar, Jyothi EURASIP J Bioinform Syst Biol Research Article Several natural language processing tools, both commercial and freely available, are used to extract protein interactions from publications. Methods used by these tools include pattern matching to dynamic programming with individual recall and precision rates. A methodical survey of these tools, keeping in mind the minimum interaction information a researcher would need, in comparison to manual analysis has not been carried out. We compared data generated using some of the selected NLP tools with manually curated protein interaction data (PathArt and IMaps) to comparatively determine the recall and precision rate. The rates were found to be lower than the published scores when a normalized definition for interaction is considered. Each data point captured wrongly or not picked up by the tool was analyzed. Our evaluation brings forth critical failures of NLP tools and provides pointers for the development of an ideal NLP tool. Springer 2007-12-09 /pmc/articles/PMC3171344/ /pubmed/18274648 http://dx.doi.org/10.1155/2007/53096 Text en Copyright © 2007 Hena Jose et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Jose, Hena
Vadivukarasi, Thangavel
Devakumar, Jyothi
Extraction of Protein Interaction Data: A Comparative Analysis of Methods in Use
title Extraction of Protein Interaction Data: A Comparative Analysis of Methods in Use
title_full Extraction of Protein Interaction Data: A Comparative Analysis of Methods in Use
title_fullStr Extraction of Protein Interaction Data: A Comparative Analysis of Methods in Use
title_full_unstemmed Extraction of Protein Interaction Data: A Comparative Analysis of Methods in Use
title_short Extraction of Protein Interaction Data: A Comparative Analysis of Methods in Use
title_sort extraction of protein interaction data: a comparative analysis of methods in use
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3171344/
https://www.ncbi.nlm.nih.gov/pubmed/18274648
http://dx.doi.org/10.1155/2007/53096
work_keys_str_mv AT josehena extractionofproteininteractiondataacomparativeanalysisofmethodsinuse
AT vadivukarasithangavel extractionofproteininteractiondataacomparativeanalysisofmethodsinuse
AT devakumarjyothi extractionofproteininteractiondataacomparativeanalysisofmethodsinuse