Cargando…
Pressing needs of biomedical text mining in biocuration and beyond: opportunities and challenges
Text mining in the biomedical sciences is rapidly transitioning from small-scale evaluation to large-scale application. In this article, we argue that text-mining technologies have become essential tools in real-world biomedical research. We describe four large scale applications of text mining, as...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5199160/ https://www.ncbi.nlm.nih.gov/pubmed/28025348 http://dx.doi.org/10.1093/database/baw161 |
_version_ | 1782488958764056576 |
---|---|
author | Singhal, Ayush Leaman, Robert Catlett, Natalie Lemberger, Thomas McEntyre, Johanna Polson, Shawn Xenarios, Ioannis Arighi, Cecilia Lu, Zhiyong |
author_facet | Singhal, Ayush Leaman, Robert Catlett, Natalie Lemberger, Thomas McEntyre, Johanna Polson, Shawn Xenarios, Ioannis Arighi, Cecilia Lu, Zhiyong |
author_sort | Singhal, Ayush |
collection | PubMed |
description | Text mining in the biomedical sciences is rapidly transitioning from small-scale evaluation to large-scale application. In this article, we argue that text-mining technologies have become essential tools in real-world biomedical research. We describe four large scale applications of text mining, as showcased during a recent panel discussion at the BioCreative V Challenge Workshop. We draw on these applications as case studies to characterize common requirements for successfully applying text-mining techniques to practical biocuration needs. We note that system ‘accuracy’ remains a challenge and identify several additional common difficulties and potential research directions including (i) the ‘scalability’ issue due to the increasing need of mining information from millions of full-text articles, (ii) the ‘interoperability’ issue of integrating various text-mining systems into existing curation workflows and (iii) the ‘reusability’ issue on the difficulty of applying trained systems to text genres that are not seen previously during development. We then describe related efforts within the text-mining community, with a special focus on the BioCreative series of challenge workshops. We believe that focusing on the near-term challenges identified in this work will amplify the opportunities afforded by the continued adoption of text-mining tools. Finally, in order to sustain the curation ecosystem and have text-mining systems adopted for practical benefits, we call for increased collaboration between text-mining researchers and various stakeholders, including researchers, publishers and biocurators. |
format | Online Article Text |
id | pubmed-5199160 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-51991602017-01-06 Pressing needs of biomedical text mining in biocuration and beyond: opportunities and challenges Singhal, Ayush Leaman, Robert Catlett, Natalie Lemberger, Thomas McEntyre, Johanna Polson, Shawn Xenarios, Ioannis Arighi, Cecilia Lu, Zhiyong Database (Oxford) Perspective Text mining in the biomedical sciences is rapidly transitioning from small-scale evaluation to large-scale application. In this article, we argue that text-mining technologies have become essential tools in real-world biomedical research. We describe four large scale applications of text mining, as showcased during a recent panel discussion at the BioCreative V Challenge Workshop. We draw on these applications as case studies to characterize common requirements for successfully applying text-mining techniques to practical biocuration needs. We note that system ‘accuracy’ remains a challenge and identify several additional common difficulties and potential research directions including (i) the ‘scalability’ issue due to the increasing need of mining information from millions of full-text articles, (ii) the ‘interoperability’ issue of integrating various text-mining systems into existing curation workflows and (iii) the ‘reusability’ issue on the difficulty of applying trained systems to text genres that are not seen previously during development. We then describe related efforts within the text-mining community, with a special focus on the BioCreative series of challenge workshops. We believe that focusing on the near-term challenges identified in this work will amplify the opportunities afforded by the continued adoption of text-mining tools. Finally, in order to sustain the curation ecosystem and have text-mining systems adopted for practical benefits, we call for increased collaboration between text-mining researchers and various stakeholders, including researchers, publishers and biocurators. Oxford University Press 2016-12-26 /pmc/articles/PMC5199160/ /pubmed/28025348 http://dx.doi.org/10.1093/database/baw161 Text en Published by Oxford University Press 2016. This work is written by US Government employees and is in the public domain in the US. |
spellingShingle | Perspective Singhal, Ayush Leaman, Robert Catlett, Natalie Lemberger, Thomas McEntyre, Johanna Polson, Shawn Xenarios, Ioannis Arighi, Cecilia Lu, Zhiyong Pressing needs of biomedical text mining in biocuration and beyond: opportunities and challenges |
title | Pressing needs of biomedical text mining in biocuration and beyond: opportunities and challenges |
title_full | Pressing needs of biomedical text mining in biocuration and beyond: opportunities and challenges |
title_fullStr | Pressing needs of biomedical text mining in biocuration and beyond: opportunities and challenges |
title_full_unstemmed | Pressing needs of biomedical text mining in biocuration and beyond: opportunities and challenges |
title_short | Pressing needs of biomedical text mining in biocuration and beyond: opportunities and challenges |
title_sort | pressing needs of biomedical text mining in biocuration and beyond: opportunities and challenges |
topic | Perspective |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5199160/ https://www.ncbi.nlm.nih.gov/pubmed/28025348 http://dx.doi.org/10.1093/database/baw161 |
work_keys_str_mv | AT singhalayush pressingneedsofbiomedicaltextmininginbiocurationandbeyondopportunitiesandchallenges AT leamanrobert pressingneedsofbiomedicaltextmininginbiocurationandbeyondopportunitiesandchallenges AT catlettnatalie pressingneedsofbiomedicaltextmininginbiocurationandbeyondopportunitiesandchallenges AT lembergerthomas pressingneedsofbiomedicaltextmininginbiocurationandbeyondopportunitiesandchallenges AT mcentyrejohanna pressingneedsofbiomedicaltextmininginbiocurationandbeyondopportunitiesandchallenges AT polsonshawn pressingneedsofbiomedicaltextmininginbiocurationandbeyondopportunitiesandchallenges AT xenariosioannis pressingneedsofbiomedicaltextmininginbiocurationandbeyondopportunitiesandchallenges AT arighicecilia pressingneedsofbiomedicaltextmininginbiocurationandbeyondopportunitiesandchallenges AT luzhiyong pressingneedsofbiomedicaltextmininginbiocurationandbeyondopportunitiesandchallenges |