Cargando…

What the papers say: Text mining for genomics and systems biology

Keeping up with the rapidly growing literature has become virtually impossible for most scientists. This can have dire consequences. First, we may waste research time and resources on reinventing the wheel simply because we can no longer maintain a reliable grasp on the published literature. Second,...

Descripción completa

Detalles Bibliográficos
Autores principales: Harmston, Nathan, Filsell, Wendy, Stumpf, Michael PH
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3500154/
https://www.ncbi.nlm.nih.gov/pubmed/21106487
http://dx.doi.org/10.1186/1479-7364-5-1-17
_version_ 1782250064180150272
author Harmston, Nathan
Filsell, Wendy
Stumpf, Michael PH
author_facet Harmston, Nathan
Filsell, Wendy
Stumpf, Michael PH
author_sort Harmston, Nathan
collection PubMed
description Keeping up with the rapidly growing literature has become virtually impossible for most scientists. This can have dire consequences. First, we may waste research time and resources on reinventing the wheel simply because we can no longer maintain a reliable grasp on the published literature. Second, and perhaps more detrimental, judicious (or serendipitous) combination of knowledge from different scientific disciplines, which would require following disparate and distinct research literatures, is rapidly becoming impossible for even the most ardent readers of research publications. Text mining -- the automated extraction of information from (electronically) published sources -- could potentially fulfil an important role -- but only if we know how to harness its strengths and overcome its weaknesses. As we do not expect that the rate at which scientific results are published will decrease, text mining tools are now becoming essential in order to cope with, and derive maximum benefit from, this information explosion. In genomics, this is particularly pressing as more and more rare disease-causing variants are found and need to be understood. Not being conversant with this technology may put scientists and biomedical regulators at a severe disadvantage. In this review, we introduce the basic concepts underlying modern text mining and its applications in genomics and systems biology. We hope that this review will serve three purposes: (i) to provide a timely and useful overview of the current status of this field, including a survey of present challenges; (ii) to enable researchers to decide how and when to apply text mining tools in their own research; and (iii) to highlight how the research communities in genomics and systems biology can help to make text mining from biomedical abstracts and texts more straightforward.
format Online
Article
Text
id pubmed-3500154
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35001542012-11-17 What the papers say: Text mining for genomics and systems biology Harmston, Nathan Filsell, Wendy Stumpf, Michael PH Hum Genomics Review Keeping up with the rapidly growing literature has become virtually impossible for most scientists. This can have dire consequences. First, we may waste research time and resources on reinventing the wheel simply because we can no longer maintain a reliable grasp on the published literature. Second, and perhaps more detrimental, judicious (or serendipitous) combination of knowledge from different scientific disciplines, which would require following disparate and distinct research literatures, is rapidly becoming impossible for even the most ardent readers of research publications. Text mining -- the automated extraction of information from (electronically) published sources -- could potentially fulfil an important role -- but only if we know how to harness its strengths and overcome its weaknesses. As we do not expect that the rate at which scientific results are published will decrease, text mining tools are now becoming essential in order to cope with, and derive maximum benefit from, this information explosion. In genomics, this is particularly pressing as more and more rare disease-causing variants are found and need to be understood. Not being conversant with this technology may put scientists and biomedical regulators at a severe disadvantage. In this review, we introduce the basic concepts underlying modern text mining and its applications in genomics and systems biology. We hope that this review will serve three purposes: (i) to provide a timely and useful overview of the current status of this field, including a survey of present challenges; (ii) to enable researchers to decide how and when to apply text mining tools in their own research; and (iii) to highlight how the research communities in genomics and systems biology can help to make text mining from biomedical abstracts and texts more straightforward. BioMed Central 2010-10-01 /pmc/articles/PMC3500154/ /pubmed/21106487 http://dx.doi.org/10.1186/1479-7364-5-1-17 Text en Copyright ©2010 Henry Stewart Publications
spellingShingle Review
Harmston, Nathan
Filsell, Wendy
Stumpf, Michael PH
What the papers say: Text mining for genomics and systems biology
title What the papers say: Text mining for genomics and systems biology
title_full What the papers say: Text mining for genomics and systems biology
title_fullStr What the papers say: Text mining for genomics and systems biology
title_full_unstemmed What the papers say: Text mining for genomics and systems biology
title_short What the papers say: Text mining for genomics and systems biology
title_sort what the papers say: text mining for genomics and systems biology
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3500154/
https://www.ncbi.nlm.nih.gov/pubmed/21106487
http://dx.doi.org/10.1186/1479-7364-5-1-17
work_keys_str_mv AT harmstonnathan whatthepaperssaytextminingforgenomicsandsystemsbiology
AT filsellwendy whatthepaperssaytextminingforgenomicsandsystemsbiology
AT stumpfmichaelph whatthepaperssaytextminingforgenomicsandsystemsbiology