Cargando…

Menagerie: A text-mining tool to support animal-human translation in neurodegeneration research

Discovery studies in animals constitute a cornerstone of biomedical research, but suffer from lack of generalizability to human populations. We propose that large-scale interrogation of these data could reveal patterns of animal use that could narrow the translational divide. We describe a text-mini...

Descripción completa

Detalles Bibliográficos
Autores principales: Zeiss, Caroline J., Shin, Dongwook, Vander Wyk, Brent, Beck, Amanda P., Zatz, Natalie, Sneiderman, Charles A., Kilicoglu, Halil
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6917268/
https://www.ncbi.nlm.nih.gov/pubmed/31846471
http://dx.doi.org/10.1371/journal.pone.0226176
_version_ 1783480378823016448
author Zeiss, Caroline J.
Shin, Dongwook
Vander Wyk, Brent
Beck, Amanda P.
Zatz, Natalie
Sneiderman, Charles A.
Kilicoglu, Halil
author_facet Zeiss, Caroline J.
Shin, Dongwook
Vander Wyk, Brent
Beck, Amanda P.
Zatz, Natalie
Sneiderman, Charles A.
Kilicoglu, Halil
author_sort Zeiss, Caroline J.
collection PubMed
description Discovery studies in animals constitute a cornerstone of biomedical research, but suffer from lack of generalizability to human populations. We propose that large-scale interrogation of these data could reveal patterns of animal use that could narrow the translational divide. We describe a text-mining approach that extracts translationally useful data from PubMed abstracts. These comprise six modules: species, model, genes, interventions/disease modifiers, overall outcome and functional outcome measures. Existing National Library of Medicine natural language processing tools (SemRep, GNormPlus and the Chemical annotator) underpin the program and are further augmented by various rules, term lists, and machine learning models. Evaluation of the program using a 98-abstract test set achieved F(1) scores ranging from 0.75–0.95 across all modules, and exceeded F(1) scores obtained from comparable baseline programs. Next, the program was applied to a larger 14,481 abstract data set (2008–2017). Expected and previously identified patterns of species and model use for the field were obtained. As previously noted, the majority of studies reported promising outcomes. Longitudinal patterns of intervention type or gene mentions were demonstrated, and patterns of animal model use characteristic of the Parkinson’s disease field were confirmed. The primary function of the program is to overcome low external validity of animal model systems by aggregating evidence across a diversity of models that capture different aspects of a multifaceted cellular process. Some aspects of the tool are generalizable, whereas others are field-specific. In the initial version presented here, we demonstrate proof of concept within a single disease area, Parkinson’s disease. However, the program can be expanded in modular fashion to support a wider range of neurodegenerative diseases.
format Online
Article
Text
id pubmed-6917268
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-69172682019-12-27 Menagerie: A text-mining tool to support animal-human translation in neurodegeneration research Zeiss, Caroline J. Shin, Dongwook Vander Wyk, Brent Beck, Amanda P. Zatz, Natalie Sneiderman, Charles A. Kilicoglu, Halil PLoS One Research Article Discovery studies in animals constitute a cornerstone of biomedical research, but suffer from lack of generalizability to human populations. We propose that large-scale interrogation of these data could reveal patterns of animal use that could narrow the translational divide. We describe a text-mining approach that extracts translationally useful data from PubMed abstracts. These comprise six modules: species, model, genes, interventions/disease modifiers, overall outcome and functional outcome measures. Existing National Library of Medicine natural language processing tools (SemRep, GNormPlus and the Chemical annotator) underpin the program and are further augmented by various rules, term lists, and machine learning models. Evaluation of the program using a 98-abstract test set achieved F(1) scores ranging from 0.75–0.95 across all modules, and exceeded F(1) scores obtained from comparable baseline programs. Next, the program was applied to a larger 14,481 abstract data set (2008–2017). Expected and previously identified patterns of species and model use for the field were obtained. As previously noted, the majority of studies reported promising outcomes. Longitudinal patterns of intervention type or gene mentions were demonstrated, and patterns of animal model use characteristic of the Parkinson’s disease field were confirmed. The primary function of the program is to overcome low external validity of animal model systems by aggregating evidence across a diversity of models that capture different aspects of a multifaceted cellular process. Some aspects of the tool are generalizable, whereas others are field-specific. In the initial version presented here, we demonstrate proof of concept within a single disease area, Parkinson’s disease. However, the program can be expanded in modular fashion to support a wider range of neurodegenerative diseases. Public Library of Science 2019-12-17 /pmc/articles/PMC6917268/ /pubmed/31846471 http://dx.doi.org/10.1371/journal.pone.0226176 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Zeiss, Caroline J.
Shin, Dongwook
Vander Wyk, Brent
Beck, Amanda P.
Zatz, Natalie
Sneiderman, Charles A.
Kilicoglu, Halil
Menagerie: A text-mining tool to support animal-human translation in neurodegeneration research
title Menagerie: A text-mining tool to support animal-human translation in neurodegeneration research
title_full Menagerie: A text-mining tool to support animal-human translation in neurodegeneration research
title_fullStr Menagerie: A text-mining tool to support animal-human translation in neurodegeneration research
title_full_unstemmed Menagerie: A text-mining tool to support animal-human translation in neurodegeneration research
title_short Menagerie: A text-mining tool to support animal-human translation in neurodegeneration research
title_sort menagerie: a text-mining tool to support animal-human translation in neurodegeneration research
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6917268/
https://www.ncbi.nlm.nih.gov/pubmed/31846471
http://dx.doi.org/10.1371/journal.pone.0226176
work_keys_str_mv AT zeisscarolinej menagerieatextminingtooltosupportanimalhumantranslationinneurodegenerationresearch
AT shindongwook menagerieatextminingtooltosupportanimalhumantranslationinneurodegenerationresearch
AT vanderwykbrent menagerieatextminingtooltosupportanimalhumantranslationinneurodegenerationresearch
AT beckamandap menagerieatextminingtooltosupportanimalhumantranslationinneurodegenerationresearch
AT zatznatalie menagerieatextminingtooltosupportanimalhumantranslationinneurodegenerationresearch
AT sneidermancharlesa menagerieatextminingtooltosupportanimalhumantranslationinneurodegenerationresearch
AT kilicogluhalil menagerieatextminingtooltosupportanimalhumantranslationinneurodegenerationresearch