Cargando…

Multi-level computational methods for interdisciplinary research in the HathiTrust Digital Library

We show how faceted search using a combination of traditional classification systems and mixed-membership topic models can go beyond keyword search to inform resource discovery, hypothesis formulation, and argument extraction for interdisciplinary research. Our test domain is the history and philoso...

Descripción completa

Detalles Bibliográficos
Autores principales: Murdock, Jaimie, Allen, Colin, Börner, Katy, Light, Robert, McAlister, Simon, Ravenscroft, Andrew, Rose, Robert, Rose, Doori, Otsuka, Jun, Bourget, David, Lawrence, John, Reed, Chris
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5602542/
https://www.ncbi.nlm.nih.gov/pubmed/28922416
http://dx.doi.org/10.1371/journal.pone.0184188
_version_ 1783264586767532032
author Murdock, Jaimie
Allen, Colin
Börner, Katy
Light, Robert
McAlister, Simon
Ravenscroft, Andrew
Rose, Robert
Rose, Doori
Otsuka, Jun
Bourget, David
Lawrence, John
Reed, Chris
author_facet Murdock, Jaimie
Allen, Colin
Börner, Katy
Light, Robert
McAlister, Simon
Ravenscroft, Andrew
Rose, Robert
Rose, Doori
Otsuka, Jun
Bourget, David
Lawrence, John
Reed, Chris
author_sort Murdock, Jaimie
collection PubMed
description We show how faceted search using a combination of traditional classification systems and mixed-membership topic models can go beyond keyword search to inform resource discovery, hypothesis formulation, and argument extraction for interdisciplinary research. Our test domain is the history and philosophy of scientific work on animal mind and cognition. The methods can be generalized to other research areas and ultimately support a system for semi-automatic identification of argument structures. We provide a case study for the application of the methods to the problem of identifying and extracting arguments about anthropomorphism during a critical period in the development of comparative psychology. We show how a combination of classification systems and mixed-membership models trained over large digital libraries can inform resource discovery in this domain. Through a novel approach of “drill-down” topic modeling—simultaneously reducing both the size of the corpus and the unit of analysis—we are able to reduce a large collection of fulltext volumes to a much smaller set of pages within six focal volumes containing arguments of interest to historians and philosophers of comparative psychology. The volumes identified in this way did not appear among the first ten results of the keyword search in the HathiTrust digital library and the pages bear the kind of “close reading” needed to generate original interpretations that is the heart of scholarly work in the humanities. Zooming back out, we provide a way to place the books onto a map of science originally constructed from very different data and for different purposes. The multilevel approach advances understanding of the intellectual and societal contexts in which writings are interpreted.
format Online
Article
Text
id pubmed-5602542
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-56025422017-09-22 Multi-level computational methods for interdisciplinary research in the HathiTrust Digital Library Murdock, Jaimie Allen, Colin Börner, Katy Light, Robert McAlister, Simon Ravenscroft, Andrew Rose, Robert Rose, Doori Otsuka, Jun Bourget, David Lawrence, John Reed, Chris PLoS One Research Article We show how faceted search using a combination of traditional classification systems and mixed-membership topic models can go beyond keyword search to inform resource discovery, hypothesis formulation, and argument extraction for interdisciplinary research. Our test domain is the history and philosophy of scientific work on animal mind and cognition. The methods can be generalized to other research areas and ultimately support a system for semi-automatic identification of argument structures. We provide a case study for the application of the methods to the problem of identifying and extracting arguments about anthropomorphism during a critical period in the development of comparative psychology. We show how a combination of classification systems and mixed-membership models trained over large digital libraries can inform resource discovery in this domain. Through a novel approach of “drill-down” topic modeling—simultaneously reducing both the size of the corpus and the unit of analysis—we are able to reduce a large collection of fulltext volumes to a much smaller set of pages within six focal volumes containing arguments of interest to historians and philosophers of comparative psychology. The volumes identified in this way did not appear among the first ten results of the keyword search in the HathiTrust digital library and the pages bear the kind of “close reading” needed to generate original interpretations that is the heart of scholarly work in the humanities. Zooming back out, we provide a way to place the books onto a map of science originally constructed from very different data and for different purposes. The multilevel approach advances understanding of the intellectual and societal contexts in which writings are interpreted. Public Library of Science 2017-09-18 /pmc/articles/PMC5602542/ /pubmed/28922416 http://dx.doi.org/10.1371/journal.pone.0184188 Text en © 2017 Murdock et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Murdock, Jaimie
Allen, Colin
Börner, Katy
Light, Robert
McAlister, Simon
Ravenscroft, Andrew
Rose, Robert
Rose, Doori
Otsuka, Jun
Bourget, David
Lawrence, John
Reed, Chris
Multi-level computational methods for interdisciplinary research in the HathiTrust Digital Library
title Multi-level computational methods for interdisciplinary research in the HathiTrust Digital Library
title_full Multi-level computational methods for interdisciplinary research in the HathiTrust Digital Library
title_fullStr Multi-level computational methods for interdisciplinary research in the HathiTrust Digital Library
title_full_unstemmed Multi-level computational methods for interdisciplinary research in the HathiTrust Digital Library
title_short Multi-level computational methods for interdisciplinary research in the HathiTrust Digital Library
title_sort multi-level computational methods for interdisciplinary research in the hathitrust digital library
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5602542/
https://www.ncbi.nlm.nih.gov/pubmed/28922416
http://dx.doi.org/10.1371/journal.pone.0184188
work_keys_str_mv AT murdockjaimie multilevelcomputationalmethodsforinterdisciplinaryresearchinthehathitrustdigitallibrary
AT allencolin multilevelcomputationalmethodsforinterdisciplinaryresearchinthehathitrustdigitallibrary
AT bornerkaty multilevelcomputationalmethodsforinterdisciplinaryresearchinthehathitrustdigitallibrary
AT lightrobert multilevelcomputationalmethodsforinterdisciplinaryresearchinthehathitrustdigitallibrary
AT mcalistersimon multilevelcomputationalmethodsforinterdisciplinaryresearchinthehathitrustdigitallibrary
AT ravenscroftandrew multilevelcomputationalmethodsforinterdisciplinaryresearchinthehathitrustdigitallibrary
AT roserobert multilevelcomputationalmethodsforinterdisciplinaryresearchinthehathitrustdigitallibrary
AT rosedoori multilevelcomputationalmethodsforinterdisciplinaryresearchinthehathitrustdigitallibrary
AT otsukajun multilevelcomputationalmethodsforinterdisciplinaryresearchinthehathitrustdigitallibrary
AT bourgetdavid multilevelcomputationalmethodsforinterdisciplinaryresearchinthehathitrustdigitallibrary
AT lawrencejohn multilevelcomputationalmethodsforinterdisciplinaryresearchinthehathitrustdigitallibrary
AT reedchris multilevelcomputationalmethodsforinterdisciplinaryresearchinthehathitrustdigitallibrary