Cargando…

NeuroBridge: a prototype platform for discovery of the long-tail neuroimaging data

INTRODUCTION: Open science initiatives have enabled sharing of large amounts of already collected data. However, significant gaps remain regarding how to find appropriate data, including underutilized data that exist in the long tail of science. We demonstrate the NeuroBridge prototype and its abili...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Lei, Ambite, José Luis, Appaji, Abhishek, Bijsterbosch, Janine, Dockes, Jerome, Herrick, Rick, Kogan, Alex, Lander, Howard, Marcus, Daniel, Moore, Stephen M., Poline, Jean-Baptiste, Rajasekar, Arcot, Sahoo, Satya S., Turner, Matthew D., Wang, Xiaochen, Wang, Yue, Turner, Jessica A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10500076/
https://www.ncbi.nlm.nih.gov/pubmed/37720825
http://dx.doi.org/10.3389/fninf.2023.1215261
_version_ 1785105847300390912
author Wang, Lei
Ambite, José Luis
Appaji, Abhishek
Bijsterbosch, Janine
Dockes, Jerome
Herrick, Rick
Kogan, Alex
Lander, Howard
Marcus, Daniel
Moore, Stephen M.
Poline, Jean-Baptiste
Rajasekar, Arcot
Sahoo, Satya S.
Turner, Matthew D.
Wang, Xiaochen
Wang, Yue
Turner, Jessica A.
author_facet Wang, Lei
Ambite, José Luis
Appaji, Abhishek
Bijsterbosch, Janine
Dockes, Jerome
Herrick, Rick
Kogan, Alex
Lander, Howard
Marcus, Daniel
Moore, Stephen M.
Poline, Jean-Baptiste
Rajasekar, Arcot
Sahoo, Satya S.
Turner, Matthew D.
Wang, Xiaochen
Wang, Yue
Turner, Jessica A.
author_sort Wang, Lei
collection PubMed
description INTRODUCTION: Open science initiatives have enabled sharing of large amounts of already collected data. However, significant gaps remain regarding how to find appropriate data, including underutilized data that exist in the long tail of science. We demonstrate the NeuroBridge prototype and its ability to search PubMed Central full-text papers for information relevant to neuroimaging data collected from schizophrenia and addiction studies. METHODS: The NeuroBridge architecture contained the following components: (1) Extensible ontology for modeling study metadata: subject population, imaging techniques, and relevant behavioral, cognitive, or clinical data. Details are described in the companion paper in this special issue; (2) A natural-language based document processor that leveraged pre-trained deep-learning models on a small-sample document corpus to establish efficient representations for each article as a collection of machine-recognized ontological terms; (3) Integrated search using ontology-driven similarity to query PubMed Central and NeuroQuery, which provides fMRI activation maps along with PubMed source articles. RESULTS: The NeuroBridge prototype contains a corpus of 356 papers from 2018 to 2021 describing schizophrenia and addiction neuroimaging studies, of which 186 were annotated with the NeuroBridge ontology. The search portal on the NeuroBridge website https://neurobridges.org/ provides an interactive Query Builder, where the user builds queries by selecting NeuroBridge ontology terms to preserve the ontology tree structure. For each return entry, links to the PubMed abstract as well as to the PMC full-text article, if available, are presented. For each of the returned articles, we provide a list of clinical assessments described in the Section “Methods” of the article. Articles returned from NeuroQuery based on the same search are also presented. CONCLUSION: The NeuroBridge prototype combines ontology-based search with natural-language text-mining approaches to demonstrate that papers relevant to a user’s research question can be identified. The NeuroBridge prototype takes a first step toward identifying potential neuroimaging data described in full-text papers. Toward the overall goal of discovering “enough data of the right kind,” ongoing work includes validating the document processor with a larger corpus, extending the ontology to include detailed imaging data, and extracting information regarding data availability from the returned publications and incorporating XNAT-based neuroimaging databases to enhance data accessibility.
format Online
Article
Text
id pubmed-10500076
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-105000762023-09-15 NeuroBridge: a prototype platform for discovery of the long-tail neuroimaging data Wang, Lei Ambite, José Luis Appaji, Abhishek Bijsterbosch, Janine Dockes, Jerome Herrick, Rick Kogan, Alex Lander, Howard Marcus, Daniel Moore, Stephen M. Poline, Jean-Baptiste Rajasekar, Arcot Sahoo, Satya S. Turner, Matthew D. Wang, Xiaochen Wang, Yue Turner, Jessica A. Front Neuroinform Neuroscience INTRODUCTION: Open science initiatives have enabled sharing of large amounts of already collected data. However, significant gaps remain regarding how to find appropriate data, including underutilized data that exist in the long tail of science. We demonstrate the NeuroBridge prototype and its ability to search PubMed Central full-text papers for information relevant to neuroimaging data collected from schizophrenia and addiction studies. METHODS: The NeuroBridge architecture contained the following components: (1) Extensible ontology for modeling study metadata: subject population, imaging techniques, and relevant behavioral, cognitive, or clinical data. Details are described in the companion paper in this special issue; (2) A natural-language based document processor that leveraged pre-trained deep-learning models on a small-sample document corpus to establish efficient representations for each article as a collection of machine-recognized ontological terms; (3) Integrated search using ontology-driven similarity to query PubMed Central and NeuroQuery, which provides fMRI activation maps along with PubMed source articles. RESULTS: The NeuroBridge prototype contains a corpus of 356 papers from 2018 to 2021 describing schizophrenia and addiction neuroimaging studies, of which 186 were annotated with the NeuroBridge ontology. The search portal on the NeuroBridge website https://neurobridges.org/ provides an interactive Query Builder, where the user builds queries by selecting NeuroBridge ontology terms to preserve the ontology tree structure. For each return entry, links to the PubMed abstract as well as to the PMC full-text article, if available, are presented. For each of the returned articles, we provide a list of clinical assessments described in the Section “Methods” of the article. Articles returned from NeuroQuery based on the same search are also presented. CONCLUSION: The NeuroBridge prototype combines ontology-based search with natural-language text-mining approaches to demonstrate that papers relevant to a user’s research question can be identified. The NeuroBridge prototype takes a first step toward identifying potential neuroimaging data described in full-text papers. Toward the overall goal of discovering “enough data of the right kind,” ongoing work includes validating the document processor with a larger corpus, extending the ontology to include detailed imaging data, and extracting information regarding data availability from the returned publications and incorporating XNAT-based neuroimaging databases to enhance data accessibility. Frontiers Media S.A. 2023-08-31 /pmc/articles/PMC10500076/ /pubmed/37720825 http://dx.doi.org/10.3389/fninf.2023.1215261 Text en Copyright © 2023 Wang, Ambite, Appaji, Bijsterbosch, Dockes, Herrick, Kogan, Lander, Marcus, Moore, Poline, Rajasekar, Sahoo, Turner, Wang, Wang and Turner. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Wang, Lei
Ambite, José Luis
Appaji, Abhishek
Bijsterbosch, Janine
Dockes, Jerome
Herrick, Rick
Kogan, Alex
Lander, Howard
Marcus, Daniel
Moore, Stephen M.
Poline, Jean-Baptiste
Rajasekar, Arcot
Sahoo, Satya S.
Turner, Matthew D.
Wang, Xiaochen
Wang, Yue
Turner, Jessica A.
NeuroBridge: a prototype platform for discovery of the long-tail neuroimaging data
title NeuroBridge: a prototype platform for discovery of the long-tail neuroimaging data
title_full NeuroBridge: a prototype platform for discovery of the long-tail neuroimaging data
title_fullStr NeuroBridge: a prototype platform for discovery of the long-tail neuroimaging data
title_full_unstemmed NeuroBridge: a prototype platform for discovery of the long-tail neuroimaging data
title_short NeuroBridge: a prototype platform for discovery of the long-tail neuroimaging data
title_sort neurobridge: a prototype platform for discovery of the long-tail neuroimaging data
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10500076/
https://www.ncbi.nlm.nih.gov/pubmed/37720825
http://dx.doi.org/10.3389/fninf.2023.1215261
work_keys_str_mv AT wanglei neurobridgeaprototypeplatformfordiscoveryofthelongtailneuroimagingdata
AT ambitejoseluis neurobridgeaprototypeplatformfordiscoveryofthelongtailneuroimagingdata
AT appajiabhishek neurobridgeaprototypeplatformfordiscoveryofthelongtailneuroimagingdata
AT bijsterboschjanine neurobridgeaprototypeplatformfordiscoveryofthelongtailneuroimagingdata
AT dockesjerome neurobridgeaprototypeplatformfordiscoveryofthelongtailneuroimagingdata
AT herrickrick neurobridgeaprototypeplatformfordiscoveryofthelongtailneuroimagingdata
AT koganalex neurobridgeaprototypeplatformfordiscoveryofthelongtailneuroimagingdata
AT landerhoward neurobridgeaprototypeplatformfordiscoveryofthelongtailneuroimagingdata
AT marcusdaniel neurobridgeaprototypeplatformfordiscoveryofthelongtailneuroimagingdata
AT moorestephenm neurobridgeaprototypeplatformfordiscoveryofthelongtailneuroimagingdata
AT polinejeanbaptiste neurobridgeaprototypeplatformfordiscoveryofthelongtailneuroimagingdata
AT rajasekararcot neurobridgeaprototypeplatformfordiscoveryofthelongtailneuroimagingdata
AT sahoosatyas neurobridgeaprototypeplatformfordiscoveryofthelongtailneuroimagingdata
AT turnermatthewd neurobridgeaprototypeplatformfordiscoveryofthelongtailneuroimagingdata
AT wangxiaochen neurobridgeaprototypeplatformfordiscoveryofthelongtailneuroimagingdata
AT wangyue neurobridgeaprototypeplatformfordiscoveryofthelongtailneuroimagingdata
AT turnerjessicaa neurobridgeaprototypeplatformfordiscoveryofthelongtailneuroimagingdata