Cargando…

Large-scale extraction of brain connectivity from the neuroscientific literature

Motivation: In neuroscience, as in many other scientific domains, the primary form of knowledge dissemination is through published articles. One challenge for modern neuroinformatics is finding methods to make the knowledge from the tremendous backlog of publications accessible for search, analysis...

Descripción completa

Detalles Bibliográficos
Autores principales: Richardet, Renaud, Chappelier, Jean-Cédric, Telefont, Martin, Hill, Sean
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4426844/
https://www.ncbi.nlm.nih.gov/pubmed/25609795
http://dx.doi.org/10.1093/bioinformatics/btv025
_version_ 1782370644730576896
author Richardet, Renaud
Chappelier, Jean-Cédric
Telefont, Martin
Hill, Sean
author_facet Richardet, Renaud
Chappelier, Jean-Cédric
Telefont, Martin
Hill, Sean
author_sort Richardet, Renaud
collection PubMed
description Motivation: In neuroscience, as in many other scientific domains, the primary form of knowledge dissemination is through published articles. One challenge for modern neuroinformatics is finding methods to make the knowledge from the tremendous backlog of publications accessible for search, analysis and the integration of such data into computational models. A key example of this is metascale brain connectivity, where results are not reported in a normalized repository. Instead, these experimental results are published in natural language, scattered among individual scientific publications. This lack of normalization and centralization hinders the large-scale integration of brain connectivity results. In this article, we present text-mining models to extract and aggregate brain connectivity results from 13.2 million PubMed abstracts and 630 216 full-text publications related to neuroscience. The brain regions are identified with three different named entity recognizers (NERs) and then normalized against two atlases: the Allen Brain Atlas (ABA) and the atlas from the Brain Architecture Management System (BAMS). We then use three different extractors to assess inter-region connectivity. Results: NERs and connectivity extractors are evaluated against a manually annotated corpus. The complete in litero extraction models are also evaluated against in vivo connectivity data from ABA with an estimated precision of 78%. The resulting database contains over 4 million brain region mentions and over 100 000 (ABA) and 122 000 (BAMS) potential brain region connections. This database drastically accelerates connectivity literature review, by providing a centralized repository of connectivity data to neuroscientists. Availability and implementation: The resulting models are publicly available at github.com/BlueBrain/bluima. Contact: renaud.richardet@epfl.ch Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4426844
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-44268442015-05-15 Large-scale extraction of brain connectivity from the neuroscientific literature Richardet, Renaud Chappelier, Jean-Cédric Telefont, Martin Hill, Sean Bioinformatics Original Papers Motivation: In neuroscience, as in many other scientific domains, the primary form of knowledge dissemination is through published articles. One challenge for modern neuroinformatics is finding methods to make the knowledge from the tremendous backlog of publications accessible for search, analysis and the integration of such data into computational models. A key example of this is metascale brain connectivity, where results are not reported in a normalized repository. Instead, these experimental results are published in natural language, scattered among individual scientific publications. This lack of normalization and centralization hinders the large-scale integration of brain connectivity results. In this article, we present text-mining models to extract and aggregate brain connectivity results from 13.2 million PubMed abstracts and 630 216 full-text publications related to neuroscience. The brain regions are identified with three different named entity recognizers (NERs) and then normalized against two atlases: the Allen Brain Atlas (ABA) and the atlas from the Brain Architecture Management System (BAMS). We then use three different extractors to assess inter-region connectivity. Results: NERs and connectivity extractors are evaluated against a manually annotated corpus. The complete in litero extraction models are also evaluated against in vivo connectivity data from ABA with an estimated precision of 78%. The resulting database contains over 4 million brain region mentions and over 100 000 (ABA) and 122 000 (BAMS) potential brain region connections. This database drastically accelerates connectivity literature review, by providing a centralized repository of connectivity data to neuroscientists. Availability and implementation: The resulting models are publicly available at github.com/BlueBrain/bluima. Contact: renaud.richardet@epfl.ch Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2015-05-15 2015-01-20 /pmc/articles/PMC4426844/ /pubmed/25609795 http://dx.doi.org/10.1093/bioinformatics/btv025 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Richardet, Renaud
Chappelier, Jean-Cédric
Telefont, Martin
Hill, Sean
Large-scale extraction of brain connectivity from the neuroscientific literature
title Large-scale extraction of brain connectivity from the neuroscientific literature
title_full Large-scale extraction of brain connectivity from the neuroscientific literature
title_fullStr Large-scale extraction of brain connectivity from the neuroscientific literature
title_full_unstemmed Large-scale extraction of brain connectivity from the neuroscientific literature
title_short Large-scale extraction of brain connectivity from the neuroscientific literature
title_sort large-scale extraction of brain connectivity from the neuroscientific literature
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4426844/
https://www.ncbi.nlm.nih.gov/pubmed/25609795
http://dx.doi.org/10.1093/bioinformatics/btv025
work_keys_str_mv AT richardetrenaud largescaleextractionofbrainconnectivityfromtheneuroscientificliterature
AT chappelierjeancedric largescaleextractionofbrainconnectivityfromtheneuroscientificliterature
AT telefontmartin largescaleextractionofbrainconnectivityfromtheneuroscientificliterature
AT hillsean largescaleextractionofbrainconnectivityfromtheneuroscientificliterature