Cargando…

Application and evaluation of automated methods to extract neuroanatomical connectivity statements from free text

Motivation: Automated annotation of neuroanatomical connectivity statements from the neuroscience literature would enable accessible and large-scale connectivity resources. Unfortunately, the connectivity findings are not formally encoded and occur as natural language text. This hinders aggregation,...

Descripción completa

Detalles Bibliográficos
Autores principales:	French, Leon, Lane, Suzanne, Xu, Lydia, Siu, Celia, Kwok, Cathy, Chen, Yiqi, Krebs, Claudia, Pavlidis, Paul
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2012
Materias:	Original Papers
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3496336/ https://www.ncbi.nlm.nih.gov/pubmed/22954628 http://dx.doi.org/10.1093/bioinformatics/bts542

_version_	1782249618813222912
author	French, Leon Lane, Suzanne Xu, Lydia Siu, Celia Kwok, Cathy Chen, Yiqi Krebs, Claudia Pavlidis, Paul
author_facet	French, Leon Lane, Suzanne Xu, Lydia Siu, Celia Kwok, Cathy Chen, Yiqi Krebs, Claudia Pavlidis, Paul
author_sort	French, Leon
collection	PubMed
description	Motivation: Automated annotation of neuroanatomical connectivity statements from the neuroscience literature would enable accessible and large-scale connectivity resources. Unfortunately, the connectivity findings are not formally encoded and occur as natural language text. This hinders aggregation, indexing, searching and integration of the reports. We annotated a set of 1377 abstracts for connectivity relations to facilitate automated extraction of connectivity relationships from neuroscience literature. We tested several baseline measures based on co-occurrence and lexical rules. We compare results from seven machine learning methods adapted from the protein interaction extraction domain that employ part-of-speech, dependency and syntax features. Results: Co-occurrence based methods provided high recall with weak precision. The shallow linguistic kernel recalled 70.1% of the sentence-level connectivity statements at 50.3% precision. Owing to its speed and simplicity, we applied the shallow linguistic kernel to a large set of new abstracts. To evaluate the results, we compared 2688 extracted connections with the Brain Architecture Management System (an existing database of rat connectivity). The extracted connections were connected in the Brain Architecture Management System at a rate of 63.5%, compared with 51.1% for co-occurring brain region pairs. We found that precision increases with the recency and frequency of the extracted relationships. Availability and implementation: The source code, evaluations, documentation and other supplementary materials are available at http://www.chibi.ubc.ca/WhiteText. Contact: paul@chibi.ubc.ca Supplementary information: Supplementary data are available at Bioinformatics Online.
format	Online Article Text
id	pubmed-3496336
institution	National Center for Biotechnology Information
language	English
publishDate	2012
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-34963362012-12-12 Application and evaluation of automated methods to extract neuroanatomical connectivity statements from free text French, Leon Lane, Suzanne Xu, Lydia Siu, Celia Kwok, Cathy Chen, Yiqi Krebs, Claudia Pavlidis, Paul Bioinformatics Original Papers Motivation: Automated annotation of neuroanatomical connectivity statements from the neuroscience literature would enable accessible and large-scale connectivity resources. Unfortunately, the connectivity findings are not formally encoded and occur as natural language text. This hinders aggregation, indexing, searching and integration of the reports. We annotated a set of 1377 abstracts for connectivity relations to facilitate automated extraction of connectivity relationships from neuroscience literature. We tested several baseline measures based on co-occurrence and lexical rules. We compare results from seven machine learning methods adapted from the protein interaction extraction domain that employ part-of-speech, dependency and syntax features. Results: Co-occurrence based methods provided high recall with weak precision. The shallow linguistic kernel recalled 70.1% of the sentence-level connectivity statements at 50.3% precision. Owing to its speed and simplicity, we applied the shallow linguistic kernel to a large set of new abstracts. To evaluate the results, we compared 2688 extracted connections with the Brain Architecture Management System (an existing database of rat connectivity). The extracted connections were connected in the Brain Architecture Management System at a rate of 63.5%, compared with 51.1% for co-occurring brain region pairs. We found that precision increases with the recency and frequency of the extracted relationships. Availability and implementation: The source code, evaluations, documentation and other supplementary materials are available at http://www.chibi.ubc.ca/WhiteText. Contact: paul@chibi.ubc.ca Supplementary information: Supplementary data are available at Bioinformatics Online. Oxford University Press 2012-11-15 2012-09-06 /pmc/articles/PMC3496336/ /pubmed/22954628 http://dx.doi.org/10.1093/bioinformatics/bts542 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Original Papers French, Leon Lane, Suzanne Xu, Lydia Siu, Celia Kwok, Cathy Chen, Yiqi Krebs, Claudia Pavlidis, Paul Application and evaluation of automated methods to extract neuroanatomical connectivity statements from free text
title	Application and evaluation of automated methods to extract neuroanatomical connectivity statements from free text
title_full	Application and evaluation of automated methods to extract neuroanatomical connectivity statements from free text
title_fullStr	Application and evaluation of automated methods to extract neuroanatomical connectivity statements from free text
title_full_unstemmed	Application and evaluation of automated methods to extract neuroanatomical connectivity statements from free text
title_short	Application and evaluation of automated methods to extract neuroanatomical connectivity statements from free text
title_sort	application and evaluation of automated methods to extract neuroanatomical connectivity statements from free text
topic	Original Papers
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3496336/ https://www.ncbi.nlm.nih.gov/pubmed/22954628 http://dx.doi.org/10.1093/bioinformatics/bts542
work_keys_str_mv	AT frenchleon applicationandevaluationofautomatedmethodstoextractneuroanatomicalconnectivitystatementsfromfreetext AT lanesuzanne applicationandevaluationofautomatedmethodstoextractneuroanatomicalconnectivitystatementsfromfreetext AT xulydia applicationandevaluationofautomatedmethodstoextractneuroanatomicalconnectivitystatementsfromfreetext AT siucelia applicationandevaluationofautomatedmethodstoextractneuroanatomicalconnectivitystatementsfromfreetext AT kwokcathy applicationandevaluationofautomatedmethodstoextractneuroanatomicalconnectivitystatementsfromfreetext AT chenyiqi applicationandevaluationofautomatedmethodstoextractneuroanatomicalconnectivitystatementsfromfreetext AT krebsclaudia applicationandevaluationofautomatedmethodstoextractneuroanatomicalconnectivitystatementsfromfreetext AT pavlidispaul applicationandevaluationofautomatedmethodstoextractneuroanatomicalconnectivitystatementsfromfreetext

Application and evaluation of automated methods to extract neuroanatomical connectivity statements from free text

Ejemplares similares