Cargando…

Large-scale automated machine reading discovers new cancer-driving mechanisms

PubMed, a repository and search engine for biomedical literature, now indexes >1 million articles each year. This exceeds the processing capacity of human domain experts, limiting our ability to truly understand many diseases. We present Reach, a system for automated, large-scale machine reading...

Descripción completa

Detalles Bibliográficos
Autores principales: Valenzuela-Escárcega, Marco A, Babur, Özgün, Hahn-Powell, Gus, Bell, Dane, Hicks, Thomas, Noriega-Atala, Enrique, Wang, Xia, Surdeanu, Mihai, Demir, Emek, Morrison, Clayton T
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6156821/
https://www.ncbi.nlm.nih.gov/pubmed/30256986
http://dx.doi.org/10.1093/database/bay098
_version_ 1783358165054652416
author Valenzuela-Escárcega, Marco A
Babur, Özgün
Hahn-Powell, Gus
Bell, Dane
Hicks, Thomas
Noriega-Atala, Enrique
Wang, Xia
Surdeanu, Mihai
Demir, Emek
Morrison, Clayton T
author_facet Valenzuela-Escárcega, Marco A
Babur, Özgün
Hahn-Powell, Gus
Bell, Dane
Hicks, Thomas
Noriega-Atala, Enrique
Wang, Xia
Surdeanu, Mihai
Demir, Emek
Morrison, Clayton T
author_sort Valenzuela-Escárcega, Marco A
collection PubMed
description PubMed, a repository and search engine for biomedical literature, now indexes >1 million articles each year. This exceeds the processing capacity of human domain experts, limiting our ability to truly understand many diseases. We present Reach, a system for automated, large-scale machine reading of biomedical papers that can extract mechanistic descriptions of biological processes with relatively high precision at high throughput. We demonstrate that combining the extracted pathway fragments with existing biological data analysis algorithms that rely on curated models helps identify and explain a large number of previously unidentified mutually exclusive altered signaling pathways in seven different cancer types. This work shows that combining human-curated ‘big mechanisms’ with extracted ‘big data’ can lead to a causal, predictive understanding of cellular processes and unlock important downstream applications.
format Online
Article
Text
id pubmed-6156821
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-61568212018-10-02 Large-scale automated machine reading discovers new cancer-driving mechanisms Valenzuela-Escárcega, Marco A Babur, Özgün Hahn-Powell, Gus Bell, Dane Hicks, Thomas Noriega-Atala, Enrique Wang, Xia Surdeanu, Mihai Demir, Emek Morrison, Clayton T Database (Oxford) Original Article PubMed, a repository and search engine for biomedical literature, now indexes >1 million articles each year. This exceeds the processing capacity of human domain experts, limiting our ability to truly understand many diseases. We present Reach, a system for automated, large-scale machine reading of biomedical papers that can extract mechanistic descriptions of biological processes with relatively high precision at high throughput. We demonstrate that combining the extracted pathway fragments with existing biological data analysis algorithms that rely on curated models helps identify and explain a large number of previously unidentified mutually exclusive altered signaling pathways in seven different cancer types. This work shows that combining human-curated ‘big mechanisms’ with extracted ‘big data’ can lead to a causal, predictive understanding of cellular processes and unlock important downstream applications. Oxford University Press 2018-09-26 /pmc/articles/PMC6156821/ /pubmed/30256986 http://dx.doi.org/10.1093/database/bay098 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Valenzuela-Escárcega, Marco A
Babur, Özgün
Hahn-Powell, Gus
Bell, Dane
Hicks, Thomas
Noriega-Atala, Enrique
Wang, Xia
Surdeanu, Mihai
Demir, Emek
Morrison, Clayton T
Large-scale automated machine reading discovers new cancer-driving mechanisms
title Large-scale automated machine reading discovers new cancer-driving mechanisms
title_full Large-scale automated machine reading discovers new cancer-driving mechanisms
title_fullStr Large-scale automated machine reading discovers new cancer-driving mechanisms
title_full_unstemmed Large-scale automated machine reading discovers new cancer-driving mechanisms
title_short Large-scale automated machine reading discovers new cancer-driving mechanisms
title_sort large-scale automated machine reading discovers new cancer-driving mechanisms
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6156821/
https://www.ncbi.nlm.nih.gov/pubmed/30256986
http://dx.doi.org/10.1093/database/bay098
work_keys_str_mv AT valenzuelaescarcegamarcoa largescaleautomatedmachinereadingdiscoversnewcancerdrivingmechanisms
AT baburozgun largescaleautomatedmachinereadingdiscoversnewcancerdrivingmechanisms
AT hahnpowellgus largescaleautomatedmachinereadingdiscoversnewcancerdrivingmechanisms
AT belldane largescaleautomatedmachinereadingdiscoversnewcancerdrivingmechanisms
AT hicksthomas largescaleautomatedmachinereadingdiscoversnewcancerdrivingmechanisms
AT noriegaatalaenrique largescaleautomatedmachinereadingdiscoversnewcancerdrivingmechanisms
AT wangxia largescaleautomatedmachinereadingdiscoversnewcancerdrivingmechanisms
AT surdeanumihai largescaleautomatedmachinereadingdiscoversnewcancerdrivingmechanisms
AT demiremek largescaleautomatedmachinereadingdiscoversnewcancerdrivingmechanisms
AT morrisonclaytont largescaleautomatedmachinereadingdiscoversnewcancerdrivingmechanisms