Cargando…
Large-scale automated machine reading discovers new cancer-driving mechanisms
PubMed, a repository and search engine for biomedical literature, now indexes >1 million articles each year. This exceeds the processing capacity of human domain experts, limiting our ability to truly understand many diseases. We present Reach, a system for automated, large-scale machine reading...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6156821/ https://www.ncbi.nlm.nih.gov/pubmed/30256986 http://dx.doi.org/10.1093/database/bay098 |
_version_ | 1783358165054652416 |
---|---|
author | Valenzuela-Escárcega, Marco A Babur, Özgün Hahn-Powell, Gus Bell, Dane Hicks, Thomas Noriega-Atala, Enrique Wang, Xia Surdeanu, Mihai Demir, Emek Morrison, Clayton T |
author_facet | Valenzuela-Escárcega, Marco A Babur, Özgün Hahn-Powell, Gus Bell, Dane Hicks, Thomas Noriega-Atala, Enrique Wang, Xia Surdeanu, Mihai Demir, Emek Morrison, Clayton T |
author_sort | Valenzuela-Escárcega, Marco A |
collection | PubMed |
description | PubMed, a repository and search engine for biomedical literature, now indexes >1 million articles each year. This exceeds the processing capacity of human domain experts, limiting our ability to truly understand many diseases. We present Reach, a system for automated, large-scale machine reading of biomedical papers that can extract mechanistic descriptions of biological processes with relatively high precision at high throughput. We demonstrate that combining the extracted pathway fragments with existing biological data analysis algorithms that rely on curated models helps identify and explain a large number of previously unidentified mutually exclusive altered signaling pathways in seven different cancer types. This work shows that combining human-curated ‘big mechanisms’ with extracted ‘big data’ can lead to a causal, predictive understanding of cellular processes and unlock important downstream applications. |
format | Online Article Text |
id | pubmed-6156821 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-61568212018-10-02 Large-scale automated machine reading discovers new cancer-driving mechanisms Valenzuela-Escárcega, Marco A Babur, Özgün Hahn-Powell, Gus Bell, Dane Hicks, Thomas Noriega-Atala, Enrique Wang, Xia Surdeanu, Mihai Demir, Emek Morrison, Clayton T Database (Oxford) Original Article PubMed, a repository and search engine for biomedical literature, now indexes >1 million articles each year. This exceeds the processing capacity of human domain experts, limiting our ability to truly understand many diseases. We present Reach, a system for automated, large-scale machine reading of biomedical papers that can extract mechanistic descriptions of biological processes with relatively high precision at high throughput. We demonstrate that combining the extracted pathway fragments with existing biological data analysis algorithms that rely on curated models helps identify and explain a large number of previously unidentified mutually exclusive altered signaling pathways in seven different cancer types. This work shows that combining human-curated ‘big mechanisms’ with extracted ‘big data’ can lead to a causal, predictive understanding of cellular processes and unlock important downstream applications. Oxford University Press 2018-09-26 /pmc/articles/PMC6156821/ /pubmed/30256986 http://dx.doi.org/10.1093/database/bay098 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Valenzuela-Escárcega, Marco A Babur, Özgün Hahn-Powell, Gus Bell, Dane Hicks, Thomas Noriega-Atala, Enrique Wang, Xia Surdeanu, Mihai Demir, Emek Morrison, Clayton T Large-scale automated machine reading discovers new cancer-driving mechanisms |
title | Large-scale automated machine reading discovers new cancer-driving mechanisms |
title_full | Large-scale automated machine reading discovers new cancer-driving mechanisms |
title_fullStr | Large-scale automated machine reading discovers new cancer-driving mechanisms |
title_full_unstemmed | Large-scale automated machine reading discovers new cancer-driving mechanisms |
title_short | Large-scale automated machine reading discovers new cancer-driving mechanisms |
title_sort | large-scale automated machine reading discovers new cancer-driving mechanisms |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6156821/ https://www.ncbi.nlm.nih.gov/pubmed/30256986 http://dx.doi.org/10.1093/database/bay098 |
work_keys_str_mv | AT valenzuelaescarcegamarcoa largescaleautomatedmachinereadingdiscoversnewcancerdrivingmechanisms AT baburozgun largescaleautomatedmachinereadingdiscoversnewcancerdrivingmechanisms AT hahnpowellgus largescaleautomatedmachinereadingdiscoversnewcancerdrivingmechanisms AT belldane largescaleautomatedmachinereadingdiscoversnewcancerdrivingmechanisms AT hicksthomas largescaleautomatedmachinereadingdiscoversnewcancerdrivingmechanisms AT noriegaatalaenrique largescaleautomatedmachinereadingdiscoversnewcancerdrivingmechanisms AT wangxia largescaleautomatedmachinereadingdiscoversnewcancerdrivingmechanisms AT surdeanumihai largescaleautomatedmachinereadingdiscoversnewcancerdrivingmechanisms AT demiremek largescaleautomatedmachinereadingdiscoversnewcancerdrivingmechanisms AT morrisonclaytont largescaleautomatedmachinereadingdiscoversnewcancerdrivingmechanisms |