Cargando…

Extracting causal relations on HIV drug resistance from literature

BACKGROUND: In HIV treatment it is critical to have up-to-date resistance data of applicable drugs since HIV has a very high rate of mutation. These data are made available through scientific publications and must be extracted manually by experts in order to be used by virologists and medical doctor...

Descripción completa

Detalles Bibliográficos
Autores principales: Bui, Quoc-Chinh, Nualláin, Breanndán Ó, Boucher, Charles A, Sloot, Peter MA
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2841207/
https://www.ncbi.nlm.nih.gov/pubmed/20178611
http://dx.doi.org/10.1186/1471-2105-11-101
_version_ 1782179093443248128
author Bui, Quoc-Chinh
Nualláin, Breanndán Ó
Boucher, Charles A
Sloot, Peter MA
author_facet Bui, Quoc-Chinh
Nualláin, Breanndán Ó
Boucher, Charles A
Sloot, Peter MA
author_sort Bui, Quoc-Chinh
collection PubMed
description BACKGROUND: In HIV treatment it is critical to have up-to-date resistance data of applicable drugs since HIV has a very high rate of mutation. These data are made available through scientific publications and must be extracted manually by experts in order to be used by virologists and medical doctors. Therefore there is an urgent need for a tool that partially automates this process and is able to retrieve relations between drugs and virus mutations from literature. RESULTS: In this work we present a novel method to extract and combine relationships between HIV drugs and mutations in viral genomes. Our extraction method is based on natural language processing (NLP) which produces grammatical relations and applies a set of rules to these relations. We applied our method to a relevant set of PubMed abstracts and obtained 2,434 extracted relations with an estimated performance of 84% for F-score. We then combined the extracted relations using logistic regression to generate resistance values for each <drug, mutation> pair. The results of this relation combination show more than 85% agreement with the Stanford HIVDB for the ten most frequently occurring mutations. The system is used in 5 hospitals from the Virolab project http://www.virolab.org to preselect the most relevant novel resistance data from literature and present those to virologists and medical doctors for further evaluation. CONCLUSIONS: The proposed relation extraction and combination method has a good performance on extracting HIV drug resistance data. It can be used in large-scale relation extraction experiments. The developed methods can also be applied to extract other type of relations such as gene-protein, gene-disease, and disease-mutation.
format Text
id pubmed-2841207
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28412072010-03-18 Extracting causal relations on HIV drug resistance from literature Bui, Quoc-Chinh Nualláin, Breanndán Ó Boucher, Charles A Sloot, Peter MA BMC Bioinformatics Research article BACKGROUND: In HIV treatment it is critical to have up-to-date resistance data of applicable drugs since HIV has a very high rate of mutation. These data are made available through scientific publications and must be extracted manually by experts in order to be used by virologists and medical doctors. Therefore there is an urgent need for a tool that partially automates this process and is able to retrieve relations between drugs and virus mutations from literature. RESULTS: In this work we present a novel method to extract and combine relationships between HIV drugs and mutations in viral genomes. Our extraction method is based on natural language processing (NLP) which produces grammatical relations and applies a set of rules to these relations. We applied our method to a relevant set of PubMed abstracts and obtained 2,434 extracted relations with an estimated performance of 84% for F-score. We then combined the extracted relations using logistic regression to generate resistance values for each <drug, mutation> pair. The results of this relation combination show more than 85% agreement with the Stanford HIVDB for the ten most frequently occurring mutations. The system is used in 5 hospitals from the Virolab project http://www.virolab.org to preselect the most relevant novel resistance data from literature and present those to virologists and medical doctors for further evaluation. CONCLUSIONS: The proposed relation extraction and combination method has a good performance on extracting HIV drug resistance data. It can be used in large-scale relation extraction experiments. The developed methods can also be applied to extract other type of relations such as gene-protein, gene-disease, and disease-mutation. BioMed Central 2010-02-23 /pmc/articles/PMC2841207/ /pubmed/20178611 http://dx.doi.org/10.1186/1471-2105-11-101 Text en Copyright ©2010 Bui et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research article
Bui, Quoc-Chinh
Nualláin, Breanndán Ó
Boucher, Charles A
Sloot, Peter MA
Extracting causal relations on HIV drug resistance from literature
title Extracting causal relations on HIV drug resistance from literature
title_full Extracting causal relations on HIV drug resistance from literature
title_fullStr Extracting causal relations on HIV drug resistance from literature
title_full_unstemmed Extracting causal relations on HIV drug resistance from literature
title_short Extracting causal relations on HIV drug resistance from literature
title_sort extracting causal relations on hiv drug resistance from literature
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2841207/
https://www.ncbi.nlm.nih.gov/pubmed/20178611
http://dx.doi.org/10.1186/1471-2105-11-101
work_keys_str_mv AT buiquocchinh extractingcausalrelationsonhivdrugresistancefromliterature
AT nuallainbreanndano extractingcausalrelationsonhivdrugresistancefromliterature
AT bouchercharlesa extractingcausalrelationsonhivdrugresistancefromliterature
AT slootpeterma extractingcausalrelationsonhivdrugresistancefromliterature