Cargando…
Document triage for identifying protein–protein interactions affected by mutations: a neural network ensemble approach
The precision medicine (PM) initiative promises to identify individualized treatment depending on a patients’ genetic profile and their related responses. In order to help health professionals and researchers in the PM endeavor, BioCreative VI organized a PM Track to mine protein–protein interaction...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6147215/ https://www.ncbi.nlm.nih.gov/pubmed/30295718 http://dx.doi.org/10.1093/database/bay097 |
_version_ | 1783356531180306432 |
---|---|
author | Luo, Ling Yang, Zhihao Lin, Hongfei Wang, Jian |
author_facet | Luo, Ling Yang, Zhihao Lin, Hongfei Wang, Jian |
author_sort | Luo, Ling |
collection | PubMed |
description | The precision medicine (PM) initiative promises to identify individualized treatment depending on a patients’ genetic profile and their related responses. In order to help health professionals and researchers in the PM endeavor, BioCreative VI organized a PM Track to mine protein–protein interactions (PPI) affected by genetic mutations from the biomedical literature. In this paper, we present a neural network ensemble approach to identify relevant articles describing PPI affected by mutations. In this approach, several neural network models are used for document triage, and the ensemble performs better than any individual model. In the official runs, our best submission achieves an F-score of 69.04% in the BioCreative VI PM document triage task. After post-challenge analysis, to address the problem of the limited size of training set, a PPI pre-trained module is incorporated into our approach to further improve the performance. Finally, our best ensemble method achieves an F-score of 71.04% on the test set. |
format | Online Article Text |
id | pubmed-6147215 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-61472152018-09-25 Document triage for identifying protein–protein interactions affected by mutations: a neural network ensemble approach Luo, Ling Yang, Zhihao Lin, Hongfei Wang, Jian Database (Oxford) Original Article The precision medicine (PM) initiative promises to identify individualized treatment depending on a patients’ genetic profile and their related responses. In order to help health professionals and researchers in the PM endeavor, BioCreative VI organized a PM Track to mine protein–protein interactions (PPI) affected by genetic mutations from the biomedical literature. In this paper, we present a neural network ensemble approach to identify relevant articles describing PPI affected by mutations. In this approach, several neural network models are used for document triage, and the ensemble performs better than any individual model. In the official runs, our best submission achieves an F-score of 69.04% in the BioCreative VI PM document triage task. After post-challenge analysis, to address the problem of the limited size of training set, a PPI pre-trained module is incorporated into our approach to further improve the performance. Finally, our best ensemble method achieves an F-score of 71.04% on the test set. Oxford University Press 2018-09-19 /pmc/articles/PMC6147215/ /pubmed/30295718 http://dx.doi.org/10.1093/database/bay097 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Luo, Ling Yang, Zhihao Lin, Hongfei Wang, Jian Document triage for identifying protein–protein interactions affected by mutations: a neural network ensemble approach |
title | Document triage for identifying protein–protein interactions affected by mutations: a neural network ensemble approach |
title_full | Document triage for identifying protein–protein interactions affected by mutations: a neural network ensemble approach |
title_fullStr | Document triage for identifying protein–protein interactions affected by mutations: a neural network ensemble approach |
title_full_unstemmed | Document triage for identifying protein–protein interactions affected by mutations: a neural network ensemble approach |
title_short | Document triage for identifying protein–protein interactions affected by mutations: a neural network ensemble approach |
title_sort | document triage for identifying protein–protein interactions affected by mutations: a neural network ensemble approach |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6147215/ https://www.ncbi.nlm.nih.gov/pubmed/30295718 http://dx.doi.org/10.1093/database/bay097 |
work_keys_str_mv | AT luoling documenttriageforidentifyingproteinproteininteractionsaffectedbymutationsaneuralnetworkensembleapproach AT yangzhihao documenttriageforidentifyingproteinproteininteractionsaffectedbymutationsaneuralnetworkensembleapproach AT linhongfei documenttriageforidentifyingproteinproteininteractionsaffectedbymutationsaneuralnetworkensembleapproach AT wangjian documenttriageforidentifyingproteinproteininteractionsaffectedbymutationsaneuralnetworkensembleapproach |