Cargando…

Document triage for identifying protein–protein interactions affected by mutations: a neural network ensemble approach

The precision medicine (PM) initiative promises to identify individualized treatment depending on a patients’ genetic profile and their related responses. In order to help health professionals and researchers in the PM endeavor, BioCreative VI organized a PM Track to mine protein–protein interaction...

Descripción completa

Detalles Bibliográficos
Autores principales: Luo, Ling, Yang, Zhihao, Lin, Hongfei, Wang, Jian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6147215/
https://www.ncbi.nlm.nih.gov/pubmed/30295718
http://dx.doi.org/10.1093/database/bay097
_version_ 1783356531180306432
author Luo, Ling
Yang, Zhihao
Lin, Hongfei
Wang, Jian
author_facet Luo, Ling
Yang, Zhihao
Lin, Hongfei
Wang, Jian
author_sort Luo, Ling
collection PubMed
description The precision medicine (PM) initiative promises to identify individualized treatment depending on a patients’ genetic profile and their related responses. In order to help health professionals and researchers in the PM endeavor, BioCreative VI organized a PM Track to mine protein–protein interactions (PPI) affected by genetic mutations from the biomedical literature. In this paper, we present a neural network ensemble approach to identify relevant articles describing PPI affected by mutations. In this approach, several neural network models are used for document triage, and the ensemble performs better than any individual model. In the official runs, our best submission achieves an F-score of 69.04% in the BioCreative VI PM document triage task. After post-challenge analysis, to address the problem of the limited size of training set, a PPI pre-trained module is incorporated into our approach to further improve the performance. Finally, our best ensemble method achieves an F-score of 71.04% on the test set.
format Online
Article
Text
id pubmed-6147215
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-61472152018-09-25 Document triage for identifying protein–protein interactions affected by mutations: a neural network ensemble approach Luo, Ling Yang, Zhihao Lin, Hongfei Wang, Jian Database (Oxford) Original Article The precision medicine (PM) initiative promises to identify individualized treatment depending on a patients’ genetic profile and their related responses. In order to help health professionals and researchers in the PM endeavor, BioCreative VI organized a PM Track to mine protein–protein interactions (PPI) affected by genetic mutations from the biomedical literature. In this paper, we present a neural network ensemble approach to identify relevant articles describing PPI affected by mutations. In this approach, several neural network models are used for document triage, and the ensemble performs better than any individual model. In the official runs, our best submission achieves an F-score of 69.04% in the BioCreative VI PM document triage task. After post-challenge analysis, to address the problem of the limited size of training set, a PPI pre-trained module is incorporated into our approach to further improve the performance. Finally, our best ensemble method achieves an F-score of 71.04% on the test set. Oxford University Press 2018-09-19 /pmc/articles/PMC6147215/ /pubmed/30295718 http://dx.doi.org/10.1093/database/bay097 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Luo, Ling
Yang, Zhihao
Lin, Hongfei
Wang, Jian
Document triage for identifying protein–protein interactions affected by mutations: a neural network ensemble approach
title Document triage for identifying protein–protein interactions affected by mutations: a neural network ensemble approach
title_full Document triage for identifying protein–protein interactions affected by mutations: a neural network ensemble approach
title_fullStr Document triage for identifying protein–protein interactions affected by mutations: a neural network ensemble approach
title_full_unstemmed Document triage for identifying protein–protein interactions affected by mutations: a neural network ensemble approach
title_short Document triage for identifying protein–protein interactions affected by mutations: a neural network ensemble approach
title_sort document triage for identifying protein–protein interactions affected by mutations: a neural network ensemble approach
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6147215/
https://www.ncbi.nlm.nih.gov/pubmed/30295718
http://dx.doi.org/10.1093/database/bay097
work_keys_str_mv AT luoling documenttriageforidentifyingproteinproteininteractionsaffectedbymutationsaneuralnetworkensembleapproach
AT yangzhihao documenttriageforidentifyingproteinproteininteractionsaffectedbymutationsaneuralnetworkensembleapproach
AT linhongfei documenttriageforidentifyingproteinproteininteractionsaffectedbymutationsaneuralnetworkensembleapproach
AT wangjian documenttriageforidentifyingproteinproteininteractionsaffectedbymutationsaneuralnetworkensembleapproach