Cargando…

Biomarker identification of hepatocellular carcinoma using a methodical literature mining strategy

Hepatocellular carcinoma (HCC), one of the most common causes of cancer-related deaths, carries a 5-year survival rate of 18%, underscoring the need for robust biomarkers. In spite of the increased availability of HCC related literatures, many of the promising biomarkers reported have not been valid...

Descripción completa

Detalles Bibliográficos
Autores principales: Chang, Nai-Wen, Dai, Hong-Jie, Shih, Yung-Yu, Wu, Chi-Yang, Dela Rosa, Mira Anne C, Obena, Rofeamor P, Chen, Yu-Ju, Hsu, Wen-Lian, Oyang, Yen-Jen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7243925/
https://www.ncbi.nlm.nih.gov/pubmed/31725857
http://dx.doi.org/10.1093/database/bax082
_version_ 1783537488881516544
author Chang, Nai-Wen
Dai, Hong-Jie
Shih, Yung-Yu
Wu, Chi-Yang
Dela Rosa, Mira Anne C
Obena, Rofeamor P
Chen, Yu-Ju
Hsu, Wen-Lian
Oyang, Yen-Jen
author_facet Chang, Nai-Wen
Dai, Hong-Jie
Shih, Yung-Yu
Wu, Chi-Yang
Dela Rosa, Mira Anne C
Obena, Rofeamor P
Chen, Yu-Ju
Hsu, Wen-Lian
Oyang, Yen-Jen
author_sort Chang, Nai-Wen
collection PubMed
description Hepatocellular carcinoma (HCC), one of the most common causes of cancer-related deaths, carries a 5-year survival rate of 18%, underscoring the need for robust biomarkers. In spite of the increased availability of HCC related literatures, many of the promising biomarkers reported have not been validated for clinical use. To narrow down the wide range of possible biomarkers for further clinical validation, bioinformaticians need to sort them out using information provided in published works. Biomedical text mining is an automated way to obtain information of interest within the massive collection of biomedical knowledge, thus enabling extraction of data for biomarkers associated with certain diseases. This method can significantly reduce both the time and effort spent on studying important maladies such as liver diseases. Herein, we report a text mining-aided curation pipeline to identify potential biomarkers for liver cancer. The curation pipeline integrates PubMed E-Utilities to collect abstracts from PubMed and recognize several types of named entities by machine learning-based and pattern-based methods. Genes/proteins from evidential sentences were classified as candidate biomarkers using a convolutional neural network. Lastly, extracted biomarkers were ranked depending on several criteria, such as the frequency of keywords and articles and the journal impact factor, and then integrated into a meaningful list for bioinformaticians. Based on the developed pipeline, we constructed MarkerHub, which contains 2128 candidate biomarkers extracted from PubMed publications from 2008 to 2017. Database URL: http://markerhub.iis.sinica.edu.tw
format Online
Article
Text
id pubmed-7243925
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-72439252020-05-27 Biomarker identification of hepatocellular carcinoma using a methodical literature mining strategy Chang, Nai-Wen Dai, Hong-Jie Shih, Yung-Yu Wu, Chi-Yang Dela Rosa, Mira Anne C Obena, Rofeamor P Chen, Yu-Ju Hsu, Wen-Lian Oyang, Yen-Jen Database (Oxford) Original Article Hepatocellular carcinoma (HCC), one of the most common causes of cancer-related deaths, carries a 5-year survival rate of 18%, underscoring the need for robust biomarkers. In spite of the increased availability of HCC related literatures, many of the promising biomarkers reported have not been validated for clinical use. To narrow down the wide range of possible biomarkers for further clinical validation, bioinformaticians need to sort them out using information provided in published works. Biomedical text mining is an automated way to obtain information of interest within the massive collection of biomedical knowledge, thus enabling extraction of data for biomarkers associated with certain diseases. This method can significantly reduce both the time and effort spent on studying important maladies such as liver diseases. Herein, we report a text mining-aided curation pipeline to identify potential biomarkers for liver cancer. The curation pipeline integrates PubMed E-Utilities to collect abstracts from PubMed and recognize several types of named entities by machine learning-based and pattern-based methods. Genes/proteins from evidential sentences were classified as candidate biomarkers using a convolutional neural network. Lastly, extracted biomarkers were ranked depending on several criteria, such as the frequency of keywords and articles and the journal impact factor, and then integrated into a meaningful list for bioinformaticians. Based on the developed pipeline, we constructed MarkerHub, which contains 2128 candidate biomarkers extracted from PubMed publications from 2008 to 2017. Database URL: http://markerhub.iis.sinica.edu.tw Oxford University Press 2017-12-08 /pmc/articles/PMC7243925/ /pubmed/31725857 http://dx.doi.org/10.1093/database/bax082 Text en © The Author(s) 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Chang, Nai-Wen
Dai, Hong-Jie
Shih, Yung-Yu
Wu, Chi-Yang
Dela Rosa, Mira Anne C
Obena, Rofeamor P
Chen, Yu-Ju
Hsu, Wen-Lian
Oyang, Yen-Jen
Biomarker identification of hepatocellular carcinoma using a methodical literature mining strategy
title Biomarker identification of hepatocellular carcinoma using a methodical literature mining strategy
title_full Biomarker identification of hepatocellular carcinoma using a methodical literature mining strategy
title_fullStr Biomarker identification of hepatocellular carcinoma using a methodical literature mining strategy
title_full_unstemmed Biomarker identification of hepatocellular carcinoma using a methodical literature mining strategy
title_short Biomarker identification of hepatocellular carcinoma using a methodical literature mining strategy
title_sort biomarker identification of hepatocellular carcinoma using a methodical literature mining strategy
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7243925/
https://www.ncbi.nlm.nih.gov/pubmed/31725857
http://dx.doi.org/10.1093/database/bax082
work_keys_str_mv AT changnaiwen biomarkeridentificationofhepatocellularcarcinomausingamethodicalliteratureminingstrategy
AT daihongjie biomarkeridentificationofhepatocellularcarcinomausingamethodicalliteratureminingstrategy
AT shihyungyu biomarkeridentificationofhepatocellularcarcinomausingamethodicalliteratureminingstrategy
AT wuchiyang biomarkeridentificationofhepatocellularcarcinomausingamethodicalliteratureminingstrategy
AT delarosamiraannec biomarkeridentificationofhepatocellularcarcinomausingamethodicalliteratureminingstrategy
AT obenarofeamorp biomarkeridentificationofhepatocellularcarcinomausingamethodicalliteratureminingstrategy
AT chenyuju biomarkeridentificationofhepatocellularcarcinomausingamethodicalliteratureminingstrategy
AT hsuwenlian biomarkeridentificationofhepatocellularcarcinomausingamethodicalliteratureminingstrategy
AT oyangyenjen biomarkeridentificationofhepatocellularcarcinomausingamethodicalliteratureminingstrategy