Cargando…
Biomarker identification of hepatocellular carcinoma using a methodical literature mining strategy
Hepatocellular carcinoma (HCC), one of the most common causes of cancer-related deaths, carries a 5-year survival rate of 18%, underscoring the need for robust biomarkers. In spite of the increased availability of HCC related literatures, many of the promising biomarkers reported have not been valid...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7243925/ https://www.ncbi.nlm.nih.gov/pubmed/31725857 http://dx.doi.org/10.1093/database/bax082 |
_version_ | 1783537488881516544 |
---|---|
author | Chang, Nai-Wen Dai, Hong-Jie Shih, Yung-Yu Wu, Chi-Yang Dela Rosa, Mira Anne C Obena, Rofeamor P Chen, Yu-Ju Hsu, Wen-Lian Oyang, Yen-Jen |
author_facet | Chang, Nai-Wen Dai, Hong-Jie Shih, Yung-Yu Wu, Chi-Yang Dela Rosa, Mira Anne C Obena, Rofeamor P Chen, Yu-Ju Hsu, Wen-Lian Oyang, Yen-Jen |
author_sort | Chang, Nai-Wen |
collection | PubMed |
description | Hepatocellular carcinoma (HCC), one of the most common causes of cancer-related deaths, carries a 5-year survival rate of 18%, underscoring the need for robust biomarkers. In spite of the increased availability of HCC related literatures, many of the promising biomarkers reported have not been validated for clinical use. To narrow down the wide range of possible biomarkers for further clinical validation, bioinformaticians need to sort them out using information provided in published works. Biomedical text mining is an automated way to obtain information of interest within the massive collection of biomedical knowledge, thus enabling extraction of data for biomarkers associated with certain diseases. This method can significantly reduce both the time and effort spent on studying important maladies such as liver diseases. Herein, we report a text mining-aided curation pipeline to identify potential biomarkers for liver cancer. The curation pipeline integrates PubMed E-Utilities to collect abstracts from PubMed and recognize several types of named entities by machine learning-based and pattern-based methods. Genes/proteins from evidential sentences were classified as candidate biomarkers using a convolutional neural network. Lastly, extracted biomarkers were ranked depending on several criteria, such as the frequency of keywords and articles and the journal impact factor, and then integrated into a meaningful list for bioinformaticians. Based on the developed pipeline, we constructed MarkerHub, which contains 2128 candidate biomarkers extracted from PubMed publications from 2008 to 2017. Database URL: http://markerhub.iis.sinica.edu.tw |
format | Online Article Text |
id | pubmed-7243925 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-72439252020-05-27 Biomarker identification of hepatocellular carcinoma using a methodical literature mining strategy Chang, Nai-Wen Dai, Hong-Jie Shih, Yung-Yu Wu, Chi-Yang Dela Rosa, Mira Anne C Obena, Rofeamor P Chen, Yu-Ju Hsu, Wen-Lian Oyang, Yen-Jen Database (Oxford) Original Article Hepatocellular carcinoma (HCC), one of the most common causes of cancer-related deaths, carries a 5-year survival rate of 18%, underscoring the need for robust biomarkers. In spite of the increased availability of HCC related literatures, many of the promising biomarkers reported have not been validated for clinical use. To narrow down the wide range of possible biomarkers for further clinical validation, bioinformaticians need to sort them out using information provided in published works. Biomedical text mining is an automated way to obtain information of interest within the massive collection of biomedical knowledge, thus enabling extraction of data for biomarkers associated with certain diseases. This method can significantly reduce both the time and effort spent on studying important maladies such as liver diseases. Herein, we report a text mining-aided curation pipeline to identify potential biomarkers for liver cancer. The curation pipeline integrates PubMed E-Utilities to collect abstracts from PubMed and recognize several types of named entities by machine learning-based and pattern-based methods. Genes/proteins from evidential sentences were classified as candidate biomarkers using a convolutional neural network. Lastly, extracted biomarkers were ranked depending on several criteria, such as the frequency of keywords and articles and the journal impact factor, and then integrated into a meaningful list for bioinformaticians. Based on the developed pipeline, we constructed MarkerHub, which contains 2128 candidate biomarkers extracted from PubMed publications from 2008 to 2017. Database URL: http://markerhub.iis.sinica.edu.tw Oxford University Press 2017-12-08 /pmc/articles/PMC7243925/ /pubmed/31725857 http://dx.doi.org/10.1093/database/bax082 Text en © The Author(s) 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Chang, Nai-Wen Dai, Hong-Jie Shih, Yung-Yu Wu, Chi-Yang Dela Rosa, Mira Anne C Obena, Rofeamor P Chen, Yu-Ju Hsu, Wen-Lian Oyang, Yen-Jen Biomarker identification of hepatocellular carcinoma using a methodical literature mining strategy |
title | Biomarker identification of hepatocellular carcinoma using a methodical literature mining strategy |
title_full | Biomarker identification of hepatocellular carcinoma using a methodical literature mining strategy |
title_fullStr | Biomarker identification of hepatocellular carcinoma using a methodical literature mining strategy |
title_full_unstemmed | Biomarker identification of hepatocellular carcinoma using a methodical literature mining strategy |
title_short | Biomarker identification of hepatocellular carcinoma using a methodical literature mining strategy |
title_sort | biomarker identification of hepatocellular carcinoma using a methodical literature mining strategy |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7243925/ https://www.ncbi.nlm.nih.gov/pubmed/31725857 http://dx.doi.org/10.1093/database/bax082 |
work_keys_str_mv | AT changnaiwen biomarkeridentificationofhepatocellularcarcinomausingamethodicalliteratureminingstrategy AT daihongjie biomarkeridentificationofhepatocellularcarcinomausingamethodicalliteratureminingstrategy AT shihyungyu biomarkeridentificationofhepatocellularcarcinomausingamethodicalliteratureminingstrategy AT wuchiyang biomarkeridentificationofhepatocellularcarcinomausingamethodicalliteratureminingstrategy AT delarosamiraannec biomarkeridentificationofhepatocellularcarcinomausingamethodicalliteratureminingstrategy AT obenarofeamorp biomarkeridentificationofhepatocellularcarcinomausingamethodicalliteratureminingstrategy AT chenyuju biomarkeridentificationofhepatocellularcarcinomausingamethodicalliteratureminingstrategy AT hsuwenlian biomarkeridentificationofhepatocellularcarcinomausingamethodicalliteratureminingstrategy AT oyangyenjen biomarkeridentificationofhepatocellularcarcinomausingamethodicalliteratureminingstrategy |