Cargando…

GIDB: a knowledge database for the automated curation and multidimensional analysis of molecular signatures in gastrointestinal cancer

Gastrointestinal (GI) cancer is common, characterized by high mortality, and includes oesophagus, gastric, liver, bile duct, pancreas, rectal and colon cancers. The insufficient specificity and sensitivity of biomarkers is still a key clinical hindrance for GI cancer diagnosis and successful treatme...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Ying, Wang, Yueqian, Wang, Shuangkuai, Tong, Yuantao, Jin, Ling, Zong, Hui, Zheng, Rongbin, Yang, Jinxuan, Zhang, Zeyu, Ouyang, En, Zhou, Mengyan, Zhang, Xiaoyan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6517830/
https://www.ncbi.nlm.nih.gov/pubmed/31089686
http://dx.doi.org/10.1093/database/baz051
_version_ 1783418333509451776
author Wang, Ying
Wang, Yueqian
Wang, Shuangkuai
Tong, Yuantao
Jin, Ling
Zong, Hui
Zheng, Rongbin
Yang, Jinxuan
Zhang, Zeyu
Ouyang, En
Zhou, Mengyan
Zhang, Xiaoyan
author_facet Wang, Ying
Wang, Yueqian
Wang, Shuangkuai
Tong, Yuantao
Jin, Ling
Zong, Hui
Zheng, Rongbin
Yang, Jinxuan
Zhang, Zeyu
Ouyang, En
Zhou, Mengyan
Zhang, Xiaoyan
author_sort Wang, Ying
collection PubMed
description Gastrointestinal (GI) cancer is common, characterized by high mortality, and includes oesophagus, gastric, liver, bile duct, pancreas, rectal and colon cancers. The insufficient specificity and sensitivity of biomarkers is still a key clinical hindrance for GI cancer diagnosis and successful treatment. The emergence of `precision medicine’, `basket trial’ and `field cancerization’ concepts calls for an urgent need and importance for the understanding of how organ system cancers occur at the molecular levels. Knowledge from both the literature and data available in public databases is informative in elucidating the molecular alterations underlying GI cancer. Currently, most available cancer databases have not offered a comprehensive discovery of gene-disease associations, molecular alterations and clinical information by integrated text mining and data mining in GI cancer. We develop GIDB, a panoptic knowledge database that attempts to automate the curation of molecular signatures using natural language processing approaches and multidimensional analyses. GIDB covers information on 8730 genes with both literature and data supporting evidence, 248 miRNAs, 58 lncRNAs, 320 copy number variations, 49 fusion genes and 2381 semantic networks. It presents a comprehensive database, not only in parallelizing supporting evidence and data integration for signatures associated with GI cancer but also in providing the timeline feature of major molecular discoveries. It highlights the most comprehensive overview, research hotspots and the development of historical knowledge of genes in GI cancer. Furthermore, GIDB characterizes genomic abnormalities in multilevel analysis, including simple somatic mutations, gene expression, DNA methylation and prognosis. GIDB offers a user-friendly interface and two customizable online tools (Heatmap and Network) for experimental researchers and clinicians to explore data and help them shorten the learning curve and broaden the scope of knowledge. More importantly, GIDB is an ongoing research project that will continue to be updated and improve the automated method for reducing manual work.
format Online
Article
Text
id pubmed-6517830
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-65178302019-05-20 GIDB: a knowledge database for the automated curation and multidimensional analysis of molecular signatures in gastrointestinal cancer Wang, Ying Wang, Yueqian Wang, Shuangkuai Tong, Yuantao Jin, Ling Zong, Hui Zheng, Rongbin Yang, Jinxuan Zhang, Zeyu Ouyang, En Zhou, Mengyan Zhang, Xiaoyan Database (Oxford) Original Article Gastrointestinal (GI) cancer is common, characterized by high mortality, and includes oesophagus, gastric, liver, bile duct, pancreas, rectal and colon cancers. The insufficient specificity and sensitivity of biomarkers is still a key clinical hindrance for GI cancer diagnosis and successful treatment. The emergence of `precision medicine’, `basket trial’ and `field cancerization’ concepts calls for an urgent need and importance for the understanding of how organ system cancers occur at the molecular levels. Knowledge from both the literature and data available in public databases is informative in elucidating the molecular alterations underlying GI cancer. Currently, most available cancer databases have not offered a comprehensive discovery of gene-disease associations, molecular alterations and clinical information by integrated text mining and data mining in GI cancer. We develop GIDB, a panoptic knowledge database that attempts to automate the curation of molecular signatures using natural language processing approaches and multidimensional analyses. GIDB covers information on 8730 genes with both literature and data supporting evidence, 248 miRNAs, 58 lncRNAs, 320 copy number variations, 49 fusion genes and 2381 semantic networks. It presents a comprehensive database, not only in parallelizing supporting evidence and data integration for signatures associated with GI cancer but also in providing the timeline feature of major molecular discoveries. It highlights the most comprehensive overview, research hotspots and the development of historical knowledge of genes in GI cancer. Furthermore, GIDB characterizes genomic abnormalities in multilevel analysis, including simple somatic mutations, gene expression, DNA methylation and prognosis. GIDB offers a user-friendly interface and two customizable online tools (Heatmap and Network) for experimental researchers and clinicians to explore data and help them shorten the learning curve and broaden the scope of knowledge. More importantly, GIDB is an ongoing research project that will continue to be updated and improve the automated method for reducing manual work. Oxford University Press 2019-05-15 /pmc/articles/PMC6517830/ /pubmed/31089686 http://dx.doi.org/10.1093/database/baz051 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Wang, Ying
Wang, Yueqian
Wang, Shuangkuai
Tong, Yuantao
Jin, Ling
Zong, Hui
Zheng, Rongbin
Yang, Jinxuan
Zhang, Zeyu
Ouyang, En
Zhou, Mengyan
Zhang, Xiaoyan
GIDB: a knowledge database for the automated curation and multidimensional analysis of molecular signatures in gastrointestinal cancer
title GIDB: a knowledge database for the automated curation and multidimensional analysis of molecular signatures in gastrointestinal cancer
title_full GIDB: a knowledge database for the automated curation and multidimensional analysis of molecular signatures in gastrointestinal cancer
title_fullStr GIDB: a knowledge database for the automated curation and multidimensional analysis of molecular signatures in gastrointestinal cancer
title_full_unstemmed GIDB: a knowledge database for the automated curation and multidimensional analysis of molecular signatures in gastrointestinal cancer
title_short GIDB: a knowledge database for the automated curation and multidimensional analysis of molecular signatures in gastrointestinal cancer
title_sort gidb: a knowledge database for the automated curation and multidimensional analysis of molecular signatures in gastrointestinal cancer
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6517830/
https://www.ncbi.nlm.nih.gov/pubmed/31089686
http://dx.doi.org/10.1093/database/baz051
work_keys_str_mv AT wangying gidbaknowledgedatabasefortheautomatedcurationandmultidimensionalanalysisofmolecularsignaturesingastrointestinalcancer
AT wangyueqian gidbaknowledgedatabasefortheautomatedcurationandmultidimensionalanalysisofmolecularsignaturesingastrointestinalcancer
AT wangshuangkuai gidbaknowledgedatabasefortheautomatedcurationandmultidimensionalanalysisofmolecularsignaturesingastrointestinalcancer
AT tongyuantao gidbaknowledgedatabasefortheautomatedcurationandmultidimensionalanalysisofmolecularsignaturesingastrointestinalcancer
AT jinling gidbaknowledgedatabasefortheautomatedcurationandmultidimensionalanalysisofmolecularsignaturesingastrointestinalcancer
AT zonghui gidbaknowledgedatabasefortheautomatedcurationandmultidimensionalanalysisofmolecularsignaturesingastrointestinalcancer
AT zhengrongbin gidbaknowledgedatabasefortheautomatedcurationandmultidimensionalanalysisofmolecularsignaturesingastrointestinalcancer
AT yangjinxuan gidbaknowledgedatabasefortheautomatedcurationandmultidimensionalanalysisofmolecularsignaturesingastrointestinalcancer
AT zhangzeyu gidbaknowledgedatabasefortheautomatedcurationandmultidimensionalanalysisofmolecularsignaturesingastrointestinalcancer
AT ouyangen gidbaknowledgedatabasefortheautomatedcurationandmultidimensionalanalysisofmolecularsignaturesingastrointestinalcancer
AT zhoumengyan gidbaknowledgedatabasefortheautomatedcurationandmultidimensionalanalysisofmolecularsignaturesingastrointestinalcancer
AT zhangxiaoyan gidbaknowledgedatabasefortheautomatedcurationandmultidimensionalanalysisofmolecularsignaturesingastrointestinalcancer