Cargando…

Clinic-Genomic Association Mining for Colorectal Cancer Using Publicly Available Datasets

In recent years, a growing number of researchers began to focus on how to establish associations between clinical and genomic data. However, up to now, there is lack of research mining clinic-genomic associations by comprehensively analysing available gene expression data for a single disease. Color...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Fang, Feng, Yaning, Li, Zhenye, Pan, Chao, Su, Yuncong, Yang, Rui, Song, Liying, Duan, Huilong, Deng, Ning
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4060771/
https://www.ncbi.nlm.nih.gov/pubmed/24987669
http://dx.doi.org/10.1155/2014/170289
_version_ 1782321408113639424
author Liu, Fang
Feng, Yaning
Li, Zhenye
Pan, Chao
Su, Yuncong
Yang, Rui
Song, Liying
Duan, Huilong
Deng, Ning
author_facet Liu, Fang
Feng, Yaning
Li, Zhenye
Pan, Chao
Su, Yuncong
Yang, Rui
Song, Liying
Duan, Huilong
Deng, Ning
author_sort Liu, Fang
collection PubMed
description In recent years, a growing number of researchers began to focus on how to establish associations between clinical and genomic data. However, up to now, there is lack of research mining clinic-genomic associations by comprehensively analysing available gene expression data for a single disease. Colorectal cancer is one of the malignant tumours. A number of genetic syndromes have been proven to be associated with colorectal cancer. This paper presents our research on mining clinic-genomic associations for colorectal cancer under biomedical big data environment. The proposed method is engineered with multiple technologies, including extracting clinical concepts using the unified medical language system (UMLS), extracting genes through the literature mining, and mining clinic-genomic associations through statistical analysis. We applied this method to datasets extracted from both gene expression omnibus (GEO) and genetic association database (GAD). A total of 23517 clinic-genomic associations between 139 clinical concepts and 7914 genes were obtained, of which 3474 associations between 31 clinical concepts and 1689 genes were identified as highly reliable ones. Evaluation and interpretation were performed using UMLS, KEGG, and Gephi, and potential new discoveries were explored. The proposed method is effective in mining valuable knowledge from available biomedical big data and achieves a good performance in bridging clinical data with genomic data for colorectal cancer.
format Online
Article
Text
id pubmed-4060771
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-40607712014-07-01 Clinic-Genomic Association Mining for Colorectal Cancer Using Publicly Available Datasets Liu, Fang Feng, Yaning Li, Zhenye Pan, Chao Su, Yuncong Yang, Rui Song, Liying Duan, Huilong Deng, Ning Biomed Res Int Research Article In recent years, a growing number of researchers began to focus on how to establish associations between clinical and genomic data. However, up to now, there is lack of research mining clinic-genomic associations by comprehensively analysing available gene expression data for a single disease. Colorectal cancer is one of the malignant tumours. A number of genetic syndromes have been proven to be associated with colorectal cancer. This paper presents our research on mining clinic-genomic associations for colorectal cancer under biomedical big data environment. The proposed method is engineered with multiple technologies, including extracting clinical concepts using the unified medical language system (UMLS), extracting genes through the literature mining, and mining clinic-genomic associations through statistical analysis. We applied this method to datasets extracted from both gene expression omnibus (GEO) and genetic association database (GAD). A total of 23517 clinic-genomic associations between 139 clinical concepts and 7914 genes were obtained, of which 3474 associations between 31 clinical concepts and 1689 genes were identified as highly reliable ones. Evaluation and interpretation were performed using UMLS, KEGG, and Gephi, and potential new discoveries were explored. The proposed method is effective in mining valuable knowledge from available biomedical big data and achieves a good performance in bridging clinical data with genomic data for colorectal cancer. Hindawi Publishing Corporation 2014 2014-06-02 /pmc/articles/PMC4060771/ /pubmed/24987669 http://dx.doi.org/10.1155/2014/170289 Text en Copyright © 2014 Fang Liu et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Liu, Fang
Feng, Yaning
Li, Zhenye
Pan, Chao
Su, Yuncong
Yang, Rui
Song, Liying
Duan, Huilong
Deng, Ning
Clinic-Genomic Association Mining for Colorectal Cancer Using Publicly Available Datasets
title Clinic-Genomic Association Mining for Colorectal Cancer Using Publicly Available Datasets
title_full Clinic-Genomic Association Mining for Colorectal Cancer Using Publicly Available Datasets
title_fullStr Clinic-Genomic Association Mining for Colorectal Cancer Using Publicly Available Datasets
title_full_unstemmed Clinic-Genomic Association Mining for Colorectal Cancer Using Publicly Available Datasets
title_short Clinic-Genomic Association Mining for Colorectal Cancer Using Publicly Available Datasets
title_sort clinic-genomic association mining for colorectal cancer using publicly available datasets
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4060771/
https://www.ncbi.nlm.nih.gov/pubmed/24987669
http://dx.doi.org/10.1155/2014/170289
work_keys_str_mv AT liufang clinicgenomicassociationminingforcolorectalcancerusingpubliclyavailabledatasets
AT fengyaning clinicgenomicassociationminingforcolorectalcancerusingpubliclyavailabledatasets
AT lizhenye clinicgenomicassociationminingforcolorectalcancerusingpubliclyavailabledatasets
AT panchao clinicgenomicassociationminingforcolorectalcancerusingpubliclyavailabledatasets
AT suyuncong clinicgenomicassociationminingforcolorectalcancerusingpubliclyavailabledatasets
AT yangrui clinicgenomicassociationminingforcolorectalcancerusingpubliclyavailabledatasets
AT songliying clinicgenomicassociationminingforcolorectalcancerusingpubliclyavailabledatasets
AT duanhuilong clinicgenomicassociationminingforcolorectalcancerusingpubliclyavailabledatasets
AT dengning clinicgenomicassociationminingforcolorectalcancerusingpubliclyavailabledatasets