Cargando…
Towards precision medicine: discovering novel gynecological cancer biomarkers and pathways using linked data
BACKGROUND: Next Generation Sequencing (NGS) is playing a key role in therapeutic decision making for the cancer prognosis and treatment. The NGS technologies are producing a massive amount of sequencing datasets. Often, these datasets are published from the isolated and different sequencing facilit...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5606033/ https://www.ncbi.nlm.nih.gov/pubmed/28927463 http://dx.doi.org/10.1186/s13326-017-0146-9 |
_version_ | 1783265088384270336 |
---|---|
author | Jha, Alokkumar Khan, Yasar Mehdi, Muntazir Karim, Md Rezaul Mehmood, Qaiser Zappa, Achille Rebholz-Schuhmann, Dietrich Sahay, Ratnesh |
author_facet | Jha, Alokkumar Khan, Yasar Mehdi, Muntazir Karim, Md Rezaul Mehmood, Qaiser Zappa, Achille Rebholz-Schuhmann, Dietrich Sahay, Ratnesh |
author_sort | Jha, Alokkumar |
collection | PubMed |
description | BACKGROUND: Next Generation Sequencing (NGS) is playing a key role in therapeutic decision making for the cancer prognosis and treatment. The NGS technologies are producing a massive amount of sequencing datasets. Often, these datasets are published from the isolated and different sequencing facilities. Consequently, the process of sharing and aggregating multisite sequencing datasets are thwarted by issues such as the need to discover relevant data from different sources, built scalable repositories, the automation of data linkage, the volume of the data, efficient querying mechanism, and information rich intuitive visualisation. RESULTS: We present an approach to link and query different sequencing datasets (TCGA, COSMIC, REACTOME, KEGG and GO) to indicate risks for four cancer types – Ovarian Serous Cystadenocarcinoma (OV), Uterine Corpus Endometrial Carcinoma (UCEC), Uterine Carcinosarcoma (UCS), Cervical Squamous Cell Carcinoma and Endocervical Adenocarcinoma (CESC) – covering the 16 healthy tissue-specific genes from Illumina Human Body Map 2.0. The differentially expressed genes from Illumina Human Body Map 2.0 are analysed together with the gene expressions reported in COSMIC and TCGA repositories leading to the discover of potential biomarkers for a tissue-specific cancer. CONCLUSION: We analyse the tissue expression of genes, copy number variation (CNV), somatic mutation, and promoter methylation to identify associated pathways and find novel biomarkers. We discovered twenty (20) mutated genes and three (3) potential pathways causing promoter changes in different gynaecological cancer types. We propose a data-interlinked platform called BIOOPENER that glues together heterogeneous cancer and biomedical repositories. The key approach is to find correspondences (or data links) among genetic, cellular and molecular features across isolated cancer datasets giving insight into cancer progression from normal to diseased tissues. The proposed BIOOPENER platform enriches mutations by filling in missing links from TCGA, COSMIC, REACTOME, KEGG and GO datasets and provides an interlinking mechanism to understand cancer progression from normal to diseased tissues with pathway components, which in turn helped to map mutations, associated phenotypes, pathways, and mechanism. |
format | Online Article Text |
id | pubmed-5606033 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-56060332017-09-20 Towards precision medicine: discovering novel gynecological cancer biomarkers and pathways using linked data Jha, Alokkumar Khan, Yasar Mehdi, Muntazir Karim, Md Rezaul Mehmood, Qaiser Zappa, Achille Rebholz-Schuhmann, Dietrich Sahay, Ratnesh J Biomed Semantics Research BACKGROUND: Next Generation Sequencing (NGS) is playing a key role in therapeutic decision making for the cancer prognosis and treatment. The NGS technologies are producing a massive amount of sequencing datasets. Often, these datasets are published from the isolated and different sequencing facilities. Consequently, the process of sharing and aggregating multisite sequencing datasets are thwarted by issues such as the need to discover relevant data from different sources, built scalable repositories, the automation of data linkage, the volume of the data, efficient querying mechanism, and information rich intuitive visualisation. RESULTS: We present an approach to link and query different sequencing datasets (TCGA, COSMIC, REACTOME, KEGG and GO) to indicate risks for four cancer types – Ovarian Serous Cystadenocarcinoma (OV), Uterine Corpus Endometrial Carcinoma (UCEC), Uterine Carcinosarcoma (UCS), Cervical Squamous Cell Carcinoma and Endocervical Adenocarcinoma (CESC) – covering the 16 healthy tissue-specific genes from Illumina Human Body Map 2.0. The differentially expressed genes from Illumina Human Body Map 2.0 are analysed together with the gene expressions reported in COSMIC and TCGA repositories leading to the discover of potential biomarkers for a tissue-specific cancer. CONCLUSION: We analyse the tissue expression of genes, copy number variation (CNV), somatic mutation, and promoter methylation to identify associated pathways and find novel biomarkers. We discovered twenty (20) mutated genes and three (3) potential pathways causing promoter changes in different gynaecological cancer types. We propose a data-interlinked platform called BIOOPENER that glues together heterogeneous cancer and biomedical repositories. The key approach is to find correspondences (or data links) among genetic, cellular and molecular features across isolated cancer datasets giving insight into cancer progression from normal to diseased tissues. The proposed BIOOPENER platform enriches mutations by filling in missing links from TCGA, COSMIC, REACTOME, KEGG and GO datasets and provides an interlinking mechanism to understand cancer progression from normal to diseased tissues with pathway components, which in turn helped to map mutations, associated phenotypes, pathways, and mechanism. BioMed Central 2017-09-19 /pmc/articles/PMC5606033/ /pubmed/28927463 http://dx.doi.org/10.1186/s13326-017-0146-9 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Jha, Alokkumar Khan, Yasar Mehdi, Muntazir Karim, Md Rezaul Mehmood, Qaiser Zappa, Achille Rebholz-Schuhmann, Dietrich Sahay, Ratnesh Towards precision medicine: discovering novel gynecological cancer biomarkers and pathways using linked data |
title | Towards precision medicine: discovering novel gynecological cancer biomarkers and pathways using linked data |
title_full | Towards precision medicine: discovering novel gynecological cancer biomarkers and pathways using linked data |
title_fullStr | Towards precision medicine: discovering novel gynecological cancer biomarkers and pathways using linked data |
title_full_unstemmed | Towards precision medicine: discovering novel gynecological cancer biomarkers and pathways using linked data |
title_short | Towards precision medicine: discovering novel gynecological cancer biomarkers and pathways using linked data |
title_sort | towards precision medicine: discovering novel gynecological cancer biomarkers and pathways using linked data |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5606033/ https://www.ncbi.nlm.nih.gov/pubmed/28927463 http://dx.doi.org/10.1186/s13326-017-0146-9 |
work_keys_str_mv | AT jhaalokkumar towardsprecisionmedicinediscoveringnovelgynecologicalcancerbiomarkersandpathwaysusinglinkeddata AT khanyasar towardsprecisionmedicinediscoveringnovelgynecologicalcancerbiomarkersandpathwaysusinglinkeddata AT mehdimuntazir towardsprecisionmedicinediscoveringnovelgynecologicalcancerbiomarkersandpathwaysusinglinkeddata AT karimmdrezaul towardsprecisionmedicinediscoveringnovelgynecologicalcancerbiomarkersandpathwaysusinglinkeddata AT mehmoodqaiser towardsprecisionmedicinediscoveringnovelgynecologicalcancerbiomarkersandpathwaysusinglinkeddata AT zappaachille towardsprecisionmedicinediscoveringnovelgynecologicalcancerbiomarkersandpathwaysusinglinkeddata AT rebholzschuhmanndietrich towardsprecisionmedicinediscoveringnovelgynecologicalcancerbiomarkersandpathwaysusinglinkeddata AT sahayratnesh towardsprecisionmedicinediscoveringnovelgynecologicalcancerbiomarkersandpathwaysusinglinkeddata |