Cargando…

Towards precision medicine: discovering novel gynecological cancer biomarkers and pathways using linked data

BACKGROUND: Next Generation Sequencing (NGS) is playing a key role in therapeutic decision making for the cancer prognosis and treatment. The NGS technologies are producing a massive amount of sequencing datasets. Often, these datasets are published from the isolated and different sequencing facilit...

Descripción completa

Detalles Bibliográficos
Autores principales: Jha, Alokkumar, Khan, Yasar, Mehdi, Muntazir, Karim, Md Rezaul, Mehmood, Qaiser, Zappa, Achille, Rebholz-Schuhmann, Dietrich, Sahay, Ratnesh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5606033/
https://www.ncbi.nlm.nih.gov/pubmed/28927463
http://dx.doi.org/10.1186/s13326-017-0146-9
_version_ 1783265088384270336
author Jha, Alokkumar
Khan, Yasar
Mehdi, Muntazir
Karim, Md Rezaul
Mehmood, Qaiser
Zappa, Achille
Rebholz-Schuhmann, Dietrich
Sahay, Ratnesh
author_facet Jha, Alokkumar
Khan, Yasar
Mehdi, Muntazir
Karim, Md Rezaul
Mehmood, Qaiser
Zappa, Achille
Rebholz-Schuhmann, Dietrich
Sahay, Ratnesh
author_sort Jha, Alokkumar
collection PubMed
description BACKGROUND: Next Generation Sequencing (NGS) is playing a key role in therapeutic decision making for the cancer prognosis and treatment. The NGS technologies are producing a massive amount of sequencing datasets. Often, these datasets are published from the isolated and different sequencing facilities. Consequently, the process of sharing and aggregating multisite sequencing datasets are thwarted by issues such as the need to discover relevant data from different sources, built scalable repositories, the automation of data linkage, the volume of the data, efficient querying mechanism, and information rich intuitive visualisation. RESULTS: We present an approach to link and query different sequencing datasets (TCGA, COSMIC, REACTOME, KEGG and GO) to indicate risks for four cancer types – Ovarian Serous Cystadenocarcinoma (OV), Uterine Corpus Endometrial Carcinoma (UCEC), Uterine Carcinosarcoma (UCS), Cervical Squamous Cell Carcinoma and Endocervical Adenocarcinoma (CESC) – covering the 16 healthy tissue-specific genes from Illumina Human Body Map 2.0. The differentially expressed genes from Illumina Human Body Map 2.0 are analysed together with the gene expressions reported in COSMIC and TCGA repositories leading to the discover of potential biomarkers for a tissue-specific cancer. CONCLUSION: We analyse the tissue expression of genes, copy number variation (CNV), somatic mutation, and promoter methylation to identify associated pathways and find novel biomarkers. We discovered twenty (20) mutated genes and three (3) potential pathways causing promoter changes in different gynaecological cancer types. We propose a data-interlinked platform called BIOOPENER that glues together heterogeneous cancer and biomedical repositories. The key approach is to find correspondences (or data links) among genetic, cellular and molecular features across isolated cancer datasets giving insight into cancer progression from normal to diseased tissues. The proposed BIOOPENER platform enriches mutations by filling in missing links from TCGA, COSMIC, REACTOME, KEGG and GO datasets and provides an interlinking mechanism to understand cancer progression from normal to diseased tissues with pathway components, which in turn helped to map mutations, associated phenotypes, pathways, and mechanism.
format Online
Article
Text
id pubmed-5606033
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-56060332017-09-20 Towards precision medicine: discovering novel gynecological cancer biomarkers and pathways using linked data Jha, Alokkumar Khan, Yasar Mehdi, Muntazir Karim, Md Rezaul Mehmood, Qaiser Zappa, Achille Rebholz-Schuhmann, Dietrich Sahay, Ratnesh J Biomed Semantics Research BACKGROUND: Next Generation Sequencing (NGS) is playing a key role in therapeutic decision making for the cancer prognosis and treatment. The NGS technologies are producing a massive amount of sequencing datasets. Often, these datasets are published from the isolated and different sequencing facilities. Consequently, the process of sharing and aggregating multisite sequencing datasets are thwarted by issues such as the need to discover relevant data from different sources, built scalable repositories, the automation of data linkage, the volume of the data, efficient querying mechanism, and information rich intuitive visualisation. RESULTS: We present an approach to link and query different sequencing datasets (TCGA, COSMIC, REACTOME, KEGG and GO) to indicate risks for four cancer types – Ovarian Serous Cystadenocarcinoma (OV), Uterine Corpus Endometrial Carcinoma (UCEC), Uterine Carcinosarcoma (UCS), Cervical Squamous Cell Carcinoma and Endocervical Adenocarcinoma (CESC) – covering the 16 healthy tissue-specific genes from Illumina Human Body Map 2.0. The differentially expressed genes from Illumina Human Body Map 2.0 are analysed together with the gene expressions reported in COSMIC and TCGA repositories leading to the discover of potential biomarkers for a tissue-specific cancer. CONCLUSION: We analyse the tissue expression of genes, copy number variation (CNV), somatic mutation, and promoter methylation to identify associated pathways and find novel biomarkers. We discovered twenty (20) mutated genes and three (3) potential pathways causing promoter changes in different gynaecological cancer types. We propose a data-interlinked platform called BIOOPENER that glues together heterogeneous cancer and biomedical repositories. The key approach is to find correspondences (or data links) among genetic, cellular and molecular features across isolated cancer datasets giving insight into cancer progression from normal to diseased tissues. The proposed BIOOPENER platform enriches mutations by filling in missing links from TCGA, COSMIC, REACTOME, KEGG and GO datasets and provides an interlinking mechanism to understand cancer progression from normal to diseased tissues with pathway components, which in turn helped to map mutations, associated phenotypes, pathways, and mechanism. BioMed Central 2017-09-19 /pmc/articles/PMC5606033/ /pubmed/28927463 http://dx.doi.org/10.1186/s13326-017-0146-9 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Jha, Alokkumar
Khan, Yasar
Mehdi, Muntazir
Karim, Md Rezaul
Mehmood, Qaiser
Zappa, Achille
Rebholz-Schuhmann, Dietrich
Sahay, Ratnesh
Towards precision medicine: discovering novel gynecological cancer biomarkers and pathways using linked data
title Towards precision medicine: discovering novel gynecological cancer biomarkers and pathways using linked data
title_full Towards precision medicine: discovering novel gynecological cancer biomarkers and pathways using linked data
title_fullStr Towards precision medicine: discovering novel gynecological cancer biomarkers and pathways using linked data
title_full_unstemmed Towards precision medicine: discovering novel gynecological cancer biomarkers and pathways using linked data
title_short Towards precision medicine: discovering novel gynecological cancer biomarkers and pathways using linked data
title_sort towards precision medicine: discovering novel gynecological cancer biomarkers and pathways using linked data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5606033/
https://www.ncbi.nlm.nih.gov/pubmed/28927463
http://dx.doi.org/10.1186/s13326-017-0146-9
work_keys_str_mv AT jhaalokkumar towardsprecisionmedicinediscoveringnovelgynecologicalcancerbiomarkersandpathwaysusinglinkeddata
AT khanyasar towardsprecisionmedicinediscoveringnovelgynecologicalcancerbiomarkersandpathwaysusinglinkeddata
AT mehdimuntazir towardsprecisionmedicinediscoveringnovelgynecologicalcancerbiomarkersandpathwaysusinglinkeddata
AT karimmdrezaul towardsprecisionmedicinediscoveringnovelgynecologicalcancerbiomarkersandpathwaysusinglinkeddata
AT mehmoodqaiser towardsprecisionmedicinediscoveringnovelgynecologicalcancerbiomarkersandpathwaysusinglinkeddata
AT zappaachille towardsprecisionmedicinediscoveringnovelgynecologicalcancerbiomarkersandpathwaysusinglinkeddata
AT rebholzschuhmanndietrich towardsprecisionmedicinediscoveringnovelgynecologicalcancerbiomarkersandpathwaysusinglinkeddata
AT sahayratnesh towardsprecisionmedicinediscoveringnovelgynecologicalcancerbiomarkersandpathwaysusinglinkeddata