Cargando…

Multiple Omics Data Integration to Identify Long Noncoding RNA Responsible for Breast Cancer–Related Mortality

Long non-coding RNAs (lncRNAs) are a large and diverse class of transcribed RNAs, which have been shown to play a significant role in developing cancer. In this study, we apply integrative modeling framework to integrate the DNA copy number variation (CNV), lncRNA expression, and downstream target p...

Descripción completa

Detalles Bibliográficos
Autores principales: Roy Sarkar, Tapasree, Maity, Arnab Kumar, Niu, Yabo, Mallick, Bani K
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6710679/
https://www.ncbi.nlm.nih.gov/pubmed/31488946
http://dx.doi.org/10.1177/1176935119871933
_version_ 1783446384757702656
author Roy Sarkar, Tapasree
Maity, Arnab Kumar
Niu, Yabo
Mallick, Bani K
author_facet Roy Sarkar, Tapasree
Maity, Arnab Kumar
Niu, Yabo
Mallick, Bani K
author_sort Roy Sarkar, Tapasree
collection PubMed
description Long non-coding RNAs (lncRNAs) are a large and diverse class of transcribed RNAs, which have been shown to play a significant role in developing cancer. In this study, we apply integrative modeling framework to integrate the DNA copy number variation (CNV), lncRNA expression, and downstream target protein expression to predict patient survival in breast cancer. We develop a 3-stage model combining a mechanical model (lncRNA regressed on CNV and target proteins regressed on lncRNA) and a clinical model (survival regressed on estimated effects from the mechanical models). Using lncRNAs (such as HOTAIR and MALAT1) along with their CNV, target protein expressions, and survival outcomes from The Cancer Genome Atlas (TCGA) database, we show that predicted mean square error and integrated Brier score (IBS) are both lower for the proposed 3-step integrated model than that of 2-step model. Therefore, the integrative model has better predictive ability than the 2-step model not considering target protein information.
format Online
Article
Text
id pubmed-6710679
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-67106792019-09-05 Multiple Omics Data Integration to Identify Long Noncoding RNA Responsible for Breast Cancer–Related Mortality Roy Sarkar, Tapasree Maity, Arnab Kumar Niu, Yabo Mallick, Bani K Cancer Inform Short Report Long non-coding RNAs (lncRNAs) are a large and diverse class of transcribed RNAs, which have been shown to play a significant role in developing cancer. In this study, we apply integrative modeling framework to integrate the DNA copy number variation (CNV), lncRNA expression, and downstream target protein expression to predict patient survival in breast cancer. We develop a 3-stage model combining a mechanical model (lncRNA regressed on CNV and target proteins regressed on lncRNA) and a clinical model (survival regressed on estimated effects from the mechanical models). Using lncRNAs (such as HOTAIR and MALAT1) along with their CNV, target protein expressions, and survival outcomes from The Cancer Genome Atlas (TCGA) database, we show that predicted mean square error and integrated Brier score (IBS) are both lower for the proposed 3-step integrated model than that of 2-step model. Therefore, the integrative model has better predictive ability than the 2-step model not considering target protein information. SAGE Publications 2019-08-24 /pmc/articles/PMC6710679/ /pubmed/31488946 http://dx.doi.org/10.1177/1176935119871933 Text en © The Author(s) 2019 http://www.creativecommons.org/licenses/by-nc/4.0/ This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (http://www.creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Short Report
Roy Sarkar, Tapasree
Maity, Arnab Kumar
Niu, Yabo
Mallick, Bani K
Multiple Omics Data Integration to Identify Long Noncoding RNA Responsible for Breast Cancer–Related Mortality
title Multiple Omics Data Integration to Identify Long Noncoding RNA Responsible for Breast Cancer–Related Mortality
title_full Multiple Omics Data Integration to Identify Long Noncoding RNA Responsible for Breast Cancer–Related Mortality
title_fullStr Multiple Omics Data Integration to Identify Long Noncoding RNA Responsible for Breast Cancer–Related Mortality
title_full_unstemmed Multiple Omics Data Integration to Identify Long Noncoding RNA Responsible for Breast Cancer–Related Mortality
title_short Multiple Omics Data Integration to Identify Long Noncoding RNA Responsible for Breast Cancer–Related Mortality
title_sort multiple omics data integration to identify long noncoding rna responsible for breast cancer–related mortality
topic Short Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6710679/
https://www.ncbi.nlm.nih.gov/pubmed/31488946
http://dx.doi.org/10.1177/1176935119871933
work_keys_str_mv AT roysarkartapasree multipleomicsdataintegrationtoidentifylongnoncodingrnaresponsibleforbreastcancerrelatedmortality
AT maityarnabkumar multipleomicsdataintegrationtoidentifylongnoncodingrnaresponsibleforbreastcancerrelatedmortality
AT niuyabo multipleomicsdataintegrationtoidentifylongnoncodingrnaresponsibleforbreastcancerrelatedmortality
AT mallickbanik multipleomicsdataintegrationtoidentifylongnoncodingrnaresponsibleforbreastcancerrelatedmortality