Cargando…

Predictors of breast cancer cell types and their prognostic power in breast cancer patients

BACKGROUND: Comprehensive understanding of intratumor heterogeneity requires identification of molecular markers, which are capable of differentiating different subpopulations and which also have clinical significance. One important tool that has been addressing this issue is single cell RNA-Sequenc...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Fan, Dohogne, Zachariah, Yang, Jin, Liu, Yu, Soibam, Benjamin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5809864/
https://www.ncbi.nlm.nih.gov/pubmed/29433432
http://dx.doi.org/10.1186/s12864-018-4527-y
_version_ 1783299630635679744
author Wang, Fan
Dohogne, Zachariah
Yang, Jin
Liu, Yu
Soibam, Benjamin
author_facet Wang, Fan
Dohogne, Zachariah
Yang, Jin
Liu, Yu
Soibam, Benjamin
author_sort Wang, Fan
collection PubMed
description BACKGROUND: Comprehensive understanding of intratumor heterogeneity requires identification of molecular markers, which are capable of differentiating different subpopulations and which also have clinical significance. One important tool that has been addressing this issue is single cell RNA-Sequencing (scRNASeq) that allows the quantification of expression profiles of transcripts in individual cells in a population of cancer cells. Using the expression profiles from scRNASeq, current studies conduct analysis to group cells into different subpopulations using clustering algorithms. In this study, we explore scRNASeq cancer data from a different perspective. We focus on scRNASeq data originating from cancer cells pertaining to a particular cancer type, where the cell type or the subpopulation to which each cell belongs is known. We investigate if the “cell type” of a cancer cell can be predicted based on the expression profiles of a small set of transcripts. RESULTS: We outline a predictive analytics pipeline to accurately predict 6 breast cancer cell types using single cell gene expression profiles. Instead of building predictive models using the complete human transcripts, the pipeline first eliminates predictors with low expression and low variance. A multinomial penalized logistic regression further reduces the size of the predictors to only 308, out of which 34 are long non-coding RNAs. Tuning of predictive models shows support vector machines and neural networks as the most accurate models achieving close to 98% prediction accuracies. We also find that mixture of protein coding genes and long non-coding RNAs are better predictors compared to when the two sets of transcripts are treated separately. A signature risk score originating from 65 protein coding genes and 5 lncRNA predictors is associated with prognostic survival of TCGA breast cancer patients. This association was maintained when the risk scores were generated using 65 PCGs and 5 lncRNA separately. We further show that predictors restricted to a particular cell type serve as better prognostic markers for the respective patient subtype. CONCLUSION: Our results show that in general, the breast cancer cell type predictors are also associated with patient survivability and hence have clinical significance. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-4527-y) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5809864
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-58098642018-02-16 Predictors of breast cancer cell types and their prognostic power in breast cancer patients Wang, Fan Dohogne, Zachariah Yang, Jin Liu, Yu Soibam, Benjamin BMC Genomics Research Article BACKGROUND: Comprehensive understanding of intratumor heterogeneity requires identification of molecular markers, which are capable of differentiating different subpopulations and which also have clinical significance. One important tool that has been addressing this issue is single cell RNA-Sequencing (scRNASeq) that allows the quantification of expression profiles of transcripts in individual cells in a population of cancer cells. Using the expression profiles from scRNASeq, current studies conduct analysis to group cells into different subpopulations using clustering algorithms. In this study, we explore scRNASeq cancer data from a different perspective. We focus on scRNASeq data originating from cancer cells pertaining to a particular cancer type, where the cell type or the subpopulation to which each cell belongs is known. We investigate if the “cell type” of a cancer cell can be predicted based on the expression profiles of a small set of transcripts. RESULTS: We outline a predictive analytics pipeline to accurately predict 6 breast cancer cell types using single cell gene expression profiles. Instead of building predictive models using the complete human transcripts, the pipeline first eliminates predictors with low expression and low variance. A multinomial penalized logistic regression further reduces the size of the predictors to only 308, out of which 34 are long non-coding RNAs. Tuning of predictive models shows support vector machines and neural networks as the most accurate models achieving close to 98% prediction accuracies. We also find that mixture of protein coding genes and long non-coding RNAs are better predictors compared to when the two sets of transcripts are treated separately. A signature risk score originating from 65 protein coding genes and 5 lncRNA predictors is associated with prognostic survival of TCGA breast cancer patients. This association was maintained when the risk scores were generated using 65 PCGs and 5 lncRNA separately. We further show that predictors restricted to a particular cell type serve as better prognostic markers for the respective patient subtype. CONCLUSION: Our results show that in general, the breast cancer cell type predictors are also associated with patient survivability and hence have clinical significance. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-4527-y) contains supplementary material, which is available to authorized users. BioMed Central 2018-02-13 /pmc/articles/PMC5809864/ /pubmed/29433432 http://dx.doi.org/10.1186/s12864-018-4527-y Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Wang, Fan
Dohogne, Zachariah
Yang, Jin
Liu, Yu
Soibam, Benjamin
Predictors of breast cancer cell types and their prognostic power in breast cancer patients
title Predictors of breast cancer cell types and their prognostic power in breast cancer patients
title_full Predictors of breast cancer cell types and their prognostic power in breast cancer patients
title_fullStr Predictors of breast cancer cell types and their prognostic power in breast cancer patients
title_full_unstemmed Predictors of breast cancer cell types and their prognostic power in breast cancer patients
title_short Predictors of breast cancer cell types and their prognostic power in breast cancer patients
title_sort predictors of breast cancer cell types and their prognostic power in breast cancer patients
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5809864/
https://www.ncbi.nlm.nih.gov/pubmed/29433432
http://dx.doi.org/10.1186/s12864-018-4527-y
work_keys_str_mv AT wangfan predictorsofbreastcancercelltypesandtheirprognosticpowerinbreastcancerpatients
AT dohognezachariah predictorsofbreastcancercelltypesandtheirprognosticpowerinbreastcancerpatients
AT yangjin predictorsofbreastcancercelltypesandtheirprognosticpowerinbreastcancerpatients
AT liuyu predictorsofbreastcancercelltypesandtheirprognosticpowerinbreastcancerpatients
AT soibambenjamin predictorsofbreastcancercelltypesandtheirprognosticpowerinbreastcancerpatients