Cargando…
Predictors of breast cancer cell types and their prognostic power in breast cancer patients
BACKGROUND: Comprehensive understanding of intratumor heterogeneity requires identification of molecular markers, which are capable of differentiating different subpopulations and which also have clinical significance. One important tool that has been addressing this issue is single cell RNA-Sequenc...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5809864/ https://www.ncbi.nlm.nih.gov/pubmed/29433432 http://dx.doi.org/10.1186/s12864-018-4527-y |
_version_ | 1783299630635679744 |
---|---|
author | Wang, Fan Dohogne, Zachariah Yang, Jin Liu, Yu Soibam, Benjamin |
author_facet | Wang, Fan Dohogne, Zachariah Yang, Jin Liu, Yu Soibam, Benjamin |
author_sort | Wang, Fan |
collection | PubMed |
description | BACKGROUND: Comprehensive understanding of intratumor heterogeneity requires identification of molecular markers, which are capable of differentiating different subpopulations and which also have clinical significance. One important tool that has been addressing this issue is single cell RNA-Sequencing (scRNASeq) that allows the quantification of expression profiles of transcripts in individual cells in a population of cancer cells. Using the expression profiles from scRNASeq, current studies conduct analysis to group cells into different subpopulations using clustering algorithms. In this study, we explore scRNASeq cancer data from a different perspective. We focus on scRNASeq data originating from cancer cells pertaining to a particular cancer type, where the cell type or the subpopulation to which each cell belongs is known. We investigate if the “cell type” of a cancer cell can be predicted based on the expression profiles of a small set of transcripts. RESULTS: We outline a predictive analytics pipeline to accurately predict 6 breast cancer cell types using single cell gene expression profiles. Instead of building predictive models using the complete human transcripts, the pipeline first eliminates predictors with low expression and low variance. A multinomial penalized logistic regression further reduces the size of the predictors to only 308, out of which 34 are long non-coding RNAs. Tuning of predictive models shows support vector machines and neural networks as the most accurate models achieving close to 98% prediction accuracies. We also find that mixture of protein coding genes and long non-coding RNAs are better predictors compared to when the two sets of transcripts are treated separately. A signature risk score originating from 65 protein coding genes and 5 lncRNA predictors is associated with prognostic survival of TCGA breast cancer patients. This association was maintained when the risk scores were generated using 65 PCGs and 5 lncRNA separately. We further show that predictors restricted to a particular cell type serve as better prognostic markers for the respective patient subtype. CONCLUSION: Our results show that in general, the breast cancer cell type predictors are also associated with patient survivability and hence have clinical significance. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-4527-y) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5809864 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-58098642018-02-16 Predictors of breast cancer cell types and their prognostic power in breast cancer patients Wang, Fan Dohogne, Zachariah Yang, Jin Liu, Yu Soibam, Benjamin BMC Genomics Research Article BACKGROUND: Comprehensive understanding of intratumor heterogeneity requires identification of molecular markers, which are capable of differentiating different subpopulations and which also have clinical significance. One important tool that has been addressing this issue is single cell RNA-Sequencing (scRNASeq) that allows the quantification of expression profiles of transcripts in individual cells in a population of cancer cells. Using the expression profiles from scRNASeq, current studies conduct analysis to group cells into different subpopulations using clustering algorithms. In this study, we explore scRNASeq cancer data from a different perspective. We focus on scRNASeq data originating from cancer cells pertaining to a particular cancer type, where the cell type or the subpopulation to which each cell belongs is known. We investigate if the “cell type” of a cancer cell can be predicted based on the expression profiles of a small set of transcripts. RESULTS: We outline a predictive analytics pipeline to accurately predict 6 breast cancer cell types using single cell gene expression profiles. Instead of building predictive models using the complete human transcripts, the pipeline first eliminates predictors with low expression and low variance. A multinomial penalized logistic regression further reduces the size of the predictors to only 308, out of which 34 are long non-coding RNAs. Tuning of predictive models shows support vector machines and neural networks as the most accurate models achieving close to 98% prediction accuracies. We also find that mixture of protein coding genes and long non-coding RNAs are better predictors compared to when the two sets of transcripts are treated separately. A signature risk score originating from 65 protein coding genes and 5 lncRNA predictors is associated with prognostic survival of TCGA breast cancer patients. This association was maintained when the risk scores were generated using 65 PCGs and 5 lncRNA separately. We further show that predictors restricted to a particular cell type serve as better prognostic markers for the respective patient subtype. CONCLUSION: Our results show that in general, the breast cancer cell type predictors are also associated with patient survivability and hence have clinical significance. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-4527-y) contains supplementary material, which is available to authorized users. BioMed Central 2018-02-13 /pmc/articles/PMC5809864/ /pubmed/29433432 http://dx.doi.org/10.1186/s12864-018-4527-y Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Wang, Fan Dohogne, Zachariah Yang, Jin Liu, Yu Soibam, Benjamin Predictors of breast cancer cell types and their prognostic power in breast cancer patients |
title | Predictors of breast cancer cell types and their prognostic power in breast cancer patients |
title_full | Predictors of breast cancer cell types and their prognostic power in breast cancer patients |
title_fullStr | Predictors of breast cancer cell types and their prognostic power in breast cancer patients |
title_full_unstemmed | Predictors of breast cancer cell types and their prognostic power in breast cancer patients |
title_short | Predictors of breast cancer cell types and their prognostic power in breast cancer patients |
title_sort | predictors of breast cancer cell types and their prognostic power in breast cancer patients |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5809864/ https://www.ncbi.nlm.nih.gov/pubmed/29433432 http://dx.doi.org/10.1186/s12864-018-4527-y |
work_keys_str_mv | AT wangfan predictorsofbreastcancercelltypesandtheirprognosticpowerinbreastcancerpatients AT dohognezachariah predictorsofbreastcancercelltypesandtheirprognosticpowerinbreastcancerpatients AT yangjin predictorsofbreastcancercelltypesandtheirprognosticpowerinbreastcancerpatients AT liuyu predictorsofbreastcancercelltypesandtheirprognosticpowerinbreastcancerpatients AT soibambenjamin predictorsofbreastcancercelltypesandtheirprognosticpowerinbreastcancerpatients |