Cargando…

Benchmarking of analytical combinations for COVID-19 outcome prediction using single-cell RNA sequencing data

The advances of single-cell transcriptomic technologies have led to increasing use of single-cell RNA sequencing (scRNA-seq) data in large-scale patient cohort studies. The resulting high-dimensional data can be summarized and incorporated into patient outcome prediction models in several ways; howe...

Descripción completa

Detalles Bibliográficos
Autores principales:	Cao, Yue, Ghazanfar, Shila, Yang, Pengyi, Yang, Jean
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2023
Materias:	Case Study
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10199760/ https://www.ncbi.nlm.nih.gov/pubmed/37096588 http://dx.doi.org/10.1093/bib/bbad159

_version_	1785044999613710336
author	Cao, Yue Ghazanfar, Shila Yang, Pengyi Yang, Jean
author_facet	Cao, Yue Ghazanfar, Shila Yang, Pengyi Yang, Jean
author_sort	Cao, Yue
collection	PubMed
description	The advances of single-cell transcriptomic technologies have led to increasing use of single-cell RNA sequencing (scRNA-seq) data in large-scale patient cohort studies. The resulting high-dimensional data can be summarized and incorporated into patient outcome prediction models in several ways; however, there is a pressing need to understand the impact of analytical decisions on such model quality. In this study, we evaluate the impact of analytical choices on model choices, ensemble learning strategies and integrate approaches on patient outcome prediction using five scRNA-seq COVID-19 datasets. First, we examine the difference in performance between using single-view feature space versus multi-view feature space. Next, we survey multiple learning platforms from classical machine learning to modern deep learning methods. Lastly, we compare different integration approaches when combining datasets is necessary. Through benchmarking such analytical combinations, our study highlights the power of ensemble learning, consistency among different learning methods and robustness to dataset normalization when using multiple datasets as the model input.
format	Online Article Text
id	pubmed-10199760
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-101997602023-05-21 Benchmarking of analytical combinations for COVID-19 outcome prediction using single-cell RNA sequencing data Cao, Yue Ghazanfar, Shila Yang, Pengyi Yang, Jean Brief Bioinform Case Study The advances of single-cell transcriptomic technologies have led to increasing use of single-cell RNA sequencing (scRNA-seq) data in large-scale patient cohort studies. The resulting high-dimensional data can be summarized and incorporated into patient outcome prediction models in several ways; however, there is a pressing need to understand the impact of analytical decisions on such model quality. In this study, we evaluate the impact of analytical choices on model choices, ensemble learning strategies and integrate approaches on patient outcome prediction using five scRNA-seq COVID-19 datasets. First, we examine the difference in performance between using single-view feature space versus multi-view feature space. Next, we survey multiple learning platforms from classical machine learning to modern deep learning methods. Lastly, we compare different integration approaches when combining datasets is necessary. Through benchmarking such analytical combinations, our study highlights the power of ensemble learning, consistency among different learning methods and robustness to dataset normalization when using multiple datasets as the model input. Oxford University Press 2023-04-24 /pmc/articles/PMC10199760/ /pubmed/37096588 http://dx.doi.org/10.1093/bib/bbad159 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Case Study Cao, Yue Ghazanfar, Shila Yang, Pengyi Yang, Jean Benchmarking of analytical combinations for COVID-19 outcome prediction using single-cell RNA sequencing data
title	Benchmarking of analytical combinations for COVID-19 outcome prediction using single-cell RNA sequencing data
title_full	Benchmarking of analytical combinations for COVID-19 outcome prediction using single-cell RNA sequencing data
title_fullStr	Benchmarking of analytical combinations for COVID-19 outcome prediction using single-cell RNA sequencing data
title_full_unstemmed	Benchmarking of analytical combinations for COVID-19 outcome prediction using single-cell RNA sequencing data
title_short	Benchmarking of analytical combinations for COVID-19 outcome prediction using single-cell RNA sequencing data
title_sort	benchmarking of analytical combinations for covid-19 outcome prediction using single-cell rna sequencing data
topic	Case Study
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10199760/ https://www.ncbi.nlm.nih.gov/pubmed/37096588 http://dx.doi.org/10.1093/bib/bbad159
work_keys_str_mv	AT caoyue benchmarkingofanalyticalcombinationsforcovid19outcomepredictionusingsinglecellrnasequencingdata AT ghazanfarshila benchmarkingofanalyticalcombinationsforcovid19outcomepredictionusingsinglecellrnasequencingdata AT yangpengyi benchmarkingofanalyticalcombinationsforcovid19outcomepredictionusingsinglecellrnasequencingdata AT yangjean benchmarkingofanalyticalcombinationsforcovid19outcomepredictionusingsinglecellrnasequencingdata

Benchmarking of analytical combinations for COVID-19 outcome prediction using single-cell RNA sequencing data

Ejemplares similares