Cargando…

Performance evaluation of transcriptomics data normalization for survival risk prediction

One pivotal feature of transcriptomics data is the unwanted variations caused by disparate experimental handling, known as handling effects. Various data normalization methods were developed to alleviate the adverse impact of handling effects in the setting of differential expression analysis. Howev...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ni, Ai, Qin, Li-Xuan
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2021
Materias:	Review
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8575026/ https://www.ncbi.nlm.nih.gov/pubmed/34245143 http://dx.doi.org/10.1093/bib/bbab257

_version_	1784595603797311488
author	Ni, Ai Qin, Li-Xuan
author_facet	Ni, Ai Qin, Li-Xuan
author_sort	Ni, Ai
collection	PubMed
description	One pivotal feature of transcriptomics data is the unwanted variations caused by disparate experimental handling, known as handling effects. Various data normalization methods were developed to alleviate the adverse impact of handling effects in the setting of differential expression analysis. However, little research has been done to evaluate their performance in the setting of survival outcome prediction, an important analysis goal for transcriptomics data in biomedical research. Leveraging a unique pair of datasets for the same set of tumor samples—one with handling effects and the other without, we developed a benchmarking tool for conducting such an evaluation in microRNA microarrays. We applied this tool to evaluate the performance of three popular normalization methods—quantile normalization, median normalization and variance stabilizing normalization—in survival prediction using various approaches for model building and designs for sample assignment. We showed that handling effects can have a strong impact on survival prediction and that quantile normalization, a most popular method in current practice, tends to underperform median normalization and variance stabilizing normalization. We demonstrated with a small example the reason for quantile normalization’s poor performance in this setting. Our finding highlights the importance of putting normalization evaluation in the context of the downstream analysis setting and the potential of improving the development of survival predictors by applying median normalization. We make available our benchmarking tool for performing such evaluation on additional normalization methods in connection with prediction modeling approaches.
format	Online Article Text
id	pubmed-8575026
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-85750262021-11-09 Performance evaluation of transcriptomics data normalization for survival risk prediction Ni, Ai Qin, Li-Xuan Brief Bioinform Review One pivotal feature of transcriptomics data is the unwanted variations caused by disparate experimental handling, known as handling effects. Various data normalization methods were developed to alleviate the adverse impact of handling effects in the setting of differential expression analysis. However, little research has been done to evaluate their performance in the setting of survival outcome prediction, an important analysis goal for transcriptomics data in biomedical research. Leveraging a unique pair of datasets for the same set of tumor samples—one with handling effects and the other without, we developed a benchmarking tool for conducting such an evaluation in microRNA microarrays. We applied this tool to evaluate the performance of three popular normalization methods—quantile normalization, median normalization and variance stabilizing normalization—in survival prediction using various approaches for model building and designs for sample assignment. We showed that handling effects can have a strong impact on survival prediction and that quantile normalization, a most popular method in current practice, tends to underperform median normalization and variance stabilizing normalization. We demonstrated with a small example the reason for quantile normalization’s poor performance in this setting. Our finding highlights the importance of putting normalization evaluation in the context of the downstream analysis setting and the potential of improving the development of survival predictors by applying median normalization. We make available our benchmarking tool for performing such evaluation on additional normalization methods in connection with prediction modeling approaches. Oxford University Press 2021-07-09 /pmc/articles/PMC8575026/ /pubmed/34245143 http://dx.doi.org/10.1093/bib/bbab257 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle	Review Ni, Ai Qin, Li-Xuan Performance evaluation of transcriptomics data normalization for survival risk prediction
title	Performance evaluation of transcriptomics data normalization for survival risk prediction
title_full	Performance evaluation of transcriptomics data normalization for survival risk prediction
title_fullStr	Performance evaluation of transcriptomics data normalization for survival risk prediction
title_full_unstemmed	Performance evaluation of transcriptomics data normalization for survival risk prediction
title_short	Performance evaluation of transcriptomics data normalization for survival risk prediction
title_sort	performance evaluation of transcriptomics data normalization for survival risk prediction
topic	Review
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8575026/ https://www.ncbi.nlm.nih.gov/pubmed/34245143 http://dx.doi.org/10.1093/bib/bbab257
work_keys_str_mv	AT niai performanceevaluationoftranscriptomicsdatanormalizationforsurvivalriskprediction AT qinlixuan performanceevaluationoftranscriptomicsdatanormalizationforsurvivalriskprediction

Performance evaluation of transcriptomics data normalization for survival risk prediction

Ejemplares similares