Cargando…
Strategies for data normalization and missing data imputation and consequences for potential diagnostic microRNA biomarkers in epithelial ovarian cancer
MicroRNAs (miRNAs) are small non-coding RNA molecules regulating gene expression with diagnostic potential in different diseases, including epithelial ovarian carcinomas (EOC). As only a few studies have been published on the identification of stable endogenous miRNA in EOC, there is no consensus wh...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10159121/ https://www.ncbi.nlm.nih.gov/pubmed/37141239 http://dx.doi.org/10.1371/journal.pone.0282576 |
_version_ | 1785037065883222016 |
---|---|
author | Lopacinska-Jørgensen, Joanna Petersen, Patrick H. D. Oliveira, Douglas V. N. P. Høgdall, Claus K. Høgdall, Estrid V. |
author_facet | Lopacinska-Jørgensen, Joanna Petersen, Patrick H. D. Oliveira, Douglas V. N. P. Høgdall, Claus K. Høgdall, Estrid V. |
author_sort | Lopacinska-Jørgensen, Joanna |
collection | PubMed |
description | MicroRNAs (miRNAs) are small non-coding RNA molecules regulating gene expression with diagnostic potential in different diseases, including epithelial ovarian carcinomas (EOC). As only a few studies have been published on the identification of stable endogenous miRNA in EOC, there is no consensus which miRNAs should be used aiming standardization. Currently, U6-snRNA is widely adopted as a normalization control in RT-qPCR when investigating miRNAs in EOC; despite its variable expression across cancers being reported. Therefore, our goal was to compare different missing data and normalization approaches to investigate their impact on the choice of stable endogenous controls and subsequent survival analysis while performing expression analysis of miRNAs by RT-qPCR in most frequent subtype of EOC: high-grade serous carcinoma (HGSC). 40 miRNAs were included based on their potential as stable endogenous controls or as biomarkers in EOC. Following RNA extraction from formalin-fixed paraffin embedded tissues from 63 HGSC patients, RT-qPCR was performed with a custom panel covering 40 target miRNAs and 8 controls. The raw data was analyzed by applying various strategies regarding choosing stable endogenous controls (geNorm, BestKeeper, NormFinder, the comparative ΔCt method and RefFinder), missing data (single/multiple imputation), and normalization (endogenous miRNA controls, U6-snRNA or global mean). Based on our study, we propose hsa-miR-23a-3p and hsa-miR-193a-5p, but not U6-snRNA as endogenous controls in HGSC patients. Our findings are validated in two external cohorts retrieved from the NCBI Gene Expression Omnibus database. We present that the outcome of stability analysis depends on the histological composition of the cohort, and it might suggest unique pattern of miRNA stability profiles for each subtype of EOC. Moreover, our data demonstrates the challenge of miRNA data analysis by presenting various outcomes from normalization and missing data imputation strategies on survival analysis. |
format | Online Article Text |
id | pubmed-10159121 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-101591212023-05-05 Strategies for data normalization and missing data imputation and consequences for potential diagnostic microRNA biomarkers in epithelial ovarian cancer Lopacinska-Jørgensen, Joanna Petersen, Patrick H. D. Oliveira, Douglas V. N. P. Høgdall, Claus K. Høgdall, Estrid V. PLoS One Research Article MicroRNAs (miRNAs) are small non-coding RNA molecules regulating gene expression with diagnostic potential in different diseases, including epithelial ovarian carcinomas (EOC). As only a few studies have been published on the identification of stable endogenous miRNA in EOC, there is no consensus which miRNAs should be used aiming standardization. Currently, U6-snRNA is widely adopted as a normalization control in RT-qPCR when investigating miRNAs in EOC; despite its variable expression across cancers being reported. Therefore, our goal was to compare different missing data and normalization approaches to investigate their impact on the choice of stable endogenous controls and subsequent survival analysis while performing expression analysis of miRNAs by RT-qPCR in most frequent subtype of EOC: high-grade serous carcinoma (HGSC). 40 miRNAs were included based on their potential as stable endogenous controls or as biomarkers in EOC. Following RNA extraction from formalin-fixed paraffin embedded tissues from 63 HGSC patients, RT-qPCR was performed with a custom panel covering 40 target miRNAs and 8 controls. The raw data was analyzed by applying various strategies regarding choosing stable endogenous controls (geNorm, BestKeeper, NormFinder, the comparative ΔCt method and RefFinder), missing data (single/multiple imputation), and normalization (endogenous miRNA controls, U6-snRNA or global mean). Based on our study, we propose hsa-miR-23a-3p and hsa-miR-193a-5p, but not U6-snRNA as endogenous controls in HGSC patients. Our findings are validated in two external cohorts retrieved from the NCBI Gene Expression Omnibus database. We present that the outcome of stability analysis depends on the histological composition of the cohort, and it might suggest unique pattern of miRNA stability profiles for each subtype of EOC. Moreover, our data demonstrates the challenge of miRNA data analysis by presenting various outcomes from normalization and missing data imputation strategies on survival analysis. Public Library of Science 2023-05-04 /pmc/articles/PMC10159121/ /pubmed/37141239 http://dx.doi.org/10.1371/journal.pone.0282576 Text en © 2023 Lopacinska-Jørgensen et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Lopacinska-Jørgensen, Joanna Petersen, Patrick H. D. Oliveira, Douglas V. N. P. Høgdall, Claus K. Høgdall, Estrid V. Strategies for data normalization and missing data imputation and consequences for potential diagnostic microRNA biomarkers in epithelial ovarian cancer |
title | Strategies for data normalization and missing data imputation and consequences for potential diagnostic microRNA biomarkers in epithelial ovarian cancer |
title_full | Strategies for data normalization and missing data imputation and consequences for potential diagnostic microRNA biomarkers in epithelial ovarian cancer |
title_fullStr | Strategies for data normalization and missing data imputation and consequences for potential diagnostic microRNA biomarkers in epithelial ovarian cancer |
title_full_unstemmed | Strategies for data normalization and missing data imputation and consequences for potential diagnostic microRNA biomarkers in epithelial ovarian cancer |
title_short | Strategies for data normalization and missing data imputation and consequences for potential diagnostic microRNA biomarkers in epithelial ovarian cancer |
title_sort | strategies for data normalization and missing data imputation and consequences for potential diagnostic microrna biomarkers in epithelial ovarian cancer |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10159121/ https://www.ncbi.nlm.nih.gov/pubmed/37141239 http://dx.doi.org/10.1371/journal.pone.0282576 |
work_keys_str_mv | AT lopacinskajørgensenjoanna strategiesfordatanormalizationandmissingdataimputationandconsequencesforpotentialdiagnosticmicrornabiomarkersinepithelialovariancancer AT petersenpatrickhd strategiesfordatanormalizationandmissingdataimputationandconsequencesforpotentialdiagnosticmicrornabiomarkersinepithelialovariancancer AT oliveiradouglasvnp strategiesfordatanormalizationandmissingdataimputationandconsequencesforpotentialdiagnosticmicrornabiomarkersinepithelialovariancancer AT høgdallclausk strategiesfordatanormalizationandmissingdataimputationandconsequencesforpotentialdiagnosticmicrornabiomarkersinepithelialovariancancer AT høgdallestridv strategiesfordatanormalizationandmissingdataimputationandconsequencesforpotentialdiagnosticmicrornabiomarkersinepithelialovariancancer |