Cargando…

Microarray Normalization Revisited for Reproducible Breast Cancer Biomarkers

Precision medicine for breast cancer relies on biomarkers to select therapies. However, the reliability of biomarkers drawn from gene expression arrays has been questioned and calls for reassessment, in particular for large datasets. We revisit widely used data-normalization procedures and evaluate...

Descripción completa

Detalles Bibliográficos
Autores principales: Kenn, Michael, Cacsire Castillo-Tong, Dan, Singer, Christian F., Cibena, Michael, Kölbl, Heinz, Schreiner, Wolfgang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7428878/
https://www.ncbi.nlm.nih.gov/pubmed/32832541
http://dx.doi.org/10.1155/2020/1363827
_version_ 1783571168152780800
author Kenn, Michael
Cacsire Castillo-Tong, Dan
Singer, Christian F.
Cibena, Michael
Kölbl, Heinz
Schreiner, Wolfgang
author_facet Kenn, Michael
Cacsire Castillo-Tong, Dan
Singer, Christian F.
Cibena, Michael
Kölbl, Heinz
Schreiner, Wolfgang
author_sort Kenn, Michael
collection PubMed
description Precision medicine for breast cancer relies on biomarkers to select therapies. However, the reliability of biomarkers drawn from gene expression arrays has been questioned and calls for reassessment, in particular for large datasets. We revisit widely used data-normalization procedures and evaluate differences in outcome in order to pinpoint the most reliable reprocessing methods biomarkers can be based upon. We generated a database of 3753 breast cancer patients out of 38 studies by downloading and curating patient samples from NCBI-GEO. As gene-expression biomarkers, we select the assessment of receptor status and breast cancer subtype classification. Each normalization procedure is applied separately, and biomarkers are then evaluated for each patient. Differences between normalization pipelines are quantified as percentages of patients having outcomes different for each pipeline. Some normalization procedures lead to quite consistent biomarkers, differing only in 1-2% of patients. Other normalization procedures—some of them have been used in many clinical studies—end up with distrusting discrepancies (10% and more). A good deal of doubt regarding the reliability of microarrays may root in the haphazard application of inadequate preprocessing pipelines. Several modes of batch corrections are evaluated regarding a possible improvement of receptor prediction from gene expression versus the golden standard of immunohistochemistry. Finally, we nominate those normalization methods yielding consistent and trustable results. Adequate bioinformatics data preprocessing is key and crucial for any subsequent statistics to arrive at trustable results. We conclude with a suggestion for future bioinformatics development to further increase the reliability of cancer biomarkers.
format Online
Article
Text
id pubmed-7428878
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-74288782020-08-20 Microarray Normalization Revisited for Reproducible Breast Cancer Biomarkers Kenn, Michael Cacsire Castillo-Tong, Dan Singer, Christian F. Cibena, Michael Kölbl, Heinz Schreiner, Wolfgang Biomed Res Int Research Article Precision medicine for breast cancer relies on biomarkers to select therapies. However, the reliability of biomarkers drawn from gene expression arrays has been questioned and calls for reassessment, in particular for large datasets. We revisit widely used data-normalization procedures and evaluate differences in outcome in order to pinpoint the most reliable reprocessing methods biomarkers can be based upon. We generated a database of 3753 breast cancer patients out of 38 studies by downloading and curating patient samples from NCBI-GEO. As gene-expression biomarkers, we select the assessment of receptor status and breast cancer subtype classification. Each normalization procedure is applied separately, and biomarkers are then evaluated for each patient. Differences between normalization pipelines are quantified as percentages of patients having outcomes different for each pipeline. Some normalization procedures lead to quite consistent biomarkers, differing only in 1-2% of patients. Other normalization procedures—some of them have been used in many clinical studies—end up with distrusting discrepancies (10% and more). A good deal of doubt regarding the reliability of microarrays may root in the haphazard application of inadequate preprocessing pipelines. Several modes of batch corrections are evaluated regarding a possible improvement of receptor prediction from gene expression versus the golden standard of immunohistochemistry. Finally, we nominate those normalization methods yielding consistent and trustable results. Adequate bioinformatics data preprocessing is key and crucial for any subsequent statistics to arrive at trustable results. We conclude with a suggestion for future bioinformatics development to further increase the reliability of cancer biomarkers. Hindawi 2020-08-06 /pmc/articles/PMC7428878/ /pubmed/32832541 http://dx.doi.org/10.1155/2020/1363827 Text en Copyright © 2020 Michael Kenn et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Kenn, Michael
Cacsire Castillo-Tong, Dan
Singer, Christian F.
Cibena, Michael
Kölbl, Heinz
Schreiner, Wolfgang
Microarray Normalization Revisited for Reproducible Breast Cancer Biomarkers
title Microarray Normalization Revisited for Reproducible Breast Cancer Biomarkers
title_full Microarray Normalization Revisited for Reproducible Breast Cancer Biomarkers
title_fullStr Microarray Normalization Revisited for Reproducible Breast Cancer Biomarkers
title_full_unstemmed Microarray Normalization Revisited for Reproducible Breast Cancer Biomarkers
title_short Microarray Normalization Revisited for Reproducible Breast Cancer Biomarkers
title_sort microarray normalization revisited for reproducible breast cancer biomarkers
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7428878/
https://www.ncbi.nlm.nih.gov/pubmed/32832541
http://dx.doi.org/10.1155/2020/1363827
work_keys_str_mv AT kennmichael microarraynormalizationrevisitedforreproduciblebreastcancerbiomarkers
AT cacsirecastillotongdan microarraynormalizationrevisitedforreproduciblebreastcancerbiomarkers
AT singerchristianf microarraynormalizationrevisitedforreproduciblebreastcancerbiomarkers
AT cibenamichael microarraynormalizationrevisitedforreproduciblebreastcancerbiomarkers
AT kolblheinz microarraynormalizationrevisitedforreproduciblebreastcancerbiomarkers
AT schreinerwolfgang microarraynormalizationrevisitedforreproduciblebreastcancerbiomarkers