Cargando…
Microarray Normalization Revisited for Reproducible Breast Cancer Biomarkers
Precision medicine for breast cancer relies on biomarkers to select therapies. However, the reliability of biomarkers drawn from gene expression arrays has been questioned and calls for reassessment, in particular for large datasets. We revisit widely used data-normalization procedures and evaluate...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7428878/ https://www.ncbi.nlm.nih.gov/pubmed/32832541 http://dx.doi.org/10.1155/2020/1363827 |
_version_ | 1783571168152780800 |
---|---|
author | Kenn, Michael Cacsire Castillo-Tong, Dan Singer, Christian F. Cibena, Michael Kölbl, Heinz Schreiner, Wolfgang |
author_facet | Kenn, Michael Cacsire Castillo-Tong, Dan Singer, Christian F. Cibena, Michael Kölbl, Heinz Schreiner, Wolfgang |
author_sort | Kenn, Michael |
collection | PubMed |
description | Precision medicine for breast cancer relies on biomarkers to select therapies. However, the reliability of biomarkers drawn from gene expression arrays has been questioned and calls for reassessment, in particular for large datasets. We revisit widely used data-normalization procedures and evaluate differences in outcome in order to pinpoint the most reliable reprocessing methods biomarkers can be based upon. We generated a database of 3753 breast cancer patients out of 38 studies by downloading and curating patient samples from NCBI-GEO. As gene-expression biomarkers, we select the assessment of receptor status and breast cancer subtype classification. Each normalization procedure is applied separately, and biomarkers are then evaluated for each patient. Differences between normalization pipelines are quantified as percentages of patients having outcomes different for each pipeline. Some normalization procedures lead to quite consistent biomarkers, differing only in 1-2% of patients. Other normalization procedures—some of them have been used in many clinical studies—end up with distrusting discrepancies (10% and more). A good deal of doubt regarding the reliability of microarrays may root in the haphazard application of inadequate preprocessing pipelines. Several modes of batch corrections are evaluated regarding a possible improvement of receptor prediction from gene expression versus the golden standard of immunohistochemistry. Finally, we nominate those normalization methods yielding consistent and trustable results. Adequate bioinformatics data preprocessing is key and crucial for any subsequent statistics to arrive at trustable results. We conclude with a suggestion for future bioinformatics development to further increase the reliability of cancer biomarkers. |
format | Online Article Text |
id | pubmed-7428878 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-74288782020-08-20 Microarray Normalization Revisited for Reproducible Breast Cancer Biomarkers Kenn, Michael Cacsire Castillo-Tong, Dan Singer, Christian F. Cibena, Michael Kölbl, Heinz Schreiner, Wolfgang Biomed Res Int Research Article Precision medicine for breast cancer relies on biomarkers to select therapies. However, the reliability of biomarkers drawn from gene expression arrays has been questioned and calls for reassessment, in particular for large datasets. We revisit widely used data-normalization procedures and evaluate differences in outcome in order to pinpoint the most reliable reprocessing methods biomarkers can be based upon. We generated a database of 3753 breast cancer patients out of 38 studies by downloading and curating patient samples from NCBI-GEO. As gene-expression biomarkers, we select the assessment of receptor status and breast cancer subtype classification. Each normalization procedure is applied separately, and biomarkers are then evaluated for each patient. Differences between normalization pipelines are quantified as percentages of patients having outcomes different for each pipeline. Some normalization procedures lead to quite consistent biomarkers, differing only in 1-2% of patients. Other normalization procedures—some of them have been used in many clinical studies—end up with distrusting discrepancies (10% and more). A good deal of doubt regarding the reliability of microarrays may root in the haphazard application of inadequate preprocessing pipelines. Several modes of batch corrections are evaluated regarding a possible improvement of receptor prediction from gene expression versus the golden standard of immunohistochemistry. Finally, we nominate those normalization methods yielding consistent and trustable results. Adequate bioinformatics data preprocessing is key and crucial for any subsequent statistics to arrive at trustable results. We conclude with a suggestion for future bioinformatics development to further increase the reliability of cancer biomarkers. Hindawi 2020-08-06 /pmc/articles/PMC7428878/ /pubmed/32832541 http://dx.doi.org/10.1155/2020/1363827 Text en Copyright © 2020 Michael Kenn et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Kenn, Michael Cacsire Castillo-Tong, Dan Singer, Christian F. Cibena, Michael Kölbl, Heinz Schreiner, Wolfgang Microarray Normalization Revisited for Reproducible Breast Cancer Biomarkers |
title | Microarray Normalization Revisited for Reproducible Breast Cancer Biomarkers |
title_full | Microarray Normalization Revisited for Reproducible Breast Cancer Biomarkers |
title_fullStr | Microarray Normalization Revisited for Reproducible Breast Cancer Biomarkers |
title_full_unstemmed | Microarray Normalization Revisited for Reproducible Breast Cancer Biomarkers |
title_short | Microarray Normalization Revisited for Reproducible Breast Cancer Biomarkers |
title_sort | microarray normalization revisited for reproducible breast cancer biomarkers |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7428878/ https://www.ncbi.nlm.nih.gov/pubmed/32832541 http://dx.doi.org/10.1155/2020/1363827 |
work_keys_str_mv | AT kennmichael microarraynormalizationrevisitedforreproduciblebreastcancerbiomarkers AT cacsirecastillotongdan microarraynormalizationrevisitedforreproduciblebreastcancerbiomarkers AT singerchristianf microarraynormalizationrevisitedforreproduciblebreastcancerbiomarkers AT cibenamichael microarraynormalizationrevisitedforreproduciblebreastcancerbiomarkers AT kolblheinz microarraynormalizationrevisitedforreproduciblebreastcancerbiomarkers AT schreinerwolfgang microarraynormalizationrevisitedforreproduciblebreastcancerbiomarkers |