Cargando…

Quality assessment metrics for whole genome gene expression profiling of paraffin embedded samples

BACKGROUND: Formalin fixed, paraffin embedded tissues are most commonly used for routine pathology analysis and for long term tissue preservation in the clinical setting. Many institutions have large archives of Formalin fixed, paraffin embedded tissues that provide a unique opportunity for understa...

Descripción completa

Detalles Bibliográficos
Autores principales: Mahoney, Douglas W, Therneau, Terry M, Anderson, S Keith, Jen, Jin, Kocher, Jean-Pierre A, Reinholz, Monica M, Perez, Edith A, Eckel-Passow, Jeanette E
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3626608/
https://www.ncbi.nlm.nih.gov/pubmed/23360712
http://dx.doi.org/10.1186/1756-0500-6-33
_version_ 1782266213169102848
author Mahoney, Douglas W
Therneau, Terry M
Anderson, S Keith
Jen, Jin
Kocher, Jean-Pierre A
Reinholz, Monica M
Perez, Edith A
Eckel-Passow, Jeanette E
author_facet Mahoney, Douglas W
Therneau, Terry M
Anderson, S Keith
Jen, Jin
Kocher, Jean-Pierre A
Reinholz, Monica M
Perez, Edith A
Eckel-Passow, Jeanette E
author_sort Mahoney, Douglas W
collection PubMed
description BACKGROUND: Formalin fixed, paraffin embedded tissues are most commonly used for routine pathology analysis and for long term tissue preservation in the clinical setting. Many institutions have large archives of Formalin fixed, paraffin embedded tissues that provide a unique opportunity for understanding genomic signatures of disease. However, genome-wide expression profiling of Formalin fixed, paraffin embedded samples have been challenging due to RNA degradation. Because of the significant heterogeneity in tissue quality, normalization and analysis of these data presents particular challenges. The distribution of intensity values from archival tissues are inherently noisy and skewed due to differential sample degradation raising two primary concerns; whether a highly skewed array will unduly influence initial normalization of the data and whether outlier arrays can be reliably identified. FINDINGS: Two simple extensions of common regression diagnostic measures are introduced that measure the stress an array undergoes during normalization and how much a given array deviates from the remaining arrays post-normalization. These metrics are applied to a study involving 1618 formalin-fixed, paraffin-embedded HER2-positive breast cancer samples from the N9831 adjuvant trial processed with Illumina’s cDNA-mediated Annealing Selection extension and Ligation assay. CONCLUSION: Proper assessment of array quality within a research study is crucial for controlling unwanted variability in the data. The metrics proposed in this paper have direct biological interpretations and can be used to identify arrays that should either be removed from analysis all together or down-weighted to reduce their influence in downstream analyses.
format Online
Article
Text
id pubmed-3626608
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36266082013-04-23 Quality assessment metrics for whole genome gene expression profiling of paraffin embedded samples Mahoney, Douglas W Therneau, Terry M Anderson, S Keith Jen, Jin Kocher, Jean-Pierre A Reinholz, Monica M Perez, Edith A Eckel-Passow, Jeanette E BMC Res Notes Technical Note BACKGROUND: Formalin fixed, paraffin embedded tissues are most commonly used for routine pathology analysis and for long term tissue preservation in the clinical setting. Many institutions have large archives of Formalin fixed, paraffin embedded tissues that provide a unique opportunity for understanding genomic signatures of disease. However, genome-wide expression profiling of Formalin fixed, paraffin embedded samples have been challenging due to RNA degradation. Because of the significant heterogeneity in tissue quality, normalization and analysis of these data presents particular challenges. The distribution of intensity values from archival tissues are inherently noisy and skewed due to differential sample degradation raising two primary concerns; whether a highly skewed array will unduly influence initial normalization of the data and whether outlier arrays can be reliably identified. FINDINGS: Two simple extensions of common regression diagnostic measures are introduced that measure the stress an array undergoes during normalization and how much a given array deviates from the remaining arrays post-normalization. These metrics are applied to a study involving 1618 formalin-fixed, paraffin-embedded HER2-positive breast cancer samples from the N9831 adjuvant trial processed with Illumina’s cDNA-mediated Annealing Selection extension and Ligation assay. CONCLUSION: Proper assessment of array quality within a research study is crucial for controlling unwanted variability in the data. The metrics proposed in this paper have direct biological interpretations and can be used to identify arrays that should either be removed from analysis all together or down-weighted to reduce their influence in downstream analyses. BioMed Central 2013-01-30 /pmc/articles/PMC3626608/ /pubmed/23360712 http://dx.doi.org/10.1186/1756-0500-6-33 Text en Copyright © 2013 Mahoney et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Mahoney, Douglas W
Therneau, Terry M
Anderson, S Keith
Jen, Jin
Kocher, Jean-Pierre A
Reinholz, Monica M
Perez, Edith A
Eckel-Passow, Jeanette E
Quality assessment metrics for whole genome gene expression profiling of paraffin embedded samples
title Quality assessment metrics for whole genome gene expression profiling of paraffin embedded samples
title_full Quality assessment metrics for whole genome gene expression profiling of paraffin embedded samples
title_fullStr Quality assessment metrics for whole genome gene expression profiling of paraffin embedded samples
title_full_unstemmed Quality assessment metrics for whole genome gene expression profiling of paraffin embedded samples
title_short Quality assessment metrics for whole genome gene expression profiling of paraffin embedded samples
title_sort quality assessment metrics for whole genome gene expression profiling of paraffin embedded samples
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3626608/
https://www.ncbi.nlm.nih.gov/pubmed/23360712
http://dx.doi.org/10.1186/1756-0500-6-33
work_keys_str_mv AT mahoneydouglasw qualityassessmentmetricsforwholegenomegeneexpressionprofilingofparaffinembeddedsamples
AT therneauterrym qualityassessmentmetricsforwholegenomegeneexpressionprofilingofparaffinembeddedsamples
AT andersonskeith qualityassessmentmetricsforwholegenomegeneexpressionprofilingofparaffinembeddedsamples
AT jenjin qualityassessmentmetricsforwholegenomegeneexpressionprofilingofparaffinembeddedsamples
AT kocherjeanpierrea qualityassessmentmetricsforwholegenomegeneexpressionprofilingofparaffinembeddedsamples
AT reinholzmonicam qualityassessmentmetricsforwholegenomegeneexpressionprofilingofparaffinembeddedsamples
AT perezeditha qualityassessmentmetricsforwholegenomegeneexpressionprofilingofparaffinembeddedsamples
AT eckelpassowjeanettee qualityassessmentmetricsforwholegenomegeneexpressionprofilingofparaffinembeddedsamples