Cargando…
Preprocessing Steps for Agilent MicroRNA Arrays: Does the Order Matter?
MOTIVATION/BACKGROUND: Previous publications on microarray preprocessing mostly focused on method development or comparison for an individual preprocessing step. Very few, if any, focused on recommending an effective ordering of the preprocessing steps, in particular, normalization in relationship t...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Libertas Academica
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4560483/ https://www.ncbi.nlm.nih.gov/pubmed/26380547 http://dx.doi.org/10.4137/CIN.S21630 |
Sumario: | MOTIVATION/BACKGROUND: Previous publications on microarray preprocessing mostly focused on method development or comparison for an individual preprocessing step. Very few, if any, focused on recommending an effective ordering of the preprocessing steps, in particular, normalization in relationship to log transformation and probe set summarization. In this study, we aim to study how the relative ordering of the preprocessing steps influences differential expression analysis for Agilent microRNA array data. METHODS: A set of 192 untreated primary gynecologic tumor samples (96 endometrial tumors and 96 ovarian tumors) were collected at Memorial Sloan Kettering Cancer Center during the period of 2000–2012. From this same sample set, two datasets were generated: one dataset had no confounding array effects by experimental design and served as the benchmark, and another dataset exhibited array effects and served as the test data. We preprocessed our test dataset using different orderings between the following three steps: quantile normalization, log transformation, and median summarization. Differential expression analysis was performed on each preprocessed test dataset, and the results were compared against the results from the benchmark dataset. True positive rate, false positive rate, and false discovery rate were used to assess the effectiveness of the orderings. RESULTS: The ordering of log transformation, quantile normalization (on probe-level data), and median summarization slightly outperforms the other orderings. CONCLUSION: Our results ease the anxiety over the uncertain effect that the orderings could have on the analysis of Agilent microRNA array data. |
---|