Cargando…
How cyanobacteria pose new problems to old methods: challenges in microarray time series analysis
BACKGROUND: The transcriptomes of several cyanobacterial strains have been shown to exhibit diurnal oscillation patterns, reflecting the diurnal phototrophic lifestyle of the organisms. The analysis of such genome-wide transcriptional oscillations is often facilitated by the use of clustering algori...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3679775/ https://www.ncbi.nlm.nih.gov/pubmed/23601192 http://dx.doi.org/10.1186/1471-2105-14-133 |
_version_ | 1782273013715042304 |
---|---|
author | Lehmann, Robert Machné, Rainer Georg, Jens Benary, Manuela Axmann, Ilka Steuer, Ralf |
author_facet | Lehmann, Robert Machné, Rainer Georg, Jens Benary, Manuela Axmann, Ilka Steuer, Ralf |
author_sort | Lehmann, Robert |
collection | PubMed |
description | BACKGROUND: The transcriptomes of several cyanobacterial strains have been shown to exhibit diurnal oscillation patterns, reflecting the diurnal phototrophic lifestyle of the organisms. The analysis of such genome-wide transcriptional oscillations is often facilitated by the use of clustering algorithms in conjunction with a number of pre-processing steps. Biological interpretation is usually focussed on the time and phase of expression of the resulting groups of genes. However, the use of microarray technology in such studies requires the normalization of pre-processing data, with unclear impact on the qualitative and quantitative features of the derived information on the number of oscillating transcripts and their respective phases. RESULTS: A microarray based evaluation of diurnal expression in the cyanobacterium Synechocystis sp. PCC 6803 is presented. As expected, the temporal expression patterns reveal strong oscillations in transcript abundance. We compare the Fourier transformation-based expression phase before and after the application of quantile normalization, median polishing, cyclical LOESS, and least oscillating set (LOS) normalization. Whereas LOS normalization mostly preserves the phases of the raw data, the remaining methods introduce systematic biases. In particular, quantile-normalization is found to introduce a phase-shift of 180°, effectively changing night-expressed genes into day-expressed ones. Comparison of a large number of clustering results of differently normalized data shows that the normalization method determines the result. Subsequent steps, such as the choice of data transformation, similarity measure, and clustering algorithm, only play minor roles. We find that the standardization and the DTF transformation are favorable for the clustering of time series in contrast to the 12 m transformation. We use the cluster-wise functional enrichment of a clustering derived by LOS normalization, clustering using flowClust, and DFT transformation to derive the diurnal biological program of Synechocystis sp.. CONCLUSION: Application of quantile normalization, median polishing, and also cyclic LOESS normalization of the presented cyanobacterial dataset lead to increased numbers of oscillating genes and the systematic shift of the expression phase. The LOS normalization minimizes the observed detrimental effects. As previous analyses employed a variety of different normalization methods, a direct comparison of results must be treated with caution. |
format | Online Article Text |
id | pubmed-3679775 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-36797752013-06-25 How cyanobacteria pose new problems to old methods: challenges in microarray time series analysis Lehmann, Robert Machné, Rainer Georg, Jens Benary, Manuela Axmann, Ilka Steuer, Ralf BMC Bioinformatics Methodology Article BACKGROUND: The transcriptomes of several cyanobacterial strains have been shown to exhibit diurnal oscillation patterns, reflecting the diurnal phototrophic lifestyle of the organisms. The analysis of such genome-wide transcriptional oscillations is often facilitated by the use of clustering algorithms in conjunction with a number of pre-processing steps. Biological interpretation is usually focussed on the time and phase of expression of the resulting groups of genes. However, the use of microarray technology in such studies requires the normalization of pre-processing data, with unclear impact on the qualitative and quantitative features of the derived information on the number of oscillating transcripts and their respective phases. RESULTS: A microarray based evaluation of diurnal expression in the cyanobacterium Synechocystis sp. PCC 6803 is presented. As expected, the temporal expression patterns reveal strong oscillations in transcript abundance. We compare the Fourier transformation-based expression phase before and after the application of quantile normalization, median polishing, cyclical LOESS, and least oscillating set (LOS) normalization. Whereas LOS normalization mostly preserves the phases of the raw data, the remaining methods introduce systematic biases. In particular, quantile-normalization is found to introduce a phase-shift of 180°, effectively changing night-expressed genes into day-expressed ones. Comparison of a large number of clustering results of differently normalized data shows that the normalization method determines the result. Subsequent steps, such as the choice of data transformation, similarity measure, and clustering algorithm, only play minor roles. We find that the standardization and the DTF transformation are favorable for the clustering of time series in contrast to the 12 m transformation. We use the cluster-wise functional enrichment of a clustering derived by LOS normalization, clustering using flowClust, and DFT transformation to derive the diurnal biological program of Synechocystis sp.. CONCLUSION: Application of quantile normalization, median polishing, and also cyclic LOESS normalization of the presented cyanobacterial dataset lead to increased numbers of oscillating genes and the systematic shift of the expression phase. The LOS normalization minimizes the observed detrimental effects. As previous analyses employed a variety of different normalization methods, a direct comparison of results must be treated with caution. BioMed Central 2013-04-21 /pmc/articles/PMC3679775/ /pubmed/23601192 http://dx.doi.org/10.1186/1471-2105-14-133 Text en Copyright © 2013 Lehmann et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Lehmann, Robert Machné, Rainer Georg, Jens Benary, Manuela Axmann, Ilka Steuer, Ralf How cyanobacteria pose new problems to old methods: challenges in microarray time series analysis |
title | How cyanobacteria pose new problems to old methods: challenges in microarray time series analysis |
title_full | How cyanobacteria pose new problems to old methods: challenges in microarray time series analysis |
title_fullStr | How cyanobacteria pose new problems to old methods: challenges in microarray time series analysis |
title_full_unstemmed | How cyanobacteria pose new problems to old methods: challenges in microarray time series analysis |
title_short | How cyanobacteria pose new problems to old methods: challenges in microarray time series analysis |
title_sort | how cyanobacteria pose new problems to old methods: challenges in microarray time series analysis |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3679775/ https://www.ncbi.nlm.nih.gov/pubmed/23601192 http://dx.doi.org/10.1186/1471-2105-14-133 |
work_keys_str_mv | AT lehmannrobert howcyanobacteriaposenewproblemstooldmethodschallengesinmicroarraytimeseriesanalysis AT machnerainer howcyanobacteriaposenewproblemstooldmethodschallengesinmicroarraytimeseriesanalysis AT georgjens howcyanobacteriaposenewproblemstooldmethodschallengesinmicroarraytimeseriesanalysis AT benarymanuela howcyanobacteriaposenewproblemstooldmethodschallengesinmicroarraytimeseriesanalysis AT axmannilka howcyanobacteriaposenewproblemstooldmethodschallengesinmicroarraytimeseriesanalysis AT steuerralf howcyanobacteriaposenewproblemstooldmethodschallengesinmicroarraytimeseriesanalysis |