Cargando…

How cyanobacteria pose new problems to old methods: challenges in microarray time series analysis

BACKGROUND: The transcriptomes of several cyanobacterial strains have been shown to exhibit diurnal oscillation patterns, reflecting the diurnal phototrophic lifestyle of the organisms. The analysis of such genome-wide transcriptional oscillations is often facilitated by the use of clustering algori...

Descripción completa

Detalles Bibliográficos
Autores principales: Lehmann, Robert, Machné, Rainer, Georg, Jens, Benary, Manuela, Axmann, Ilka, Steuer, Ralf
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3679775/
https://www.ncbi.nlm.nih.gov/pubmed/23601192
http://dx.doi.org/10.1186/1471-2105-14-133
_version_ 1782273013715042304
author Lehmann, Robert
Machné, Rainer
Georg, Jens
Benary, Manuela
Axmann, Ilka
Steuer, Ralf
author_facet Lehmann, Robert
Machné, Rainer
Georg, Jens
Benary, Manuela
Axmann, Ilka
Steuer, Ralf
author_sort Lehmann, Robert
collection PubMed
description BACKGROUND: The transcriptomes of several cyanobacterial strains have been shown to exhibit diurnal oscillation patterns, reflecting the diurnal phototrophic lifestyle of the organisms. The analysis of such genome-wide transcriptional oscillations is often facilitated by the use of clustering algorithms in conjunction with a number of pre-processing steps. Biological interpretation is usually focussed on the time and phase of expression of the resulting groups of genes. However, the use of microarray technology in such studies requires the normalization of pre-processing data, with unclear impact on the qualitative and quantitative features of the derived information on the number of oscillating transcripts and their respective phases. RESULTS: A microarray based evaluation of diurnal expression in the cyanobacterium Synechocystis sp. PCC 6803 is presented. As expected, the temporal expression patterns reveal strong oscillations in transcript abundance. We compare the Fourier transformation-based expression phase before and after the application of quantile normalization, median polishing, cyclical LOESS, and least oscillating set (LOS) normalization. Whereas LOS normalization mostly preserves the phases of the raw data, the remaining methods introduce systematic biases. In particular, quantile-normalization is found to introduce a phase-shift of 180°, effectively changing night-expressed genes into day-expressed ones. Comparison of a large number of clustering results of differently normalized data shows that the normalization method determines the result. Subsequent steps, such as the choice of data transformation, similarity measure, and clustering algorithm, only play minor roles. We find that the standardization and the DTF transformation are favorable for the clustering of time series in contrast to the 12 m transformation. We use the cluster-wise functional enrichment of a clustering derived by LOS normalization, clustering using flowClust, and DFT transformation to derive the diurnal biological program of Synechocystis sp.. CONCLUSION: Application of quantile normalization, median polishing, and also cyclic LOESS normalization of the presented cyanobacterial dataset lead to increased numbers of oscillating genes and the systematic shift of the expression phase. The LOS normalization minimizes the observed detrimental effects. As previous analyses employed a variety of different normalization methods, a direct comparison of results must be treated with caution.
format Online
Article
Text
id pubmed-3679775
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36797752013-06-25 How cyanobacteria pose new problems to old methods: challenges in microarray time series analysis Lehmann, Robert Machné, Rainer Georg, Jens Benary, Manuela Axmann, Ilka Steuer, Ralf BMC Bioinformatics Methodology Article BACKGROUND: The transcriptomes of several cyanobacterial strains have been shown to exhibit diurnal oscillation patterns, reflecting the diurnal phototrophic lifestyle of the organisms. The analysis of such genome-wide transcriptional oscillations is often facilitated by the use of clustering algorithms in conjunction with a number of pre-processing steps. Biological interpretation is usually focussed on the time and phase of expression of the resulting groups of genes. However, the use of microarray technology in such studies requires the normalization of pre-processing data, with unclear impact on the qualitative and quantitative features of the derived information on the number of oscillating transcripts and their respective phases. RESULTS: A microarray based evaluation of diurnal expression in the cyanobacterium Synechocystis sp. PCC 6803 is presented. As expected, the temporal expression patterns reveal strong oscillations in transcript abundance. We compare the Fourier transformation-based expression phase before and after the application of quantile normalization, median polishing, cyclical LOESS, and least oscillating set (LOS) normalization. Whereas LOS normalization mostly preserves the phases of the raw data, the remaining methods introduce systematic biases. In particular, quantile-normalization is found to introduce a phase-shift of 180°, effectively changing night-expressed genes into day-expressed ones. Comparison of a large number of clustering results of differently normalized data shows that the normalization method determines the result. Subsequent steps, such as the choice of data transformation, similarity measure, and clustering algorithm, only play minor roles. We find that the standardization and the DTF transformation are favorable for the clustering of time series in contrast to the 12 m transformation. We use the cluster-wise functional enrichment of a clustering derived by LOS normalization, clustering using flowClust, and DFT transformation to derive the diurnal biological program of Synechocystis sp.. CONCLUSION: Application of quantile normalization, median polishing, and also cyclic LOESS normalization of the presented cyanobacterial dataset lead to increased numbers of oscillating genes and the systematic shift of the expression phase. The LOS normalization minimizes the observed detrimental effects. As previous analyses employed a variety of different normalization methods, a direct comparison of results must be treated with caution. BioMed Central 2013-04-21 /pmc/articles/PMC3679775/ /pubmed/23601192 http://dx.doi.org/10.1186/1471-2105-14-133 Text en Copyright © 2013 Lehmann et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Lehmann, Robert
Machné, Rainer
Georg, Jens
Benary, Manuela
Axmann, Ilka
Steuer, Ralf
How cyanobacteria pose new problems to old methods: challenges in microarray time series analysis
title How cyanobacteria pose new problems to old methods: challenges in microarray time series analysis
title_full How cyanobacteria pose new problems to old methods: challenges in microarray time series analysis
title_fullStr How cyanobacteria pose new problems to old methods: challenges in microarray time series analysis
title_full_unstemmed How cyanobacteria pose new problems to old methods: challenges in microarray time series analysis
title_short How cyanobacteria pose new problems to old methods: challenges in microarray time series analysis
title_sort how cyanobacteria pose new problems to old methods: challenges in microarray time series analysis
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3679775/
https://www.ncbi.nlm.nih.gov/pubmed/23601192
http://dx.doi.org/10.1186/1471-2105-14-133
work_keys_str_mv AT lehmannrobert howcyanobacteriaposenewproblemstooldmethodschallengesinmicroarraytimeseriesanalysis
AT machnerainer howcyanobacteriaposenewproblemstooldmethodschallengesinmicroarraytimeseriesanalysis
AT georgjens howcyanobacteriaposenewproblemstooldmethodschallengesinmicroarraytimeseriesanalysis
AT benarymanuela howcyanobacteriaposenewproblemstooldmethodschallengesinmicroarraytimeseriesanalysis
AT axmannilka howcyanobacteriaposenewproblemstooldmethodschallengesinmicroarraytimeseriesanalysis
AT steuerralf howcyanobacteriaposenewproblemstooldmethodschallengesinmicroarraytimeseriesanalysis