Cargando…

Robust regression for periodicity detection in non-uniformly sampled time-course gene expression data

BACKGROUND: In practice many biological time series measurements, including gene microarrays, are conducted at time points that seem to be interesting in the biologist's opinion and not necessarily at fixed time intervals. In many circumstances we are interested in finding targets that are expr...

Descripción completa

Detalles Bibliográficos
Autores principales: Ahdesmäki, Miika, Lähdesmäki, Harri, Gracey, Andrew, Shmulevich, llya, Yli-Harja, Olli
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1934414/
https://www.ncbi.nlm.nih.gov/pubmed/17605777
http://dx.doi.org/10.1186/1471-2105-8-233
Descripción
Sumario:BACKGROUND: In practice many biological time series measurements, including gene microarrays, are conducted at time points that seem to be interesting in the biologist's opinion and not necessarily at fixed time intervals. In many circumstances we are interested in finding targets that are expressed periodically. To tackle the problems of uneven sampling and unknown type of noise in periodicity detection, we propose to use robust regression. METHODS: The aim of this paper is to develop a general framework for robust periodicity detection and review and rank different approaches by means of simulations. We also show the results for some real measurement data. RESULTS: The simulation results clearly show that when the sampling of time series gets more and more uneven, the methods that assume even sampling become unusable. We find that M-estimation provides a good compromise between robustness and computational efficiency. CONCLUSION: Since uneven sampling occurs often in biological measurements, the robust methods developed in this paper are expected to have many uses. The regression based formulation of the periodicity detection problem easily adapts to non-uniform sampling. Using robust regression helps to reject inconsistently behaving data points. AVAILABILITY: The implementations are currently available for Matlab and will be made available for the users of R as well. More information can be found in the web-supplement [1].