Cargando…

A Linear Mixed Model Spline Framework for Analysing Time Course ‘Omics’ Data

Time course ‘omics’ experiments are becoming increasingly important to study system-wide dynamic regulation. Despite their high information content, analysis remains challenging. ‘Omics’ technologies capture quantitative measurements on tens of thousands of molecules. Therefore, in a time course ‘om...

Descripción completa

Detalles Bibliográficos
Autores principales: Straube, Jasmin, Gorse, Alain-Dominique, Huang, Bevan Emma, Lê Cao, Kim-Anh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4551847/
https://www.ncbi.nlm.nih.gov/pubmed/26313144
http://dx.doi.org/10.1371/journal.pone.0134540
_version_ 1782387631246540800
author Straube, Jasmin
Gorse, Alain-Dominique
Huang, Bevan Emma
Lê Cao, Kim-Anh
author_facet Straube, Jasmin
Gorse, Alain-Dominique
Huang, Bevan Emma
Lê Cao, Kim-Anh
author_sort Straube, Jasmin
collection PubMed
description Time course ‘omics’ experiments are becoming increasingly important to study system-wide dynamic regulation. Despite their high information content, analysis remains challenging. ‘Omics’ technologies capture quantitative measurements on tens of thousands of molecules. Therefore, in a time course ‘omics’ experiment molecules are measured for multiple subjects over multiple time points. This results in a large, high-dimensional dataset, which requires computationally efficient approaches for statistical analysis. Moreover, methods need to be able to handle missing values and various levels of noise. We present a novel, robust and powerful framework to analyze time course ‘omics’ data that consists of three stages: quality assessment and filtering, profile modelling, and analysis. The first step consists of removing molecules for which expression or abundance is highly variable over time. The second step models each molecular expression profile in a linear mixed model framework which takes into account subject-specific variability. The best model is selected through a serial model selection approach and results in dimension reduction of the time course data. The final step includes two types of analysis of the modelled trajectories, namely, clustering analysis to identify groups of correlated profiles over time, and differential expression analysis to identify profiles which differ over time and/or between treatment groups. Through simulation studies we demonstrate the high sensitivity and specificity of our approach for differential expression analysis. We then illustrate how our framework can bring novel insights on two time course ‘omics’ studies in breast cancer and kidney rejection. The methods are publicly available, implemented in the R CRAN package lmms.
format Online
Article
Text
id pubmed-4551847
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-45518472015-09-01 A Linear Mixed Model Spline Framework for Analysing Time Course ‘Omics’ Data Straube, Jasmin Gorse, Alain-Dominique Huang, Bevan Emma Lê Cao, Kim-Anh PLoS One Research Article Time course ‘omics’ experiments are becoming increasingly important to study system-wide dynamic regulation. Despite their high information content, analysis remains challenging. ‘Omics’ technologies capture quantitative measurements on tens of thousands of molecules. Therefore, in a time course ‘omics’ experiment molecules are measured for multiple subjects over multiple time points. This results in a large, high-dimensional dataset, which requires computationally efficient approaches for statistical analysis. Moreover, methods need to be able to handle missing values and various levels of noise. We present a novel, robust and powerful framework to analyze time course ‘omics’ data that consists of three stages: quality assessment and filtering, profile modelling, and analysis. The first step consists of removing molecules for which expression or abundance is highly variable over time. The second step models each molecular expression profile in a linear mixed model framework which takes into account subject-specific variability. The best model is selected through a serial model selection approach and results in dimension reduction of the time course data. The final step includes two types of analysis of the modelled trajectories, namely, clustering analysis to identify groups of correlated profiles over time, and differential expression analysis to identify profiles which differ over time and/or between treatment groups. Through simulation studies we demonstrate the high sensitivity and specificity of our approach for differential expression analysis. We then illustrate how our framework can bring novel insights on two time course ‘omics’ studies in breast cancer and kidney rejection. The methods are publicly available, implemented in the R CRAN package lmms. Public Library of Science 2015-08-27 /pmc/articles/PMC4551847/ /pubmed/26313144 http://dx.doi.org/10.1371/journal.pone.0134540 Text en © 2015 Straube et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Straube, Jasmin
Gorse, Alain-Dominique
Huang, Bevan Emma
Lê Cao, Kim-Anh
A Linear Mixed Model Spline Framework for Analysing Time Course ‘Omics’ Data
title A Linear Mixed Model Spline Framework for Analysing Time Course ‘Omics’ Data
title_full A Linear Mixed Model Spline Framework for Analysing Time Course ‘Omics’ Data
title_fullStr A Linear Mixed Model Spline Framework for Analysing Time Course ‘Omics’ Data
title_full_unstemmed A Linear Mixed Model Spline Framework for Analysing Time Course ‘Omics’ Data
title_short A Linear Mixed Model Spline Framework for Analysing Time Course ‘Omics’ Data
title_sort linear mixed model spline framework for analysing time course ‘omics’ data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4551847/
https://www.ncbi.nlm.nih.gov/pubmed/26313144
http://dx.doi.org/10.1371/journal.pone.0134540
work_keys_str_mv AT straubejasmin alinearmixedmodelsplineframeworkforanalysingtimecourseomicsdata
AT gorsealaindominique alinearmixedmodelsplineframeworkforanalysingtimecourseomicsdata
AT alinearmixedmodelsplineframeworkforanalysingtimecourseomicsdata
AT huangbevanemma alinearmixedmodelsplineframeworkforanalysingtimecourseomicsdata
AT lecaokimanh alinearmixedmodelsplineframeworkforanalysingtimecourseomicsdata
AT straubejasmin linearmixedmodelsplineframeworkforanalysingtimecourseomicsdata
AT gorsealaindominique linearmixedmodelsplineframeworkforanalysingtimecourseomicsdata
AT linearmixedmodelsplineframeworkforanalysingtimecourseomicsdata
AT huangbevanemma linearmixedmodelsplineframeworkforanalysingtimecourseomicsdata
AT lecaokimanh linearmixedmodelsplineframeworkforanalysingtimecourseomicsdata