Cargando…
A Linear Mixed Model Spline Framework for Analysing Time Course ‘Omics’ Data
Time course ‘omics’ experiments are becoming increasingly important to study system-wide dynamic regulation. Despite their high information content, analysis remains challenging. ‘Omics’ technologies capture quantitative measurements on tens of thousands of molecules. Therefore, in a time course ‘om...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4551847/ https://www.ncbi.nlm.nih.gov/pubmed/26313144 http://dx.doi.org/10.1371/journal.pone.0134540 |
_version_ | 1782387631246540800 |
---|---|
author | Straube, Jasmin Gorse, Alain-Dominique Huang, Bevan Emma Lê Cao, Kim-Anh |
author_facet | Straube, Jasmin Gorse, Alain-Dominique Huang, Bevan Emma Lê Cao, Kim-Anh |
author_sort | Straube, Jasmin |
collection | PubMed |
description | Time course ‘omics’ experiments are becoming increasingly important to study system-wide dynamic regulation. Despite their high information content, analysis remains challenging. ‘Omics’ technologies capture quantitative measurements on tens of thousands of molecules. Therefore, in a time course ‘omics’ experiment molecules are measured for multiple subjects over multiple time points. This results in a large, high-dimensional dataset, which requires computationally efficient approaches for statistical analysis. Moreover, methods need to be able to handle missing values and various levels of noise. We present a novel, robust and powerful framework to analyze time course ‘omics’ data that consists of three stages: quality assessment and filtering, profile modelling, and analysis. The first step consists of removing molecules for which expression or abundance is highly variable over time. The second step models each molecular expression profile in a linear mixed model framework which takes into account subject-specific variability. The best model is selected through a serial model selection approach and results in dimension reduction of the time course data. The final step includes two types of analysis of the modelled trajectories, namely, clustering analysis to identify groups of correlated profiles over time, and differential expression analysis to identify profiles which differ over time and/or between treatment groups. Through simulation studies we demonstrate the high sensitivity and specificity of our approach for differential expression analysis. We then illustrate how our framework can bring novel insights on two time course ‘omics’ studies in breast cancer and kidney rejection. The methods are publicly available, implemented in the R CRAN package lmms. |
format | Online Article Text |
id | pubmed-4551847 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-45518472015-09-01 A Linear Mixed Model Spline Framework for Analysing Time Course ‘Omics’ Data Straube, Jasmin Gorse, Alain-Dominique Huang, Bevan Emma Lê Cao, Kim-Anh PLoS One Research Article Time course ‘omics’ experiments are becoming increasingly important to study system-wide dynamic regulation. Despite their high information content, analysis remains challenging. ‘Omics’ technologies capture quantitative measurements on tens of thousands of molecules. Therefore, in a time course ‘omics’ experiment molecules are measured for multiple subjects over multiple time points. This results in a large, high-dimensional dataset, which requires computationally efficient approaches for statistical analysis. Moreover, methods need to be able to handle missing values and various levels of noise. We present a novel, robust and powerful framework to analyze time course ‘omics’ data that consists of three stages: quality assessment and filtering, profile modelling, and analysis. The first step consists of removing molecules for which expression or abundance is highly variable over time. The second step models each molecular expression profile in a linear mixed model framework which takes into account subject-specific variability. The best model is selected through a serial model selection approach and results in dimension reduction of the time course data. The final step includes two types of analysis of the modelled trajectories, namely, clustering analysis to identify groups of correlated profiles over time, and differential expression analysis to identify profiles which differ over time and/or between treatment groups. Through simulation studies we demonstrate the high sensitivity and specificity of our approach for differential expression analysis. We then illustrate how our framework can bring novel insights on two time course ‘omics’ studies in breast cancer and kidney rejection. The methods are publicly available, implemented in the R CRAN package lmms. Public Library of Science 2015-08-27 /pmc/articles/PMC4551847/ /pubmed/26313144 http://dx.doi.org/10.1371/journal.pone.0134540 Text en © 2015 Straube et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Straube, Jasmin Gorse, Alain-Dominique Huang, Bevan Emma Lê Cao, Kim-Anh A Linear Mixed Model Spline Framework for Analysing Time Course ‘Omics’ Data |
title | A Linear Mixed Model Spline Framework for Analysing Time Course ‘Omics’ Data |
title_full | A Linear Mixed Model Spline Framework for Analysing Time Course ‘Omics’ Data |
title_fullStr | A Linear Mixed Model Spline Framework for Analysing Time Course ‘Omics’ Data |
title_full_unstemmed | A Linear Mixed Model Spline Framework for Analysing Time Course ‘Omics’ Data |
title_short | A Linear Mixed Model Spline Framework for Analysing Time Course ‘Omics’ Data |
title_sort | linear mixed model spline framework for analysing time course ‘omics’ data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4551847/ https://www.ncbi.nlm.nih.gov/pubmed/26313144 http://dx.doi.org/10.1371/journal.pone.0134540 |
work_keys_str_mv | AT straubejasmin alinearmixedmodelsplineframeworkforanalysingtimecourseomicsdata AT gorsealaindominique alinearmixedmodelsplineframeworkforanalysingtimecourseomicsdata AT alinearmixedmodelsplineframeworkforanalysingtimecourseomicsdata AT huangbevanemma alinearmixedmodelsplineframeworkforanalysingtimecourseomicsdata AT lecaokimanh alinearmixedmodelsplineframeworkforanalysingtimecourseomicsdata AT straubejasmin linearmixedmodelsplineframeworkforanalysingtimecourseomicsdata AT gorsealaindominique linearmixedmodelsplineframeworkforanalysingtimecourseomicsdata AT linearmixedmodelsplineframeworkforanalysingtimecourseomicsdata AT huangbevanemma linearmixedmodelsplineframeworkforanalysingtimecourseomicsdata AT lecaokimanh linearmixedmodelsplineframeworkforanalysingtimecourseomicsdata |