Cargando…

Combined statistical modeling enables accurate mining of circadian transcription

Circadian-regulated genes are essential for tissue homeostasis and organismal function, and are therefore common targets of scrutiny. Detection of rhythmic genes using current analytical tools requires exhaustive sampling, a demand that is costly and raises ethical concerns, making it unfeasible in...

Descripción completa

Detalles Bibliográficos
Autores principales: Rubio-Ponce, Andrea, Ballesteros, Iván, Quintana, Juan A, Solanas, Guiomar, Benitah, Salvador A, Hidalgo, Andrés, Sánchez-Cabo, Fátima
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8074341/
https://www.ncbi.nlm.nih.gov/pubmed/33937766
http://dx.doi.org/10.1093/nargab/lqab031
_version_ 1783684333422247936
author Rubio-Ponce, Andrea
Ballesteros, Iván
Quintana, Juan A
Solanas, Guiomar
Benitah, Salvador A
Hidalgo, Andrés
Sánchez-Cabo, Fátima
author_facet Rubio-Ponce, Andrea
Ballesteros, Iván
Quintana, Juan A
Solanas, Guiomar
Benitah, Salvador A
Hidalgo, Andrés
Sánchez-Cabo, Fátima
author_sort Rubio-Ponce, Andrea
collection PubMed
description Circadian-regulated genes are essential for tissue homeostasis and organismal function, and are therefore common targets of scrutiny. Detection of rhythmic genes using current analytical tools requires exhaustive sampling, a demand that is costly and raises ethical concerns, making it unfeasible in certain mammalian systems. Several non-parametric methods have been commonly used to analyze short-term (24 h) circadian data, such as JTK_cycle and MetaCycle. However, algorithm performance varies greatly depending on various biological and technical factors. Here, we present CircaN, an ad-hoc implementation of a non-linear mixed model for the identification of circadian genes in all types of omics data. Based on the variable but complementary results obtained through several biological and in silico datasets, we propose a combined approach of CircaN and non-parametric models to dramatically improve the number of circadian genes detected, without affecting accuracy. We also introduce an R package to make this approach available to the community.
format Online
Article
Text
id pubmed-8074341
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-80743412021-04-29 Combined statistical modeling enables accurate mining of circadian transcription Rubio-Ponce, Andrea Ballesteros, Iván Quintana, Juan A Solanas, Guiomar Benitah, Salvador A Hidalgo, Andrés Sánchez-Cabo, Fátima NAR Genom Bioinform Standard Article Circadian-regulated genes are essential for tissue homeostasis and organismal function, and are therefore common targets of scrutiny. Detection of rhythmic genes using current analytical tools requires exhaustive sampling, a demand that is costly and raises ethical concerns, making it unfeasible in certain mammalian systems. Several non-parametric methods have been commonly used to analyze short-term (24 h) circadian data, such as JTK_cycle and MetaCycle. However, algorithm performance varies greatly depending on various biological and technical factors. Here, we present CircaN, an ad-hoc implementation of a non-linear mixed model for the identification of circadian genes in all types of omics data. Based on the variable but complementary results obtained through several biological and in silico datasets, we propose a combined approach of CircaN and non-parametric models to dramatically improve the number of circadian genes detected, without affecting accuracy. We also introduce an R package to make this approach available to the community. Oxford University Press 2021-04-26 /pmc/articles/PMC8074341/ /pubmed/33937766 http://dx.doi.org/10.1093/nargab/lqab031 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Standard Article
Rubio-Ponce, Andrea
Ballesteros, Iván
Quintana, Juan A
Solanas, Guiomar
Benitah, Salvador A
Hidalgo, Andrés
Sánchez-Cabo, Fátima
Combined statistical modeling enables accurate mining of circadian transcription
title Combined statistical modeling enables accurate mining of circadian transcription
title_full Combined statistical modeling enables accurate mining of circadian transcription
title_fullStr Combined statistical modeling enables accurate mining of circadian transcription
title_full_unstemmed Combined statistical modeling enables accurate mining of circadian transcription
title_short Combined statistical modeling enables accurate mining of circadian transcription
title_sort combined statistical modeling enables accurate mining of circadian transcription
topic Standard Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8074341/
https://www.ncbi.nlm.nih.gov/pubmed/33937766
http://dx.doi.org/10.1093/nargab/lqab031
work_keys_str_mv AT rubioponceandrea combinedstatisticalmodelingenablesaccurateminingofcircadiantranscription
AT ballesterosivan combinedstatisticalmodelingenablesaccurateminingofcircadiantranscription
AT quintanajuana combinedstatisticalmodelingenablesaccurateminingofcircadiantranscription
AT solanasguiomar combinedstatisticalmodelingenablesaccurateminingofcircadiantranscription
AT benitahsalvadora combinedstatisticalmodelingenablesaccurateminingofcircadiantranscription
AT hidalgoandres combinedstatisticalmodelingenablesaccurateminingofcircadiantranscription
AT sanchezcabofatima combinedstatisticalmodelingenablesaccurateminingofcircadiantranscription