Cargando…

Unraveling gene regulatory networks from time-resolved gene expression data -- a measures comparison study

BACKGROUND: Inferring regulatory interactions between genes from transcriptomics time-resolved data, yielding reverse engineered gene regulatory networks, is of paramount importance to systems biology and bioinformatics studies. Accurate methods to address this problem can ultimately provide a deepe...

Descripción completa

Detalles Bibliográficos
Autores principales: Hempel, Sabrina, Koseska, Aneta, Nikoloski, Zoran, Kurths, Jürgen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3161045/
https://www.ncbi.nlm.nih.gov/pubmed/21771321
http://dx.doi.org/10.1186/1471-2105-12-292
_version_ 1782210630727499776
author Hempel, Sabrina
Koseska, Aneta
Nikoloski, Zoran
Kurths, Jürgen
author_facet Hempel, Sabrina
Koseska, Aneta
Nikoloski, Zoran
Kurths, Jürgen
author_sort Hempel, Sabrina
collection PubMed
description BACKGROUND: Inferring regulatory interactions between genes from transcriptomics time-resolved data, yielding reverse engineered gene regulatory networks, is of paramount importance to systems biology and bioinformatics studies. Accurate methods to address this problem can ultimately provide a deeper insight into the complexity, behavior, and functions of the underlying biological systems. However, the large number of interacting genes coupled with short and often noisy time-resolved read-outs of the system renders the reverse engineering a challenging task. Therefore, the development and assessment of methods which are computationally efficient, robust against noise, applicable to short time series data, and preferably capable of reconstructing the directionality of the regulatory interactions remains a pressing research problem with valuable applications. RESULTS: Here we perform the largest systematic analysis of a set of similarity measures and scoring schemes within the scope of the relevance network approach which are commonly used for gene regulatory network reconstruction from time series data. In addition, we define and analyze several novel measures and schemes which are particularly suitable for short transcriptomics time series. We also compare the considered 21 measures and 6 scoring schemes according to their ability to correctly reconstruct such networks from short time series data by calculating summary statistics based on the corresponding specificity and sensitivity. Our results demonstrate that rank and symbol based measures have the highest performance in inferring regulatory interactions. In addition, the proposed scoring scheme by asymmetric weighting has shown to be valuable in reducing the number of false positive interactions. On the other hand, Granger causality as well as information-theoretic measures, frequently used in inference of regulatory networks, show low performance on the short time series analyzed in this study. CONCLUSIONS: Our study is intended to serve as a guide for choosing a particular combination of similarity measures and scoring schemes suitable for reconstruction of gene regulatory networks from short time series data. We show that further improvement of algorithms for reverse engineering can be obtained if one considers measures that are rooted in the study of symbolic dynamics or ranks, in contrast to the application of common similarity measures which do not consider the temporal character of the employed data. Moreover, we establish that the asymmetric weighting scoring scheme together with symbol based measures (for low noise level) and rank based measures (for high noise level) are the most suitable choices.
format Online
Article
Text
id pubmed-3161045
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31610452011-08-25 Unraveling gene regulatory networks from time-resolved gene expression data -- a measures comparison study Hempel, Sabrina Koseska, Aneta Nikoloski, Zoran Kurths, Jürgen BMC Bioinformatics Research Article BACKGROUND: Inferring regulatory interactions between genes from transcriptomics time-resolved data, yielding reverse engineered gene regulatory networks, is of paramount importance to systems biology and bioinformatics studies. Accurate methods to address this problem can ultimately provide a deeper insight into the complexity, behavior, and functions of the underlying biological systems. However, the large number of interacting genes coupled with short and often noisy time-resolved read-outs of the system renders the reverse engineering a challenging task. Therefore, the development and assessment of methods which are computationally efficient, robust against noise, applicable to short time series data, and preferably capable of reconstructing the directionality of the regulatory interactions remains a pressing research problem with valuable applications. RESULTS: Here we perform the largest systematic analysis of a set of similarity measures and scoring schemes within the scope of the relevance network approach which are commonly used for gene regulatory network reconstruction from time series data. In addition, we define and analyze several novel measures and schemes which are particularly suitable for short transcriptomics time series. We also compare the considered 21 measures and 6 scoring schemes according to their ability to correctly reconstruct such networks from short time series data by calculating summary statistics based on the corresponding specificity and sensitivity. Our results demonstrate that rank and symbol based measures have the highest performance in inferring regulatory interactions. In addition, the proposed scoring scheme by asymmetric weighting has shown to be valuable in reducing the number of false positive interactions. On the other hand, Granger causality as well as information-theoretic measures, frequently used in inference of regulatory networks, show low performance on the short time series analyzed in this study. CONCLUSIONS: Our study is intended to serve as a guide for choosing a particular combination of similarity measures and scoring schemes suitable for reconstruction of gene regulatory networks from short time series data. We show that further improvement of algorithms for reverse engineering can be obtained if one considers measures that are rooted in the study of symbolic dynamics or ranks, in contrast to the application of common similarity measures which do not consider the temporal character of the employed data. Moreover, we establish that the asymmetric weighting scoring scheme together with symbol based measures (for low noise level) and rank based measures (for high noise level) are the most suitable choices. BioMed Central 2011-07-19 /pmc/articles/PMC3161045/ /pubmed/21771321 http://dx.doi.org/10.1186/1471-2105-12-292 Text en Copyright ©2011 Hempel et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Hempel, Sabrina
Koseska, Aneta
Nikoloski, Zoran
Kurths, Jürgen
Unraveling gene regulatory networks from time-resolved gene expression data -- a measures comparison study
title Unraveling gene regulatory networks from time-resolved gene expression data -- a measures comparison study
title_full Unraveling gene regulatory networks from time-resolved gene expression data -- a measures comparison study
title_fullStr Unraveling gene regulatory networks from time-resolved gene expression data -- a measures comparison study
title_full_unstemmed Unraveling gene regulatory networks from time-resolved gene expression data -- a measures comparison study
title_short Unraveling gene regulatory networks from time-resolved gene expression data -- a measures comparison study
title_sort unraveling gene regulatory networks from time-resolved gene expression data -- a measures comparison study
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3161045/
https://www.ncbi.nlm.nih.gov/pubmed/21771321
http://dx.doi.org/10.1186/1471-2105-12-292
work_keys_str_mv AT hempelsabrina unravelinggeneregulatorynetworksfromtimeresolvedgeneexpressiondataameasurescomparisonstudy
AT koseskaaneta unravelinggeneregulatorynetworksfromtimeresolvedgeneexpressiondataameasurescomparisonstudy
AT nikoloskizoran unravelinggeneregulatorynetworksfromtimeresolvedgeneexpressiondataameasurescomparisonstudy
AT kurthsjurgen unravelinggeneregulatorynetworksfromtimeresolvedgeneexpressiondataameasurescomparisonstudy