A network-based approach to classify the three domains of life

BACKGROUND: Identifying group-specific characteristics in metabolic networks can provide better insight into evolutionary developments. Here, we present an approach to classify the three domains of life using topological information about the underlying metabolic networks. These networks have been s...

Bibliographic Details
Main authors: Mueller, Laurin AJ, Kugler, Karl G, Netzer, Michael, Graber, Armin, Dehmer, Matthias
Format: Online Article Text
Language: English
Published: BioMed Central 2011
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3226542/
https://www.ncbi.nlm.nih.gov/pubmed/21995640
http://dx.doi.org/10.1186/1745-6150-6-53
_version_ 1782217636936941568
author Mueller, Laurin AJ
Kugler, Karl G
Netzer, Michael
Graber, Armin
Dehmer, Matthias
author_sort Mueller, Laurin AJ
collection PubMed
description BACKGROUND: Identifying group-specific characteristics in metabolic networks can provide better insight into evolutionary developments. Here, we present an approach to classify the three domains of life using topological information about the underlying metabolic networks. These networks have been shown to share domain-independent structural similarities, which pose a special challenge for our endeavour. We quantify specific structural information by using topological network descriptors to classify this set of metabolic networks. Such measures quantify the structural complexity of the underlying networks. In this study, we use such measures to capture domain-specific structural features of the metabolic networks to classify the data set. So far, it has been a challenging undertaking to examine what kind of structural complexity such measures do detect. In this paper, we apply two groups of topological network descriptors to metabolic networks and evaluate their classification performance. Moreover, we combine the two groups to perform a feature selection to estimate the structural features with the highest classification ability in order to optimize the classification performance. RESULTS: By combining the two groups, we can identify seven topological network descriptors that show a group-specific characteristic by ANOVA. A multivariate analysis using feature selection and supervised machine learning leads to a reasonable classification performance with a weighted F-score of 83.7% and an accuracy of 83.9%. We further demonstrate that our approach outperforms alternative methods. Also, our results reveal that entropy-based descriptors show the highest classification ability for this set of networks. CONCLUSIONS: Our results show that these particular topological network descriptors are able to capture domain-specific structural characteristics for classifying metabolic networks between the three domains of life.
format Online
Article
Text
id pubmed-3226542
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32265422011-11-30 A network-based approach to classify the three domains of life Mueller, Laurin AJ Kugler, Karl G Netzer, Michael Graber, Armin Dehmer, Matthias Biol Direct Research BACKGROUND: Identifying group-specific characteristics in metabolic networks can provide better insight into evolutionary developments. Here, we present an approach to classify the three domains of life using topological information about the underlying metabolic networks. These networks have been shown to share domain-independent structural similarities, which pose a special challenge for our endeavour. We quantify specific structural information by using topological network descriptors to classify this set of metabolic networks. Such measures quantify the structural complexity of the underlying networks. In this study, we use such measures to capture domain-specific structural features of the metabolic networks to classify the data set. So far, it has been a challenging undertaking to examine what kind of structural complexity such measures do detect. In this paper, we apply two groups of topological network descriptors to metabolic networks and evaluate their classification performance. Moreover, we combine the two groups to perform a feature selection to estimate the structural features with the highest classification ability in order to optimize the classification performance. RESULTS: By combining the two groups, we can identify seven topological network descriptors that show a group-specific characteristic by ANOVA. A multivariate analysis using feature selection and supervised machine learning leads to a reasonable classification performance with a weighted F-score of 83.7% and an accuracy of 83.9%. We further demonstrate that our approach outperforms alternative methods. Also, our results reveal that entropy-based descriptors show the highest classification ability for this set of networks. 
CONCLUSIONS: Our results show that these particular topological network descriptors are able to capture domain-specific structural characteristics for classifying metabolic networks between the three domains of life. BioMed Central 2011-10-13 /pmc/articles/PMC3226542/ /pubmed/21995640 http://dx.doi.org/10.1186/1745-6150-6-53 Text en Copyright ©2011 Mueller et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Mueller, Laurin AJ
Kugler, Karl G
Netzer, Michael
Graber, Armin
Dehmer, Matthias
A network-based approach to classify the three domains of life
title A network-based approach to classify the three domains of life
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3226542/
https://www.ncbi.nlm.nih.gov/pubmed/21995640
http://dx.doi.org/10.1186/1745-6150-6-53