Cargando…
Optimising orbit counting of arbitrary order by equation selection
BACKGROUND: Graphlets are useful for bioinformatics network analysis. Based on the structure of Hočevar and Demšar’s ORCA algorithm, we have created an orbit counting algorithm, named Jesse. This algorithm, like ORCA, uses equations to count the orbits, but unlike ORCA it can count graphlets of any...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6334470/ https://www.ncbi.nlm.nih.gov/pubmed/30646859 http://dx.doi.org/10.1186/s12859-018-2483-9 |
_version_ | 1783387724102762496 |
---|---|
author | Melckenbeeck, Ine Audenaert, Pieter Van Parys, Thomas Van De Peer, Yves Colle, Didier Pickavet, Mario |
author_facet | Melckenbeeck, Ine Audenaert, Pieter Van Parys, Thomas Van De Peer, Yves Colle, Didier Pickavet, Mario |
author_sort | Melckenbeeck, Ine |
collection | PubMed |
description | BACKGROUND: Graphlets are useful for bioinformatics network analysis. Based on the structure of Hočevar and Demšar’s ORCA algorithm, we have created an orbit counting algorithm, named Jesse. This algorithm, like ORCA, uses equations to count the orbits, but unlike ORCA it can count graphlets of any order. To do so, it generates the required internal structures and equations automatically. Many more redundant equations are generated, however, and Jesse’s running time is highly dependent on which of these equations are used. Therefore, this paper aims to investigate which equations are most efficient, and which factors have an effect on this efficiency. RESULTS: With appropriate equation selection, Jesse’s running time may be reduced by a factor of up to 2 in the best case, compared to using randomly selected equations. Which equations are most efficient depends on the density of the graph, but barely on the graph type. At low graph density, equations with terms in their right-hand side with few arguments are more efficient, whereas at high density, equations with terms with many arguments in the right-hand side are most efficient. At a density between 0.6 and 0.7, both types of equations are about equally efficient. CONCLUSIONS: Our Jesse algorithm became up to a factor 2 more efficient, by automatically selecting the best equations based on graph density. It was adapted into a Cytoscape App that is freely available from the Cytoscape App Store to ease application by bioinformaticians. |
format | Online Article Text |
id | pubmed-6334470 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-63344702019-01-23 Optimising orbit counting of arbitrary order by equation selection Melckenbeeck, Ine Audenaert, Pieter Van Parys, Thomas Van De Peer, Yves Colle, Didier Pickavet, Mario BMC Bioinformatics Software BACKGROUND: Graphlets are useful for bioinformatics network analysis. Based on the structure of Hočevar and Demšar’s ORCA algorithm, we have created an orbit counting algorithm, named Jesse. This algorithm, like ORCA, uses equations to count the orbits, but unlike ORCA it can count graphlets of any order. To do so, it generates the required internal structures and equations automatically. Many more redundant equations are generated, however, and Jesse’s running time is highly dependent on which of these equations are used. Therefore, this paper aims to investigate which equations are most efficient, and which factors have an effect on this efficiency. RESULTS: With appropriate equation selection, Jesse’s running time may be reduced by a factor of up to 2 in the best case, compared to using randomly selected equations. Which equations are most efficient depends on the density of the graph, but barely on the graph type. At low graph density, equations with terms in their right-hand side with few arguments are more efficient, whereas at high density, equations with terms with many arguments in the right-hand side are most efficient. At a density between 0.6 and 0.7, both types of equations are about equally efficient. CONCLUSIONS: Our Jesse algorithm became up to a factor 2 more efficient, by automatically selecting the best equations based on graph density. It was adapted into a Cytoscape App that is freely available from the Cytoscape App Store to ease application by bioinformaticians. BioMed Central 2019-01-15 /pmc/articles/PMC6334470/ /pubmed/30646859 http://dx.doi.org/10.1186/s12859-018-2483-9 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Melckenbeeck, Ine Audenaert, Pieter Van Parys, Thomas Van De Peer, Yves Colle, Didier Pickavet, Mario Optimising orbit counting of arbitrary order by equation selection |
title | Optimising orbit counting of arbitrary order by equation selection |
title_full | Optimising orbit counting of arbitrary order by equation selection |
title_fullStr | Optimising orbit counting of arbitrary order by equation selection |
title_full_unstemmed | Optimising orbit counting of arbitrary order by equation selection |
title_short | Optimising orbit counting of arbitrary order by equation selection |
title_sort | optimising orbit counting of arbitrary order by equation selection |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6334470/ https://www.ncbi.nlm.nih.gov/pubmed/30646859 http://dx.doi.org/10.1186/s12859-018-2483-9 |
work_keys_str_mv | AT melckenbeeckine optimisingorbitcountingofarbitraryorderbyequationselection AT audenaertpieter optimisingorbitcountingofarbitraryorderbyequationselection AT vanparysthomas optimisingorbitcountingofarbitraryorderbyequationselection AT vandepeeryves optimisingorbitcountingofarbitraryorderbyequationselection AT colledidier optimisingorbitcountingofarbitraryorderbyequationselection AT pickavetmario optimisingorbitcountingofarbitraryorderbyequationselection |