Cargando…

Optimising orbit counting of arbitrary order by equation selection

BACKGROUND: Graphlets are useful for bioinformatics network analysis. Based on the structure of Hočevar and Demšar’s ORCA algorithm, we have created an orbit counting algorithm, named Jesse. This algorithm, like ORCA, uses equations to count the orbits, but unlike ORCA it can count graphlets of any...

Descripción completa

Detalles Bibliográficos
Autores principales: Melckenbeeck, Ine, Audenaert, Pieter, Van Parys, Thomas, Van De Peer, Yves, Colle, Didier, Pickavet, Mario
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6334470/
https://www.ncbi.nlm.nih.gov/pubmed/30646859
http://dx.doi.org/10.1186/s12859-018-2483-9
_version_ 1783387724102762496
author Melckenbeeck, Ine
Audenaert, Pieter
Van Parys, Thomas
Van De Peer, Yves
Colle, Didier
Pickavet, Mario
author_facet Melckenbeeck, Ine
Audenaert, Pieter
Van Parys, Thomas
Van De Peer, Yves
Colle, Didier
Pickavet, Mario
author_sort Melckenbeeck, Ine
collection PubMed
description BACKGROUND: Graphlets are useful for bioinformatics network analysis. Based on the structure of Hočevar and Demšar’s ORCA algorithm, we have created an orbit counting algorithm, named Jesse. This algorithm, like ORCA, uses equations to count the orbits, but unlike ORCA it can count graphlets of any order. To do so, it generates the required internal structures and equations automatically. Many more redundant equations are generated, however, and Jesse’s running time is highly dependent on which of these equations are used. Therefore, this paper aims to investigate which equations are most efficient, and which factors have an effect on this efficiency. RESULTS: With appropriate equation selection, Jesse’s running time may be reduced by a factor of up to 2 in the best case, compared to using randomly selected equations. Which equations are most efficient depends on the density of the graph, but barely on the graph type. At low graph density, equations with terms in their right-hand side with few arguments are more efficient, whereas at high density, equations with terms with many arguments in the right-hand side are most efficient. At a density between 0.6 and 0.7, both types of equations are about equally efficient. CONCLUSIONS: Our Jesse algorithm became up to a factor 2 more efficient, by automatically selecting the best equations based on graph density. It was adapted into a Cytoscape App that is freely available from the Cytoscape App Store to ease application by bioinformaticians.
format Online
Article
Text
id pubmed-6334470
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63344702019-01-23 Optimising orbit counting of arbitrary order by equation selection Melckenbeeck, Ine Audenaert, Pieter Van Parys, Thomas Van De Peer, Yves Colle, Didier Pickavet, Mario BMC Bioinformatics Software BACKGROUND: Graphlets are useful for bioinformatics network analysis. Based on the structure of Hočevar and Demšar’s ORCA algorithm, we have created an orbit counting algorithm, named Jesse. This algorithm, like ORCA, uses equations to count the orbits, but unlike ORCA it can count graphlets of any order. To do so, it generates the required internal structures and equations automatically. Many more redundant equations are generated, however, and Jesse’s running time is highly dependent on which of these equations are used. Therefore, this paper aims to investigate which equations are most efficient, and which factors have an effect on this efficiency. RESULTS: With appropriate equation selection, Jesse’s running time may be reduced by a factor of up to 2 in the best case, compared to using randomly selected equations. Which equations are most efficient depends on the density of the graph, but barely on the graph type. At low graph density, equations with terms in their right-hand side with few arguments are more efficient, whereas at high density, equations with terms with many arguments in the right-hand side are most efficient. At a density between 0.6 and 0.7, both types of equations are about equally efficient. CONCLUSIONS: Our Jesse algorithm became up to a factor 2 more efficient, by automatically selecting the best equations based on graph density. It was adapted into a Cytoscape App that is freely available from the Cytoscape App Store to ease application by bioinformaticians. BioMed Central 2019-01-15 /pmc/articles/PMC6334470/ /pubmed/30646859 http://dx.doi.org/10.1186/s12859-018-2483-9 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Melckenbeeck, Ine
Audenaert, Pieter
Van Parys, Thomas
Van De Peer, Yves
Colle, Didier
Pickavet, Mario
Optimising orbit counting of arbitrary order by equation selection
title Optimising orbit counting of arbitrary order by equation selection
title_full Optimising orbit counting of arbitrary order by equation selection
title_fullStr Optimising orbit counting of arbitrary order by equation selection
title_full_unstemmed Optimising orbit counting of arbitrary order by equation selection
title_short Optimising orbit counting of arbitrary order by equation selection
title_sort optimising orbit counting of arbitrary order by equation selection
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6334470/
https://www.ncbi.nlm.nih.gov/pubmed/30646859
http://dx.doi.org/10.1186/s12859-018-2483-9
work_keys_str_mv AT melckenbeeckine optimisingorbitcountingofarbitraryorderbyequationselection
AT audenaertpieter optimisingorbitcountingofarbitraryorderbyequationselection
AT vanparysthomas optimisingorbitcountingofarbitraryorderbyequationselection
AT vandepeeryves optimisingorbitcountingofarbitraryorderbyequationselection
AT colledidier optimisingorbitcountingofarbitraryorderbyequationselection
AT pickavetmario optimisingorbitcountingofarbitraryorderbyequationselection