Cargando…

HCGA: Highly comparative graph analysis for network phenotyping

Networks are widely used as mathematical models of complex systems across many scientific disciplines. Decades of work have produced a vast corpus of research characterizing the topological, combinatorial, statistical, and spectral properties of graphs. Each graph property can be thought of as a fea...

Descripción completa

Detalles Bibliográficos
Autores principales: Peach, Robert L., Arnaudon, Alexis, Schmidt, Julia A., Palasciano, Henry A., Bernier, Nathan R., Jelfs, Kim E., Yaliraki, Sophia N., Barahona, Mauricio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8085611/
https://www.ncbi.nlm.nih.gov/pubmed/33982022
http://dx.doi.org/10.1016/j.patter.2021.100227
_version_ 1783686378899374080
author Peach, Robert L.
Arnaudon, Alexis
Schmidt, Julia A.
Palasciano, Henry A.
Bernier, Nathan R.
Jelfs, Kim E.
Yaliraki, Sophia N.
Barahona, Mauricio
author_facet Peach, Robert L.
Arnaudon, Alexis
Schmidt, Julia A.
Palasciano, Henry A.
Bernier, Nathan R.
Jelfs, Kim E.
Yaliraki, Sophia N.
Barahona, Mauricio
author_sort Peach, Robert L.
collection PubMed
description Networks are widely used as mathematical models of complex systems across many scientific disciplines. Decades of work have produced a vast corpus of research characterizing the topological, combinatorial, statistical, and spectral properties of graphs. Each graph property can be thought of as a feature that captures important (and sometimes overlapping) characteristics of a network. In this paper, we introduce HCGA, a framework for highly comparative analysis of graph datasets that computes several thousands of graph features from any given network. HCGA also offers a suite of statistical learning and data analysis tools for automated identification and selection of important and interpretable features underpinning the characterization of graph datasets. We show that HCGA outperforms other methodologies on supervised classification tasks on benchmark datasets while retaining the interpretability of network features. We exemplify HCGA by predicting the charge transfer in organic semiconductors and clustering a dataset of neuronal morphology images.
format Online
Article
Text
id pubmed-8085611
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-80856112021-05-11 HCGA: Highly comparative graph analysis for network phenotyping Peach, Robert L. Arnaudon, Alexis Schmidt, Julia A. Palasciano, Henry A. Bernier, Nathan R. Jelfs, Kim E. Yaliraki, Sophia N. Barahona, Mauricio Patterns (N Y) Descriptor Networks are widely used as mathematical models of complex systems across many scientific disciplines. Decades of work have produced a vast corpus of research characterizing the topological, combinatorial, statistical, and spectral properties of graphs. Each graph property can be thought of as a feature that captures important (and sometimes overlapping) characteristics of a network. In this paper, we introduce HCGA, a framework for highly comparative analysis of graph datasets that computes several thousands of graph features from any given network. HCGA also offers a suite of statistical learning and data analysis tools for automated identification and selection of important and interpretable features underpinning the characterization of graph datasets. We show that HCGA outperforms other methodologies on supervised classification tasks on benchmark datasets while retaining the interpretability of network features. We exemplify HCGA by predicting the charge transfer in organic semiconductors and clustering a dataset of neuronal morphology images. Elsevier 2021-04-02 /pmc/articles/PMC8085611/ /pubmed/33982022 http://dx.doi.org/10.1016/j.patter.2021.100227 Text en © 2021 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Descriptor
Peach, Robert L.
Arnaudon, Alexis
Schmidt, Julia A.
Palasciano, Henry A.
Bernier, Nathan R.
Jelfs, Kim E.
Yaliraki, Sophia N.
Barahona, Mauricio
HCGA: Highly comparative graph analysis for network phenotyping
title HCGA: Highly comparative graph analysis for network phenotyping
title_full HCGA: Highly comparative graph analysis for network phenotyping
title_fullStr HCGA: Highly comparative graph analysis for network phenotyping
title_full_unstemmed HCGA: Highly comparative graph analysis for network phenotyping
title_short HCGA: Highly comparative graph analysis for network phenotyping
title_sort hcga: highly comparative graph analysis for network phenotyping
topic Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8085611/
https://www.ncbi.nlm.nih.gov/pubmed/33982022
http://dx.doi.org/10.1016/j.patter.2021.100227
work_keys_str_mv AT peachrobertl hcgahighlycomparativegraphanalysisfornetworkphenotyping
AT arnaudonalexis hcgahighlycomparativegraphanalysisfornetworkphenotyping
AT schmidtjuliaa hcgahighlycomparativegraphanalysisfornetworkphenotyping
AT palascianohenrya hcgahighlycomparativegraphanalysisfornetworkphenotyping
AT berniernathanr hcgahighlycomparativegraphanalysisfornetworkphenotyping
AT jelfskime hcgahighlycomparativegraphanalysisfornetworkphenotyping
AT yalirakisophian hcgahighlycomparativegraphanalysisfornetworkphenotyping
AT barahonamauricio hcgahighlycomparativegraphanalysisfornetworkphenotyping