Cargando…
Comparing alternatives to the fixed degree sequence model for extracting the backbone of bipartite projections
Projections of bipartite or two-mode networks capture co-occurrences, and are used in diverse fields (e.g., ecology, economics, bibliometrics, politics) to represent unipartite networks. A key challenge in analyzing such networks is determining whether an observed number of co-occurrences between tw...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8671427/ https://www.ncbi.nlm.nih.gov/pubmed/34907253 http://dx.doi.org/10.1038/s41598-021-03238-3 |
_version_ | 1784615134043308032 |
---|---|
author | Neal, Zachary P. Domagalski, Rachel Sagan, Bruce |
author_facet | Neal, Zachary P. Domagalski, Rachel Sagan, Bruce |
author_sort | Neal, Zachary P. |
collection | PubMed |
description | Projections of bipartite or two-mode networks capture co-occurrences, and are used in diverse fields (e.g., ecology, economics, bibliometrics, politics) to represent unipartite networks. A key challenge in analyzing such networks is determining whether an observed number of co-occurrences between two nodes is significant, and therefore whether an edge exists between them. One approach, the fixed degree sequence model (FDSM), evaluates the significance of an edge’s weight by comparison to a null model in which the degree sequences of the original bipartite network are fixed. Although the FDSM is an intuitive null model, it is computationally expensive because it requires Monte Carlo simulation to estimate each edge’s p value, and therefore is impractical for large projections. In this paper, we explore four potential alternatives to FDSM: fixed fill model, fixed row model, fixed column model, and stochastic degree sequence model (SDSM). We compare these models to FDSM in terms of accuracy, speed, statistical power, similarity, and ability to recover known communities. We find that the computationally-fast SDSM offers a statistically conservative but close approximation of the computationally-impractical FDSM under a wide range of conditions, and that it correctly recovers a known community structure even when the signal is weak. Therefore, although each backbone model may have particular applications, we recommend SDSM for extracting the backbone of bipartite projections when FDSM is impractical. |
format | Online Article Text |
id | pubmed-8671427 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-86714272021-12-16 Comparing alternatives to the fixed degree sequence model for extracting the backbone of bipartite projections Neal, Zachary P. Domagalski, Rachel Sagan, Bruce Sci Rep Article Projections of bipartite or two-mode networks capture co-occurrences, and are used in diverse fields (e.g., ecology, economics, bibliometrics, politics) to represent unipartite networks. A key challenge in analyzing such networks is determining whether an observed number of co-occurrences between two nodes is significant, and therefore whether an edge exists between them. One approach, the fixed degree sequence model (FDSM), evaluates the significance of an edge’s weight by comparison to a null model in which the degree sequences of the original bipartite network are fixed. Although the FDSM is an intuitive null model, it is computationally expensive because it requires Monte Carlo simulation to estimate each edge’s p value, and therefore is impractical for large projections. In this paper, we explore four potential alternatives to FDSM: fixed fill model, fixed row model, fixed column model, and stochastic degree sequence model (SDSM). We compare these models to FDSM in terms of accuracy, speed, statistical power, similarity, and ability to recover known communities. We find that the computationally-fast SDSM offers a statistically conservative but close approximation of the computationally-impractical FDSM under a wide range of conditions, and that it correctly recovers a known community structure even when the signal is weak. Therefore, although each backbone model may have particular applications, we recommend SDSM for extracting the backbone of bipartite projections when FDSM is impractical. Nature Publishing Group UK 2021-12-14 /pmc/articles/PMC8671427/ /pubmed/34907253 http://dx.doi.org/10.1038/s41598-021-03238-3 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Neal, Zachary P. Domagalski, Rachel Sagan, Bruce Comparing alternatives to the fixed degree sequence model for extracting the backbone of bipartite projections |
title | Comparing alternatives to the fixed degree sequence model for extracting the backbone of bipartite projections |
title_full | Comparing alternatives to the fixed degree sequence model for extracting the backbone of bipartite projections |
title_fullStr | Comparing alternatives to the fixed degree sequence model for extracting the backbone of bipartite projections |
title_full_unstemmed | Comparing alternatives to the fixed degree sequence model for extracting the backbone of bipartite projections |
title_short | Comparing alternatives to the fixed degree sequence model for extracting the backbone of bipartite projections |
title_sort | comparing alternatives to the fixed degree sequence model for extracting the backbone of bipartite projections |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8671427/ https://www.ncbi.nlm.nih.gov/pubmed/34907253 http://dx.doi.org/10.1038/s41598-021-03238-3 |
work_keys_str_mv | AT nealzacharyp comparingalternativestothefixeddegreesequencemodelforextractingthebackboneofbipartiteprojections AT domagalskirachel comparingalternativestothefixeddegreesequencemodelforextractingthebackboneofbipartiteprojections AT saganbruce comparingalternativestothefixeddegreesequencemodelforextractingthebackboneofbipartiteprojections |