Cargando…
The persistent homology of genealogical networks
Genealogical networks (i.e. family trees) are of growing interest, with the largest known data sets now including well over one billion individuals. Interest in family history also supports an 8.5 billion dollar industry whose size is projected to double within 7 years [FutureWise report HC-1137]. Y...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9950181/ https://www.ncbi.nlm.nih.gov/pubmed/36852178 http://dx.doi.org/10.1007/s41109-023-00538-7 |
_version_ | 1784893108593360896 |
---|---|
author | Boyd, Zachary M. Callor, Nick Gledhill, Taylor Jenkins, Abigail Snellman, Robert Webb, Benjamin Wonnacott, Raelynn |
author_facet | Boyd, Zachary M. Callor, Nick Gledhill, Taylor Jenkins, Abigail Snellman, Robert Webb, Benjamin Wonnacott, Raelynn |
author_sort | Boyd, Zachary M. |
collection | PubMed |
description | Genealogical networks (i.e. family trees) are of growing interest, with the largest known data sets now including well over one billion individuals. Interest in family history also supports an 8.5 billion dollar industry whose size is projected to double within 7 years [FutureWise report HC-1137]. Yet little mathematical attention has been paid to the complex network properties of genealogical networks, especially at large scales. The structure of genealogical networks is of particular interest due to the practice of forming unions, e.g. marriages, that are typically well outside one’s immediate family. In most other networks, including other social networks, no equivalent restriction exists on the distance at which relationships form. To study the effect this has on genealogical networks we use persistent homology to identify and compare the structure of 101 genealogical and 31 other social networks. Specifically, we introduce the notion of a network’s persistence curve, which encodes the network’s set of persistence intervals. We find that the persistence curves of genealogical networks have a distinct structure when compared to other social networks. This difference in structure also extends to subnetworks of genealogical and social networks suggesting that, even with incomplete data, persistent homology can be used to meaningfully analyze genealogical networks. Here we also describe how concepts from genealogical networks, such as common ancestor cycles, are represented using persistent homology. We expect that persistent homology tools will become increasingly important in genealogical exploration as popular interest in ancestry research continues to expand. |
format | Online Article Text |
id | pubmed-9950181 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-99501812023-02-25 The persistent homology of genealogical networks Boyd, Zachary M. Callor, Nick Gledhill, Taylor Jenkins, Abigail Snellman, Robert Webb, Benjamin Wonnacott, Raelynn Appl Netw Sci Research Genealogical networks (i.e. family trees) are of growing interest, with the largest known data sets now including well over one billion individuals. Interest in family history also supports an 8.5 billion dollar industry whose size is projected to double within 7 years [FutureWise report HC-1137]. Yet little mathematical attention has been paid to the complex network properties of genealogical networks, especially at large scales. The structure of genealogical networks is of particular interest due to the practice of forming unions, e.g. marriages, that are typically well outside one’s immediate family. In most other networks, including other social networks, no equivalent restriction exists on the distance at which relationships form. To study the effect this has on genealogical networks we use persistent homology to identify and compare the structure of 101 genealogical and 31 other social networks. Specifically, we introduce the notion of a network’s persistence curve, which encodes the network’s set of persistence intervals. We find that the persistence curves of genealogical networks have a distinct structure when compared to other social networks. This difference in structure also extends to subnetworks of genealogical and social networks suggesting that, even with incomplete data, persistent homology can be used to meaningfully analyze genealogical networks. Here we also describe how concepts from genealogical networks, such as common ancestor cycles, are represented using persistent homology. We expect that persistent homology tools will become increasingly important in genealogical exploration as popular interest in ancestry research continues to expand. Springer International Publishing 2023-02-23 2023 /pmc/articles/PMC9950181/ /pubmed/36852178 http://dx.doi.org/10.1007/s41109-023-00538-7 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Research Boyd, Zachary M. Callor, Nick Gledhill, Taylor Jenkins, Abigail Snellman, Robert Webb, Benjamin Wonnacott, Raelynn The persistent homology of genealogical networks |
title | The persistent homology of genealogical networks |
title_full | The persistent homology of genealogical networks |
title_fullStr | The persistent homology of genealogical networks |
title_full_unstemmed | The persistent homology of genealogical networks |
title_short | The persistent homology of genealogical networks |
title_sort | persistent homology of genealogical networks |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9950181/ https://www.ncbi.nlm.nih.gov/pubmed/36852178 http://dx.doi.org/10.1007/s41109-023-00538-7 |
work_keys_str_mv | AT boydzacharym thepersistenthomologyofgenealogicalnetworks AT callornick thepersistenthomologyofgenealogicalnetworks AT gledhilltaylor thepersistenthomologyofgenealogicalnetworks AT jenkinsabigail thepersistenthomologyofgenealogicalnetworks AT snellmanrobert thepersistenthomologyofgenealogicalnetworks AT webbbenjamin thepersistenthomologyofgenealogicalnetworks AT wonnacottraelynn thepersistenthomologyofgenealogicalnetworks AT boydzacharym persistenthomologyofgenealogicalnetworks AT callornick persistenthomologyofgenealogicalnetworks AT gledhilltaylor persistenthomologyofgenealogicalnetworks AT jenkinsabigail persistenthomologyofgenealogicalnetworks AT snellmanrobert persistenthomologyofgenealogicalnetworks AT webbbenjamin persistenthomologyofgenealogicalnetworks AT wonnacottraelynn persistenthomologyofgenealogicalnetworks |