Cargando…

The persistent homology of genealogical networks

Genealogical networks (i.e. family trees) are of growing interest, with the largest known data sets now including well over one billion individuals. Interest in family history also supports an 8.5 billion dollar industry whose size is projected to double within 7 years [FutureWise report HC-1137]. Y...

Descripción completa

Detalles Bibliográficos
Autores principales: Boyd, Zachary M., Callor, Nick, Gledhill, Taylor, Jenkins, Abigail, Snellman, Robert, Webb, Benjamin, Wonnacott, Raelynn
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9950181/
https://www.ncbi.nlm.nih.gov/pubmed/36852178
http://dx.doi.org/10.1007/s41109-023-00538-7
_version_ 1784893108593360896
author Boyd, Zachary M.
Callor, Nick
Gledhill, Taylor
Jenkins, Abigail
Snellman, Robert
Webb, Benjamin
Wonnacott, Raelynn
author_facet Boyd, Zachary M.
Callor, Nick
Gledhill, Taylor
Jenkins, Abigail
Snellman, Robert
Webb, Benjamin
Wonnacott, Raelynn
author_sort Boyd, Zachary M.
collection PubMed
description Genealogical networks (i.e. family trees) are of growing interest, with the largest known data sets now including well over one billion individuals. Interest in family history also supports an 8.5 billion dollar industry whose size is projected to double within 7 years [FutureWise report HC-1137]. Yet little mathematical attention has been paid to the complex network properties of genealogical networks, especially at large scales. The structure of genealogical networks is of particular interest due to the practice of forming unions, e.g. marriages, that are typically well outside one’s immediate family. In most other networks, including other social networks, no equivalent restriction exists on the distance at which relationships form. To study the effect this has on genealogical networks we use persistent homology to identify and compare the structure of 101 genealogical and 31 other social networks. Specifically, we introduce the notion of a network’s persistence curve, which encodes the network’s set of persistence intervals. We find that the persistence curves of genealogical networks have a distinct structure when compared to other social networks. This difference in structure also extends to subnetworks of genealogical and social networks suggesting that, even with incomplete data, persistent homology can be used to meaningfully analyze genealogical networks. Here we also describe how concepts from genealogical networks, such as common ancestor cycles, are represented using persistent homology. We expect that persistent homology tools will become increasingly important in genealogical exploration as popular interest in ancestry research continues to expand.
format Online
Article
Text
id pubmed-9950181
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-99501812023-02-25 The persistent homology of genealogical networks Boyd, Zachary M. Callor, Nick Gledhill, Taylor Jenkins, Abigail Snellman, Robert Webb, Benjamin Wonnacott, Raelynn Appl Netw Sci Research Genealogical networks (i.e. family trees) are of growing interest, with the largest known data sets now including well over one billion individuals. Interest in family history also supports an 8.5 billion dollar industry whose size is projected to double within 7 years [FutureWise report HC-1137]. Yet little mathematical attention has been paid to the complex network properties of genealogical networks, especially at large scales. The structure of genealogical networks is of particular interest due to the practice of forming unions, e.g. marriages, that are typically well outside one’s immediate family. In most other networks, including other social networks, no equivalent restriction exists on the distance at which relationships form. To study the effect this has on genealogical networks we use persistent homology to identify and compare the structure of 101 genealogical and 31 other social networks. Specifically, we introduce the notion of a network’s persistence curve, which encodes the network’s set of persistence intervals. We find that the persistence curves of genealogical networks have a distinct structure when compared to other social networks. This difference in structure also extends to subnetworks of genealogical and social networks suggesting that, even with incomplete data, persistent homology can be used to meaningfully analyze genealogical networks. Here we also describe how concepts from genealogical networks, such as common ancestor cycles, are represented using persistent homology. We expect that persistent homology tools will become increasingly important in genealogical exploration as popular interest in ancestry research continues to expand. Springer International Publishing 2023-02-23 2023 /pmc/articles/PMC9950181/ /pubmed/36852178 http://dx.doi.org/10.1007/s41109-023-00538-7 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research
Boyd, Zachary M.
Callor, Nick
Gledhill, Taylor
Jenkins, Abigail
Snellman, Robert
Webb, Benjamin
Wonnacott, Raelynn
The persistent homology of genealogical networks
title The persistent homology of genealogical networks
title_full The persistent homology of genealogical networks
title_fullStr The persistent homology of genealogical networks
title_full_unstemmed The persistent homology of genealogical networks
title_short The persistent homology of genealogical networks
title_sort persistent homology of genealogical networks
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9950181/
https://www.ncbi.nlm.nih.gov/pubmed/36852178
http://dx.doi.org/10.1007/s41109-023-00538-7
work_keys_str_mv AT boydzacharym thepersistenthomologyofgenealogicalnetworks
AT callornick thepersistenthomologyofgenealogicalnetworks
AT gledhilltaylor thepersistenthomologyofgenealogicalnetworks
AT jenkinsabigail thepersistenthomologyofgenealogicalnetworks
AT snellmanrobert thepersistenthomologyofgenealogicalnetworks
AT webbbenjamin thepersistenthomologyofgenealogicalnetworks
AT wonnacottraelynn thepersistenthomologyofgenealogicalnetworks
AT boydzacharym persistenthomologyofgenealogicalnetworks
AT callornick persistenthomologyofgenealogicalnetworks
AT gledhilltaylor persistenthomologyofgenealogicalnetworks
AT jenkinsabigail persistenthomologyofgenealogicalnetworks
AT snellmanrobert persistenthomologyofgenealogicalnetworks
AT webbbenjamin persistenthomologyofgenealogicalnetworks
AT wonnacottraelynn persistenthomologyofgenealogicalnetworks