Cargando…

eSMC: a statistical model to infer admixture events from individual genomics data

BACKGROUND: Inferring historical population admixture events yield essential insights in understanding a species demographic history. Methods are available to infer admixture events in demographic history with extant genetic data from multiple sources. Due to the deficiency in ancient population gen...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yonghui, Zhao, Zicheng, Miao, Xinyao, Wang, Yinan, Qian, Xiaobo, Chen, Lingxi, Wang, Changfa, Li, Shuaicheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9748406/
https://www.ncbi.nlm.nih.gov/pubmed/36517735
http://dx.doi.org/10.1186/s12864-022-09033-2
_version_ 1784849815816896512
author Wang, Yonghui
Zhao, Zicheng
Miao, Xinyao
Wang, Yinan
Qian, Xiaobo
Chen, Lingxi
Wang, Changfa
Li, Shuaicheng
author_facet Wang, Yonghui
Zhao, Zicheng
Miao, Xinyao
Wang, Yinan
Qian, Xiaobo
Chen, Lingxi
Wang, Changfa
Li, Shuaicheng
author_sort Wang, Yonghui
collection PubMed
description BACKGROUND: Inferring historical population admixture events yield essential insights in understanding a species demographic history. Methods are available to infer admixture events in demographic history with extant genetic data from multiple sources. Due to the deficiency in ancient population genetic data, there lacks a method for admixture inference from a single source. Pairwise Sequentially Markovian Coalescent (PSMC) estimates the historical effective population size from lineage genomes of a single individual, based on the distribution of the most recent common ancestor between the diploid’s alleles. However, PSMC does not infer the admixture event. RESULTS: Here, we proposed eSMC, an extended PSMC model for admixture inference from a single source. We evaluated our model’s performance on both in silico data and real data. We simulated population admixture events at an admixture time range from 5 kya to 100 kya (5 years/generation) with population admix ratio at 1:1, 2:1, 3:1, and 4:1, respectively. The root means the square error is [Formula: see text] kya for all experiments. Then we implemented our method to infer the historical admixture events in human, donkey and goat populations. The estimated admixture time for both Han and Tibetan individuals range from 60 kya to 80 kya (25 years/generation), while the estimated admixture time for the domesticated donkeys and the goats ranged from 40 kya to 60 kya (8 years/generation) and 40 kya to 100 kya (6 years/generation), respectively. The estimated admixture times were concordance to the time that domestication occurred in human history. CONCLUSION: Our eSMC effectively infers the time of the most recent admixture event in history from a single individual’s genomics data. The source code of eSMC is hosted at https://github.com/zachary-zzc/eSMC.
format Online
Article
Text
id pubmed-9748406
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-97484062022-12-14 eSMC: a statistical model to infer admixture events from individual genomics data Wang, Yonghui Zhao, Zicheng Miao, Xinyao Wang, Yinan Qian, Xiaobo Chen, Lingxi Wang, Changfa Li, Shuaicheng BMC Genomics Software BACKGROUND: Inferring historical population admixture events yield essential insights in understanding a species demographic history. Methods are available to infer admixture events in demographic history with extant genetic data from multiple sources. Due to the deficiency in ancient population genetic data, there lacks a method for admixture inference from a single source. Pairwise Sequentially Markovian Coalescent (PSMC) estimates the historical effective population size from lineage genomes of a single individual, based on the distribution of the most recent common ancestor between the diploid’s alleles. However, PSMC does not infer the admixture event. RESULTS: Here, we proposed eSMC, an extended PSMC model for admixture inference from a single source. We evaluated our model’s performance on both in silico data and real data. We simulated population admixture events at an admixture time range from 5 kya to 100 kya (5 years/generation) with population admix ratio at 1:1, 2:1, 3:1, and 4:1, respectively. The root means the square error is [Formula: see text] kya for all experiments. Then we implemented our method to infer the historical admixture events in human, donkey and goat populations. The estimated admixture time for both Han and Tibetan individuals range from 60 kya to 80 kya (25 years/generation), while the estimated admixture time for the domesticated donkeys and the goats ranged from 40 kya to 60 kya (8 years/generation) and 40 kya to 100 kya (6 years/generation), respectively. The estimated admixture times were concordance to the time that domestication occurred in human history. CONCLUSION: Our eSMC effectively infers the time of the most recent admixture event in history from a single individual’s genomics data. The source code of eSMC is hosted at https://github.com/zachary-zzc/eSMC. BioMed Central 2022-12-14 /pmc/articles/PMC9748406/ /pubmed/36517735 http://dx.doi.org/10.1186/s12864-022-09033-2 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Wang, Yonghui
Zhao, Zicheng
Miao, Xinyao
Wang, Yinan
Qian, Xiaobo
Chen, Lingxi
Wang, Changfa
Li, Shuaicheng
eSMC: a statistical model to infer admixture events from individual genomics data
title eSMC: a statistical model to infer admixture events from individual genomics data
title_full eSMC: a statistical model to infer admixture events from individual genomics data
title_fullStr eSMC: a statistical model to infer admixture events from individual genomics data
title_full_unstemmed eSMC: a statistical model to infer admixture events from individual genomics data
title_short eSMC: a statistical model to infer admixture events from individual genomics data
title_sort esmc: a statistical model to infer admixture events from individual genomics data
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9748406/
https://www.ncbi.nlm.nih.gov/pubmed/36517735
http://dx.doi.org/10.1186/s12864-022-09033-2
work_keys_str_mv AT wangyonghui esmcastatisticalmodeltoinferadmixtureeventsfromindividualgenomicsdata
AT zhaozicheng esmcastatisticalmodeltoinferadmixtureeventsfromindividualgenomicsdata
AT miaoxinyao esmcastatisticalmodeltoinferadmixtureeventsfromindividualgenomicsdata
AT wangyinan esmcastatisticalmodeltoinferadmixtureeventsfromindividualgenomicsdata
AT qianxiaobo esmcastatisticalmodeltoinferadmixtureeventsfromindividualgenomicsdata
AT chenlingxi esmcastatisticalmodeltoinferadmixtureeventsfromindividualgenomicsdata
AT wangchangfa esmcastatisticalmodeltoinferadmixtureeventsfromindividualgenomicsdata
AT lishuaicheng esmcastatisticalmodeltoinferadmixtureeventsfromindividualgenomicsdata