Cargando…
Moment estimators of relatedness from low-depth whole-genome sequencing data
BACKGROUND: Estimating relatedness is an important step for many genetic study designs. A variety of methods for estimating coefficients of pairwise relatedness from genotype data have been proposed. Both the kinship coefficient [Formula: see text] and the fraternity coefficient [Formula: see text]...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9233360/ https://www.ncbi.nlm.nih.gov/pubmed/35751014 http://dx.doi.org/10.1186/s12859-022-04795-8 |
_version_ | 1784735746815426560 |
---|---|
author | Herzig, Anthony F. Ciullo, M. Leutenegger, A-L. Perdry, H. |
author_facet | Herzig, Anthony F. Ciullo, M. Leutenegger, A-L. Perdry, H. |
author_sort | Herzig, Anthony F. |
collection | PubMed |
description | BACKGROUND: Estimating relatedness is an important step for many genetic study designs. A variety of methods for estimating coefficients of pairwise relatedness from genotype data have been proposed. Both the kinship coefficient [Formula: see text] and the fraternity coefficient [Formula: see text] for all pairs of individuals are of interest. However, when dealing with low-depth sequencing or imputation data, individual level genotypes cannot be confidently called. To ignore such uncertainty is known to result in biased estimates. Accordingly, methods have recently been developed to estimate kinship from uncertain genotypes. RESULTS: We present new method-of-moment estimators of both the coefficients [Formula: see text] and [Formula: see text] calculated directly from genotype likelihoods. We have simulated low-depth genetic data for a sample of individuals with extensive relatedness by using the complex pedigree of the known genetic isolates of Cilento in South Italy. Through this simulation, we explore the behaviour of our estimators, demonstrate their properties, and show advantages over alternative methods. A demonstration of our method is given for a sample of 150 French individuals with down-sampled sequencing data. CONCLUSIONS: We find that our method can provide accurate relatedness estimates whilst holding advantages over existing methods in terms of robustness, independence from external software, and required computation time. The method presented in this paper is referred to as LowKi (Low-depth Kinship) and has been made available in an R package (https://github.com/genostats/LowKi). SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04795-8. |
format | Online Article Text |
id | pubmed-9233360 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-92333602022-06-26 Moment estimators of relatedness from low-depth whole-genome sequencing data Herzig, Anthony F. Ciullo, M. Leutenegger, A-L. Perdry, H. BMC Bioinformatics Research BACKGROUND: Estimating relatedness is an important step for many genetic study designs. A variety of methods for estimating coefficients of pairwise relatedness from genotype data have been proposed. Both the kinship coefficient [Formula: see text] and the fraternity coefficient [Formula: see text] for all pairs of individuals are of interest. However, when dealing with low-depth sequencing or imputation data, individual level genotypes cannot be confidently called. To ignore such uncertainty is known to result in biased estimates. Accordingly, methods have recently been developed to estimate kinship from uncertain genotypes. RESULTS: We present new method-of-moment estimators of both the coefficients [Formula: see text] and [Formula: see text] calculated directly from genotype likelihoods. We have simulated low-depth genetic data for a sample of individuals with extensive relatedness by using the complex pedigree of the known genetic isolates of Cilento in South Italy. Through this simulation, we explore the behaviour of our estimators, demonstrate their properties, and show advantages over alternative methods. A demonstration of our method is given for a sample of 150 French individuals with down-sampled sequencing data. CONCLUSIONS: We find that our method can provide accurate relatedness estimates whilst holding advantages over existing methods in terms of robustness, independence from external software, and required computation time. The method presented in this paper is referred to as LowKi (Low-depth Kinship) and has been made available in an R package (https://github.com/genostats/LowKi). SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04795-8. BioMed Central 2022-06-24 /pmc/articles/PMC9233360/ /pubmed/35751014 http://dx.doi.org/10.1186/s12859-022-04795-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Herzig, Anthony F. Ciullo, M. Leutenegger, A-L. Perdry, H. Moment estimators of relatedness from low-depth whole-genome sequencing data |
title | Moment estimators of relatedness from low-depth whole-genome sequencing data |
title_full | Moment estimators of relatedness from low-depth whole-genome sequencing data |
title_fullStr | Moment estimators of relatedness from low-depth whole-genome sequencing data |
title_full_unstemmed | Moment estimators of relatedness from low-depth whole-genome sequencing data |
title_short | Moment estimators of relatedness from low-depth whole-genome sequencing data |
title_sort | moment estimators of relatedness from low-depth whole-genome sequencing data |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9233360/ https://www.ncbi.nlm.nih.gov/pubmed/35751014 http://dx.doi.org/10.1186/s12859-022-04795-8 |
work_keys_str_mv | AT herziganthonyf momentestimatorsofrelatednessfromlowdepthwholegenomesequencingdata AT ciullom momentestimatorsofrelatednessfromlowdepthwholegenomesequencingdata AT momentestimatorsofrelatednessfromlowdepthwholegenomesequencingdata AT leuteneggeral momentestimatorsofrelatednessfromlowdepthwholegenomesequencingdata AT perdryh momentestimatorsofrelatednessfromlowdepthwholegenomesequencingdata |