Cargando…
A likelihood ratio approach for identifying three-quarter siblings in genetic databases
The detection of family relationships in genetic databases is of interest in various scientific disciplines such as genetic epidemiology, population and conservation genetics, forensic science, and genealogical research. Nowadays, screening genetic databases for related individuals forms an importan...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8027836/ https://www.ncbi.nlm.nih.gov/pubmed/33452467 http://dx.doi.org/10.1038/s41437-020-00392-8 |
_version_ | 1783675877683363840 |
---|---|
author | Galván-Femenía, Iván Barceló-Vidal, Carles Sumoy, Lauro Moreno, Victor de Cid, Rafael Graffelman, Jan |
author_facet | Galván-Femenía, Iván Barceló-Vidal, Carles Sumoy, Lauro Moreno, Victor de Cid, Rafael Graffelman, Jan |
author_sort | Galván-Femenía, Iván |
collection | PubMed |
description | The detection of family relationships in genetic databases is of interest in various scientific disciplines such as genetic epidemiology, population and conservation genetics, forensic science, and genealogical research. Nowadays, screening genetic databases for related individuals forms an important aspect of standard quality control procedures. Relatedness research is usually based on an allele sharing analysis of identity by state (IBS) or identity by descent (IBD) alleles. Existing IBS/IBD methods mainly aim to identify first-degree relationships (parent–offspring or full siblings) and second degree (half-siblings, avuncular, or grandparent–grandchild) pairs. Little attention has been paid to the detection of in-between first and second-degree relationships such as three-quarter siblings (3/4S) who share fewer alleles than first-degree relationships but more alleles than second-degree relationships. With the progressively increasing sample sizes used in genetic research, it becomes more likely that such relationships are present in the database under study. In this paper, we extend existing likelihood ratio (LR) methodology to accurately infer the existence of 3/4S, distinguishing them from full siblings and second-degree relatives. We use bootstrap confidence intervals to express uncertainty in the LRs. Our proposal accounts for linkage disequilibrium (LD) by using marker pruning, and we validate our methodology with a pedigree-based simulation study accounting for both LD and recombination. An empirical genome-wide array data set from the GCAT Genomes for Life cohort project is used to illustrate the method. |
format | Online Article Text |
id | pubmed-8027836 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-80278362021-04-21 A likelihood ratio approach for identifying three-quarter siblings in genetic databases Galván-Femenía, Iván Barceló-Vidal, Carles Sumoy, Lauro Moreno, Victor de Cid, Rafael Graffelman, Jan Heredity (Edinb) Article The detection of family relationships in genetic databases is of interest in various scientific disciplines such as genetic epidemiology, population and conservation genetics, forensic science, and genealogical research. Nowadays, screening genetic databases for related individuals forms an important aspect of standard quality control procedures. Relatedness research is usually based on an allele sharing analysis of identity by state (IBS) or identity by descent (IBD) alleles. Existing IBS/IBD methods mainly aim to identify first-degree relationships (parent–offspring or full siblings) and second degree (half-siblings, avuncular, or grandparent–grandchild) pairs. Little attention has been paid to the detection of in-between first and second-degree relationships such as three-quarter siblings (3/4S) who share fewer alleles than first-degree relationships but more alleles than second-degree relationships. With the progressively increasing sample sizes used in genetic research, it becomes more likely that such relationships are present in the database under study. In this paper, we extend existing likelihood ratio (LR) methodology to accurately infer the existence of 3/4S, distinguishing them from full siblings and second-degree relatives. We use bootstrap confidence intervals to express uncertainty in the LRs. Our proposal accounts for linkage disequilibrium (LD) by using marker pruning, and we validate our methodology with a pedigree-based simulation study accounting for both LD and recombination. An empirical genome-wide array data set from the GCAT Genomes for Life cohort project is used to illustrate the method. Springer International Publishing 2021-01-15 2021-03 /pmc/articles/PMC8027836/ /pubmed/33452467 http://dx.doi.org/10.1038/s41437-020-00392-8 Text en © The Author(s) 2021 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Galván-Femenía, Iván Barceló-Vidal, Carles Sumoy, Lauro Moreno, Victor de Cid, Rafael Graffelman, Jan A likelihood ratio approach for identifying three-quarter siblings in genetic databases |
title | A likelihood ratio approach for identifying three-quarter siblings in genetic databases |
title_full | A likelihood ratio approach for identifying three-quarter siblings in genetic databases |
title_fullStr | A likelihood ratio approach for identifying three-quarter siblings in genetic databases |
title_full_unstemmed | A likelihood ratio approach for identifying three-quarter siblings in genetic databases |
title_short | A likelihood ratio approach for identifying three-quarter siblings in genetic databases |
title_sort | likelihood ratio approach for identifying three-quarter siblings in genetic databases |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8027836/ https://www.ncbi.nlm.nih.gov/pubmed/33452467 http://dx.doi.org/10.1038/s41437-020-00392-8 |
work_keys_str_mv | AT galvanfemeniaivan alikelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases AT barcelovidalcarles alikelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases AT sumoylauro alikelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases AT morenovictor alikelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases AT decidrafael alikelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases AT graffelmanjan alikelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases AT galvanfemeniaivan likelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases AT barcelovidalcarles likelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases AT sumoylauro likelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases AT morenovictor likelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases AT decidrafael likelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases AT graffelmanjan likelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases |