Cargando…

A likelihood ratio approach for identifying three-quarter siblings in genetic databases

The detection of family relationships in genetic databases is of interest in various scientific disciplines such as genetic epidemiology, population and conservation genetics, forensic science, and genealogical research. Nowadays, screening genetic databases for related individuals forms an importan...

Descripción completa

Detalles Bibliográficos
Autores principales: Galván-Femenía, Iván, Barceló-Vidal, Carles, Sumoy, Lauro, Moreno, Victor, de Cid, Rafael, Graffelman, Jan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8027836/
https://www.ncbi.nlm.nih.gov/pubmed/33452467
http://dx.doi.org/10.1038/s41437-020-00392-8
_version_ 1783675877683363840
author Galván-Femenía, Iván
Barceló-Vidal, Carles
Sumoy, Lauro
Moreno, Victor
de Cid, Rafael
Graffelman, Jan
author_facet Galván-Femenía, Iván
Barceló-Vidal, Carles
Sumoy, Lauro
Moreno, Victor
de Cid, Rafael
Graffelman, Jan
author_sort Galván-Femenía, Iván
collection PubMed
description The detection of family relationships in genetic databases is of interest in various scientific disciplines such as genetic epidemiology, population and conservation genetics, forensic science, and genealogical research. Nowadays, screening genetic databases for related individuals forms an important aspect of standard quality control procedures. Relatedness research is usually based on an allele sharing analysis of identity by state (IBS) or identity by descent (IBD) alleles. Existing IBS/IBD methods mainly aim to identify first-degree relationships (parent–offspring or full siblings) and second degree (half-siblings, avuncular, or grandparent–grandchild) pairs. Little attention has been paid to the detection of in-between first and second-degree relationships such as three-quarter siblings (3/4S) who share fewer alleles than first-degree relationships but more alleles than second-degree relationships. With the progressively increasing sample sizes used in genetic research, it becomes more likely that such relationships are present in the database under study. In this paper, we extend existing likelihood ratio (LR) methodology to accurately infer the existence of 3/4S, distinguishing them from full siblings and second-degree relatives. We use bootstrap confidence intervals to express uncertainty in the LRs. Our proposal accounts for linkage disequilibrium (LD) by using marker pruning, and we validate our methodology with a pedigree-based simulation study accounting for both LD and recombination. An empirical genome-wide array data set from the GCAT Genomes for Life cohort project is used to illustrate the method.
format Online
Article
Text
id pubmed-8027836
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-80278362021-04-21 A likelihood ratio approach for identifying three-quarter siblings in genetic databases Galván-Femenía, Iván Barceló-Vidal, Carles Sumoy, Lauro Moreno, Victor de Cid, Rafael Graffelman, Jan Heredity (Edinb) Article The detection of family relationships in genetic databases is of interest in various scientific disciplines such as genetic epidemiology, population and conservation genetics, forensic science, and genealogical research. Nowadays, screening genetic databases for related individuals forms an important aspect of standard quality control procedures. Relatedness research is usually based on an allele sharing analysis of identity by state (IBS) or identity by descent (IBD) alleles. Existing IBS/IBD methods mainly aim to identify first-degree relationships (parent–offspring or full siblings) and second degree (half-siblings, avuncular, or grandparent–grandchild) pairs. Little attention has been paid to the detection of in-between first and second-degree relationships such as three-quarter siblings (3/4S) who share fewer alleles than first-degree relationships but more alleles than second-degree relationships. With the progressively increasing sample sizes used in genetic research, it becomes more likely that such relationships are present in the database under study. In this paper, we extend existing likelihood ratio (LR) methodology to accurately infer the existence of 3/4S, distinguishing them from full siblings and second-degree relatives. We use bootstrap confidence intervals to express uncertainty in the LRs. Our proposal accounts for linkage disequilibrium (LD) by using marker pruning, and we validate our methodology with a pedigree-based simulation study accounting for both LD and recombination. An empirical genome-wide array data set from the GCAT Genomes for Life cohort project is used to illustrate the method. Springer International Publishing 2021-01-15 2021-03 /pmc/articles/PMC8027836/ /pubmed/33452467 http://dx.doi.org/10.1038/s41437-020-00392-8 Text en © The Author(s) 2021 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Galván-Femenía, Iván
Barceló-Vidal, Carles
Sumoy, Lauro
Moreno, Victor
de Cid, Rafael
Graffelman, Jan
A likelihood ratio approach for identifying three-quarter siblings in genetic databases
title A likelihood ratio approach for identifying three-quarter siblings in genetic databases
title_full A likelihood ratio approach for identifying three-quarter siblings in genetic databases
title_fullStr A likelihood ratio approach for identifying three-quarter siblings in genetic databases
title_full_unstemmed A likelihood ratio approach for identifying three-quarter siblings in genetic databases
title_short A likelihood ratio approach for identifying three-quarter siblings in genetic databases
title_sort likelihood ratio approach for identifying three-quarter siblings in genetic databases
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8027836/
https://www.ncbi.nlm.nih.gov/pubmed/33452467
http://dx.doi.org/10.1038/s41437-020-00392-8
work_keys_str_mv AT galvanfemeniaivan alikelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases
AT barcelovidalcarles alikelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases
AT sumoylauro alikelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases
AT morenovictor alikelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases
AT decidrafael alikelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases
AT graffelmanjan alikelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases
AT galvanfemeniaivan likelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases
AT barcelovidalcarles likelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases
AT sumoylauro likelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases
AT morenovictor likelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases
AT decidrafael likelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases
AT graffelmanjan likelihoodratioapproachforidentifyingthreequartersiblingsingeneticdatabases