Cargando…

Two molecular measures of relatedness based on haplotype sharing

BACKGROUND: Measuring the extent of shared ancestry between individuals or organisms is important in many fields, including forensic science, conservation genetics and animal breeding. The traditional approach is to calculate the expected degree of relatedness between individuals in a pedigree. This...

Descripción completa

Detalles Bibliográficos
Autor principal: Edwards, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4642635/
https://www.ncbi.nlm.nih.gov/pubmed/26558877
http://dx.doi.org/10.1186/s12859-015-0802-y
_version_ 1782400394942480384
author Edwards, David
author_facet Edwards, David
author_sort Edwards, David
collection PubMed
description BACKGROUND: Measuring the extent of shared ancestry between individuals or organisms is important in many fields, including forensic science, conservation genetics and animal breeding. The traditional approach is to calculate the expected degree of relatedness between individuals in a pedigree. This assumes that the founders of the pedigree are non-inbred and unrelated to each other, which is rarely the case. In contrast, molecular data allow measurement of actual relatedness without knowledge of a pedigree. Methods to do this have been proposed, but generally do not take the lengths of the genomic regions shared between individuals into account. RESULTS: Two measures based on the extent of haplotype sharing between genomes are proposed. The intercept measure B estimates the fraction of shared genome between individuals, and the product measure C is closely related to the numerator relationship matrix A. Both are based on a model for the joint distribution of markers at the haplotype level. The two measures are compared to the pedigree-based measure A and to vanRaden’s G, a frequently used molecular measure, using a set of data comprising 5037 dairy cattle. The comparison criteria include the ability to capture genealogical relatedness and the prediction accuracy obtained when used in genomic prediction. Both B and C explain around 95 % of the variation in A, whereas G explains around 6 %. G captures genealogical relatedness poorly, particularly for distantly related individuals (second cousins or farther). Both B and C tend to be larger than A but this can be ascribed to the assumption of non-inbred unrelated founders. Using C in linear mixed models results in slightly higher prediction accuracy than G, and using B results in slightly lower prediction accuracy. CONCLUSIONS: The two proposed measures of relatedness capture genealogical relatedness well, outperforming vanRaden’s G in this respect. When used in genomic prediction models, the product measure leads to slightly improved prediction accuracy.
format Online
Article
Text
id pubmed-4642635
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46426352015-11-13 Two molecular measures of relatedness based on haplotype sharing Edwards, David BMC Bioinformatics Methodology Article BACKGROUND: Measuring the extent of shared ancestry between individuals or organisms is important in many fields, including forensic science, conservation genetics and animal breeding. The traditional approach is to calculate the expected degree of relatedness between individuals in a pedigree. This assumes that the founders of the pedigree are non-inbred and unrelated to each other, which is rarely the case. In contrast, molecular data allow measurement of actual relatedness without knowledge of a pedigree. Methods to do this have been proposed, but generally do not take the lengths of the genomic regions shared between individuals into account. RESULTS: Two measures based on the extent of haplotype sharing between genomes are proposed. The intercept measure B estimates the fraction of shared genome between individuals, and the product measure C is closely related to the numerator relationship matrix A. Both are based on a model for the joint distribution of markers at the haplotype level. The two measures are compared to the pedigree-based measure A and to vanRaden’s G, a frequently used molecular measure, using a set of data comprising 5037 dairy cattle. The comparison criteria include the ability to capture genealogical relatedness and the prediction accuracy obtained when used in genomic prediction. Both B and C explain around 95 % of the variation in A, whereas G explains around 6 %. G captures genealogical relatedness poorly, particularly for distantly related individuals (second cousins or farther). Both B and C tend to be larger than A but this can be ascribed to the assumption of non-inbred unrelated founders. Using C in linear mixed models results in slightly higher prediction accuracy than G, and using B results in slightly lower prediction accuracy. CONCLUSIONS: The two proposed measures of relatedness capture genealogical relatedness well, outperforming vanRaden’s G in this respect. When used in genomic prediction models, the product measure leads to slightly improved prediction accuracy. BioMed Central 2015-11-11 /pmc/articles/PMC4642635/ /pubmed/26558877 http://dx.doi.org/10.1186/s12859-015-0802-y Text en © Edwards. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Edwards, David
Two molecular measures of relatedness based on haplotype sharing
title Two molecular measures of relatedness based on haplotype sharing
title_full Two molecular measures of relatedness based on haplotype sharing
title_fullStr Two molecular measures of relatedness based on haplotype sharing
title_full_unstemmed Two molecular measures of relatedness based on haplotype sharing
title_short Two molecular measures of relatedness based on haplotype sharing
title_sort two molecular measures of relatedness based on haplotype sharing
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4642635/
https://www.ncbi.nlm.nih.gov/pubmed/26558877
http://dx.doi.org/10.1186/s12859-015-0802-y
work_keys_str_mv AT edwardsdavid twomolecularmeasuresofrelatednessbasedonhaplotypesharing