Cargando…
Two molecular measures of relatedness based on haplotype sharing
BACKGROUND: Measuring the extent of shared ancestry between individuals or organisms is important in many fields, including forensic science, conservation genetics and animal breeding. The traditional approach is to calculate the expected degree of relatedness between individuals in a pedigree. This...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4642635/ https://www.ncbi.nlm.nih.gov/pubmed/26558877 http://dx.doi.org/10.1186/s12859-015-0802-y |
_version_ | 1782400394942480384 |
---|---|
author | Edwards, David |
author_facet | Edwards, David |
author_sort | Edwards, David |
collection | PubMed |
description | BACKGROUND: Measuring the extent of shared ancestry between individuals or organisms is important in many fields, including forensic science, conservation genetics and animal breeding. The traditional approach is to calculate the expected degree of relatedness between individuals in a pedigree. This assumes that the founders of the pedigree are non-inbred and unrelated to each other, which is rarely the case. In contrast, molecular data allow measurement of actual relatedness without knowledge of a pedigree. Methods to do this have been proposed, but generally do not take the lengths of the genomic regions shared between individuals into account. RESULTS: Two measures based on the extent of haplotype sharing between genomes are proposed. The intercept measure B estimates the fraction of shared genome between individuals, and the product measure C is closely related to the numerator relationship matrix A. Both are based on a model for the joint distribution of markers at the haplotype level. The two measures are compared to the pedigree-based measure A and to vanRaden’s G, a frequently used molecular measure, using a set of data comprising 5037 dairy cattle. The comparison criteria include the ability to capture genealogical relatedness and the prediction accuracy obtained when used in genomic prediction. Both B and C explain around 95 % of the variation in A, whereas G explains around 6 %. G captures genealogical relatedness poorly, particularly for distantly related individuals (second cousins or farther). Both B and C tend to be larger than A but this can be ascribed to the assumption of non-inbred unrelated founders. Using C in linear mixed models results in slightly higher prediction accuracy than G, and using B results in slightly lower prediction accuracy. CONCLUSIONS: The two proposed measures of relatedness capture genealogical relatedness well, outperforming vanRaden’s G in this respect. When used in genomic prediction models, the product measure leads to slightly improved prediction accuracy. |
format | Online Article Text |
id | pubmed-4642635 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-46426352015-11-13 Two molecular measures of relatedness based on haplotype sharing Edwards, David BMC Bioinformatics Methodology Article BACKGROUND: Measuring the extent of shared ancestry between individuals or organisms is important in many fields, including forensic science, conservation genetics and animal breeding. The traditional approach is to calculate the expected degree of relatedness between individuals in a pedigree. This assumes that the founders of the pedigree are non-inbred and unrelated to each other, which is rarely the case. In contrast, molecular data allow measurement of actual relatedness without knowledge of a pedigree. Methods to do this have been proposed, but generally do not take the lengths of the genomic regions shared between individuals into account. RESULTS: Two measures based on the extent of haplotype sharing between genomes are proposed. The intercept measure B estimates the fraction of shared genome between individuals, and the product measure C is closely related to the numerator relationship matrix A. Both are based on a model for the joint distribution of markers at the haplotype level. The two measures are compared to the pedigree-based measure A and to vanRaden’s G, a frequently used molecular measure, using a set of data comprising 5037 dairy cattle. The comparison criteria include the ability to capture genealogical relatedness and the prediction accuracy obtained when used in genomic prediction. Both B and C explain around 95 % of the variation in A, whereas G explains around 6 %. G captures genealogical relatedness poorly, particularly for distantly related individuals (second cousins or farther). Both B and C tend to be larger than A but this can be ascribed to the assumption of non-inbred unrelated founders. Using C in linear mixed models results in slightly higher prediction accuracy than G, and using B results in slightly lower prediction accuracy. CONCLUSIONS: The two proposed measures of relatedness capture genealogical relatedness well, outperforming vanRaden’s G in this respect. When used in genomic prediction models, the product measure leads to slightly improved prediction accuracy. BioMed Central 2015-11-11 /pmc/articles/PMC4642635/ /pubmed/26558877 http://dx.doi.org/10.1186/s12859-015-0802-y Text en © Edwards. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Edwards, David Two molecular measures of relatedness based on haplotype sharing |
title | Two molecular measures of relatedness based on haplotype sharing |
title_full | Two molecular measures of relatedness based on haplotype sharing |
title_fullStr | Two molecular measures of relatedness based on haplotype sharing |
title_full_unstemmed | Two molecular measures of relatedness based on haplotype sharing |
title_short | Two molecular measures of relatedness based on haplotype sharing |
title_sort | two molecular measures of relatedness based on haplotype sharing |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4642635/ https://www.ncbi.nlm.nih.gov/pubmed/26558877 http://dx.doi.org/10.1186/s12859-015-0802-y |
work_keys_str_mv | AT edwardsdavid twomolecularmeasuresofrelatednessbasedonhaplotypesharing |