Cargando…

Comparison of methods for deriving phenotypes from incomplete observation data with an application to age at puberty in dairy cattle

BACKGROUND: Many phenotypes in animal breeding are derived from incomplete measures, especially if they are challenging or expensive to measure precisely. Examples include time-dependent traits such as reproductive status, or lifespan. Incomplete measures for these traits result in phenotypes that a...

Descripción completa

Detalles Bibliográficos
Autores principales:	Stephen, Melissa A., Burke, Chris R., Pryce, Jennie E., Steele, Nicole M., Amer, Peter R., Meier, Susanne, Phyn, Claire V. C., Garrick, Dorian J.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2023
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10492402/ https://www.ncbi.nlm.nih.gov/pubmed/37684681 http://dx.doi.org/10.1186/s40104-023-00921-5

_version_	1785104249024151552
author	Stephen, Melissa A. Burke, Chris R. Pryce, Jennie E. Steele, Nicole M. Amer, Peter R. Meier, Susanne Phyn, Claire V. C. Garrick, Dorian J.
author_facet	Stephen, Melissa A. Burke, Chris R. Pryce, Jennie E. Steele, Nicole M. Amer, Peter R. Meier, Susanne Phyn, Claire V. C. Garrick, Dorian J.
author_sort	Stephen, Melissa A.
collection	PubMed
description	BACKGROUND: Many phenotypes in animal breeding are derived from incomplete measures, especially if they are challenging or expensive to measure precisely. Examples include time-dependent traits such as reproductive status, or lifespan. Incomplete measures for these traits result in phenotypes that are subject to left-, interval- and right-censoring, where phenotypes are only known to fall below an upper bound, between a lower and upper bound, or above a lower bound respectively. Here we compare three methods for deriving phenotypes from incomplete data using age at first elevation (> 1 ng/mL) in blood plasma progesterone (AGEP4), which generally coincides with onset of puberty, as an example trait. METHODS: We produced AGEP4 phenotypes from three blood samples collected at about 30-day intervals from approximately 5,000 Holstein–Friesian or Holstein–Friesian × Jersey cross-bred dairy heifers managed in 54 seasonal-calving, pasture-based herds in New Zealand. We used these actual data to simulate 7 different visit scenarios, increasing the extent of censoring by disregarding data from one or two of the three visits. Three methods for deriving phenotypes from these data were explored: 1) ordinal categorical variables which were analysed using categorical threshold analysis; 2) continuous variables, with a penalty of 31 d assigned to right-censored phenotypes; and 3) continuous variables, sampled from within a lower and upper bound using a data augmentation approach. RESULTS: Credibility intervals for heritability estimations overlapped across all methods and visit scenarios, but estimated heritabilities tended to be higher when left censoring was reduced. For sires with at least 5 daughters, the correlations between estimated breeding values (EBVs) from our three-visit scenario and each reduced data scenario varied by method, ranging from 0.65 to 0.95. The estimated breed effects also varied by method, but breed differences were smaller as phenotype censoring increased. CONCLUSION: Our results indicate that using some methods, phenotypes derived from one observation per offspring for a time-dependent trait such as AGEP4 may provide comparable sire rankings to three observations per offspring. This has implications for the design of large-scale phenotyping initiatives where animal breeders aim to estimate variance parameters and estimated breeding values (EBVs) for phenotypes that are challenging to measure or prohibitively expensive.
format	Online Article Text
id	pubmed-10492402
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-104924022023-09-10 Comparison of methods for deriving phenotypes from incomplete observation data with an application to age at puberty in dairy cattle Stephen, Melissa A. Burke, Chris R. Pryce, Jennie E. Steele, Nicole M. Amer, Peter R. Meier, Susanne Phyn, Claire V. C. Garrick, Dorian J. J Anim Sci Biotechnol Research BACKGROUND: Many phenotypes in animal breeding are derived from incomplete measures, especially if they are challenging or expensive to measure precisely. Examples include time-dependent traits such as reproductive status, or lifespan. Incomplete measures for these traits result in phenotypes that are subject to left-, interval- and right-censoring, where phenotypes are only known to fall below an upper bound, between a lower and upper bound, or above a lower bound respectively. Here we compare three methods for deriving phenotypes from incomplete data using age at first elevation (> 1 ng/mL) in blood plasma progesterone (AGEP4), which generally coincides with onset of puberty, as an example trait. METHODS: We produced AGEP4 phenotypes from three blood samples collected at about 30-day intervals from approximately 5,000 Holstein–Friesian or Holstein–Friesian × Jersey cross-bred dairy heifers managed in 54 seasonal-calving, pasture-based herds in New Zealand. We used these actual data to simulate 7 different visit scenarios, increasing the extent of censoring by disregarding data from one or two of the three visits. Three methods for deriving phenotypes from these data were explored: 1) ordinal categorical variables which were analysed using categorical threshold analysis; 2) continuous variables, with a penalty of 31 d assigned to right-censored phenotypes; and 3) continuous variables, sampled from within a lower and upper bound using a data augmentation approach. RESULTS: Credibility intervals for heritability estimations overlapped across all methods and visit scenarios, but estimated heritabilities tended to be higher when left censoring was reduced. For sires with at least 5 daughters, the correlations between estimated breeding values (EBVs) from our three-visit scenario and each reduced data scenario varied by method, ranging from 0.65 to 0.95. The estimated breed effects also varied by method, but breed differences were smaller as phenotype censoring increased. CONCLUSION: Our results indicate that using some methods, phenotypes derived from one observation per offspring for a time-dependent trait such as AGEP4 may provide comparable sire rankings to three observations per offspring. This has implications for the design of large-scale phenotyping initiatives where animal breeders aim to estimate variance parameters and estimated breeding values (EBVs) for phenotypes that are challenging to measure or prohibitively expensive. BioMed Central 2023-09-09 /pmc/articles/PMC10492402/ /pubmed/37684681 http://dx.doi.org/10.1186/s40104-023-00921-5 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Stephen, Melissa A. Burke, Chris R. Pryce, Jennie E. Steele, Nicole M. Amer, Peter R. Meier, Susanne Phyn, Claire V. C. Garrick, Dorian J. Comparison of methods for deriving phenotypes from incomplete observation data with an application to age at puberty in dairy cattle
title	Comparison of methods for deriving phenotypes from incomplete observation data with an application to age at puberty in dairy cattle
title_full	Comparison of methods for deriving phenotypes from incomplete observation data with an application to age at puberty in dairy cattle
title_fullStr	Comparison of methods for deriving phenotypes from incomplete observation data with an application to age at puberty in dairy cattle
title_full_unstemmed	Comparison of methods for deriving phenotypes from incomplete observation data with an application to age at puberty in dairy cattle
title_short	Comparison of methods for deriving phenotypes from incomplete observation data with an application to age at puberty in dairy cattle
title_sort	comparison of methods for deriving phenotypes from incomplete observation data with an application to age at puberty in dairy cattle
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10492402/ https://www.ncbi.nlm.nih.gov/pubmed/37684681 http://dx.doi.org/10.1186/s40104-023-00921-5
work_keys_str_mv	AT stephenmelissaa comparisonofmethodsforderivingphenotypesfromincompleteobservationdatawithanapplicationtoageatpubertyindairycattle AT burkechrisr comparisonofmethodsforderivingphenotypesfromincompleteobservationdatawithanapplicationtoageatpubertyindairycattle AT prycejenniee comparisonofmethodsforderivingphenotypesfromincompleteobservationdatawithanapplicationtoageatpubertyindairycattle AT steelenicolem comparisonofmethodsforderivingphenotypesfromincompleteobservationdatawithanapplicationtoageatpubertyindairycattle AT amerpeterr comparisonofmethodsforderivingphenotypesfromincompleteobservationdatawithanapplicationtoageatpubertyindairycattle AT meiersusanne comparisonofmethodsforderivingphenotypesfromincompleteobservationdatawithanapplicationtoageatpubertyindairycattle AT phynclairevc comparisonofmethodsforderivingphenotypesfromincompleteobservationdatawithanapplicationtoageatpubertyindairycattle AT garrickdorianj comparisonofmethodsforderivingphenotypesfromincompleteobservationdatawithanapplicationtoageatpubertyindairycattle

Comparison of methods for deriving phenotypes from incomplete observation data with an application to age at puberty in dairy cattle

Ejemplares similares