Cargando…

Codominant scoring of AFLP in association panels

A study on the codominant scoring of AFLP markers in association panels without prior knowledge on genotype probabilities is described. Bands are scored codominantly by fitting normal mixture models to band intensities, illustrating and optimizing existing methodology, which employs the EM-algorithm...

Descripción completa

Detalles Bibliográficos
Autores principales: Gort, Gerrit, van Eeuwijk, Fred A.
Formato: Texto
Lenguaje:English
Publicado: Springer-Verlag 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2886132/
https://www.ncbi.nlm.nih.gov/pubmed/20237752
http://dx.doi.org/10.1007/s00122-010-1313-x
_version_ 1782182448400957440
author Gort, Gerrit
van Eeuwijk, Fred A.
author_facet Gort, Gerrit
van Eeuwijk, Fred A.
author_sort Gort, Gerrit
collection PubMed
description A study on the codominant scoring of AFLP markers in association panels without prior knowledge on genotype probabilities is described. Bands are scored codominantly by fitting normal mixture models to band intensities, illustrating and optimizing existing methodology, which employs the EM-algorithm. We study features that improve the performance of the algorithm, and the unmixing in general, like parameter initialization, restrictions on parameters, data transformation, and outlier removal. Parameter restrictions include equal component variances, equal or nearly equal distances between component means, and mixing probabilities according to Hardy–Weinberg Equilibrium. Histogram visualization of band intensities with superimposed normal densities, and optional classification scores and other grouping information, assists further in the codominant scoring. We find empirical evidence favoring the square root transformation of the band intensity, as was found in segregating populations. Our approach provides posterior genotype probabilities for marker loci. These probabilities can form the basis for association mapping and are more useful than the standard scoring categories A, H, B, C, D. They can also be used to calculate predictors for additive and dominance effects. Diagnostics for data quality of AFLP markers are described: preference for three-component mixture model, good separation between component means, and lack of singletons for the component with highest mean. Software has been developed in R, containing the models for normal mixtures with facilitating features, and visualizations. The methods are applied to an association panel in tomato, comprising 1,175 polymorphic markers on 94 tomato hybrids, as part of a larger study within the Dutch Centre for BioSystems Genomics.
format Text
id pubmed-2886132
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Springer-Verlag
record_format MEDLINE/PubMed
spelling pubmed-28861322010-07-21 Codominant scoring of AFLP in association panels Gort, Gerrit van Eeuwijk, Fred A. Theor Appl Genet Original Paper A study on the codominant scoring of AFLP markers in association panels without prior knowledge on genotype probabilities is described. Bands are scored codominantly by fitting normal mixture models to band intensities, illustrating and optimizing existing methodology, which employs the EM-algorithm. We study features that improve the performance of the algorithm, and the unmixing in general, like parameter initialization, restrictions on parameters, data transformation, and outlier removal. Parameter restrictions include equal component variances, equal or nearly equal distances between component means, and mixing probabilities according to Hardy–Weinberg Equilibrium. Histogram visualization of band intensities with superimposed normal densities, and optional classification scores and other grouping information, assists further in the codominant scoring. We find empirical evidence favoring the square root transformation of the band intensity, as was found in segregating populations. Our approach provides posterior genotype probabilities for marker loci. These probabilities can form the basis for association mapping and are more useful than the standard scoring categories A, H, B, C, D. They can also be used to calculate predictors for additive and dominance effects. Diagnostics for data quality of AFLP markers are described: preference for three-component mixture model, good separation between component means, and lack of singletons for the component with highest mean. Software has been developed in R, containing the models for normal mixtures with facilitating features, and visualizations. The methods are applied to an association panel in tomato, comprising 1,175 polymorphic markers on 94 tomato hybrids, as part of a larger study within the Dutch Centre for BioSystems Genomics. Springer-Verlag 2010-03-17 2010 /pmc/articles/PMC2886132/ /pubmed/20237752 http://dx.doi.org/10.1007/s00122-010-1313-x Text en © The Author(s) 2010 https://creativecommons.org/licenses/by-nc/4.0/ This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.
spellingShingle Original Paper
Gort, Gerrit
van Eeuwijk, Fred A.
Codominant scoring of AFLP in association panels
title Codominant scoring of AFLP in association panels
title_full Codominant scoring of AFLP in association panels
title_fullStr Codominant scoring of AFLP in association panels
title_full_unstemmed Codominant scoring of AFLP in association panels
title_short Codominant scoring of AFLP in association panels
title_sort codominant scoring of aflp in association panels
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2886132/
https://www.ncbi.nlm.nih.gov/pubmed/20237752
http://dx.doi.org/10.1007/s00122-010-1313-x
work_keys_str_mv AT gortgerrit codominantscoringofaflpinassociationpanels
AT vaneeuwijkfreda codominantscoringofaflpinassociationpanels