Cargando…

Comparison of classification methods for detecting associations between SNPs and chick mortality

Multi-category classification methods were used to detect SNP-mortality associations in broilers. The objective was to select a subset of whole genome SNPs associated with chick mortality. This was done by categorizing mortality rates and using a filter-wrapper feature selection procedure in each of...

Descripción completa

Detalles Bibliográficos
Autores principales: Long, Nanye, Gianola, Daniel, Rosa, Guilherme JM, Weigel, Kent A, Avendaño, Santiago
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3225888/
https://www.ncbi.nlm.nih.gov/pubmed/19284707
http://dx.doi.org/10.1186/1297-9686-41-18
_version_ 1782217541989433344
author Long, Nanye
Gianola, Daniel
Rosa, Guilherme JM
Weigel, Kent A
Avendaño, Santiago
author_facet Long, Nanye
Gianola, Daniel
Rosa, Guilherme JM
Weigel, Kent A
Avendaño, Santiago
author_sort Long, Nanye
collection PubMed
description Multi-category classification methods were used to detect SNP-mortality associations in broilers. The objective was to select a subset of whole genome SNPs associated with chick mortality. This was done by categorizing mortality rates and using a filter-wrapper feature selection procedure in each of the classification methods evaluated. Different numbers of categories (2, 3, 4, 5 and 10) and three classification algorithms (naïve Bayes classifiers, Bayesian networks and neural networks) were compared, using early and late chick mortality rates in low and high hygiene environments. Evaluation of SNPs selected by each classification method was done by predicted residual sum of squares and a significance test-related metric. A naïve Bayes classifier, coupled with discretization into two or three categories generated the SNP subset with greatest predictive ability. Further, an alternative categorization scheme, which used only two extreme portions of the empirical distribution of mortality rates, was considered. This scheme selected SNPs with greater predictive ability than those chosen by the methods described previously. Use of extreme samples seems to enhance the ability of feature selection procedures to select influential SNPs in genetic association studies.
format Online
Article
Text
id pubmed-3225888
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32258882011-11-30 Comparison of classification methods for detecting associations between SNPs and chick mortality Long, Nanye Gianola, Daniel Rosa, Guilherme JM Weigel, Kent A Avendaño, Santiago Genet Sel Evol Research Multi-category classification methods were used to detect SNP-mortality associations in broilers. The objective was to select a subset of whole genome SNPs associated with chick mortality. This was done by categorizing mortality rates and using a filter-wrapper feature selection procedure in each of the classification methods evaluated. Different numbers of categories (2, 3, 4, 5 and 10) and three classification algorithms (naïve Bayes classifiers, Bayesian networks and neural networks) were compared, using early and late chick mortality rates in low and high hygiene environments. Evaluation of SNPs selected by each classification method was done by predicted residual sum of squares and a significance test-related metric. A naïve Bayes classifier, coupled with discretization into two or three categories generated the SNP subset with greatest predictive ability. Further, an alternative categorization scheme, which used only two extreme portions of the empirical distribution of mortality rates, was considered. This scheme selected SNPs with greater predictive ability than those chosen by the methods described previously. Use of extreme samples seems to enhance the ability of feature selection procedures to select influential SNPs in genetic association studies. BioMed Central 2009-01-23 /pmc/articles/PMC3225888/ /pubmed/19284707 http://dx.doi.org/10.1186/1297-9686-41-18 Text en Copyright ©2009 Long et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Long, Nanye
Gianola, Daniel
Rosa, Guilherme JM
Weigel, Kent A
Avendaño, Santiago
Comparison of classification methods for detecting associations between SNPs and chick mortality
title Comparison of classification methods for detecting associations between SNPs and chick mortality
title_full Comparison of classification methods for detecting associations between SNPs and chick mortality
title_fullStr Comparison of classification methods for detecting associations between SNPs and chick mortality
title_full_unstemmed Comparison of classification methods for detecting associations between SNPs and chick mortality
title_short Comparison of classification methods for detecting associations between SNPs and chick mortality
title_sort comparison of classification methods for detecting associations between snps and chick mortality
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3225888/
https://www.ncbi.nlm.nih.gov/pubmed/19284707
http://dx.doi.org/10.1186/1297-9686-41-18
work_keys_str_mv AT longnanye comparisonofclassificationmethodsfordetectingassociationsbetweensnpsandchickmortality
AT gianoladaniel comparisonofclassificationmethodsfordetectingassociationsbetweensnpsandchickmortality
AT rosaguilhermejm comparisonofclassificationmethodsfordetectingassociationsbetweensnpsandchickmortality
AT weigelkenta comparisonofclassificationmethodsfordetectingassociationsbetweensnpsandchickmortality
AT avendanosantiago comparisonofclassificationmethodsfordetectingassociationsbetweensnpsandchickmortality