Cargando…
Comparison of classification methods for detecting associations between SNPs and chick mortality
Multi-category classification methods were used to detect SNP-mortality associations in broilers. The objective was to select a subset of whole genome SNPs associated with chick mortality. This was done by categorizing mortality rates and using a filter-wrapper feature selection procedure in each of...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3225888/ https://www.ncbi.nlm.nih.gov/pubmed/19284707 http://dx.doi.org/10.1186/1297-9686-41-18 |
_version_ | 1782217541989433344 |
---|---|
author | Long, Nanye Gianola, Daniel Rosa, Guilherme JM Weigel, Kent A Avendaño, Santiago |
author_facet | Long, Nanye Gianola, Daniel Rosa, Guilherme JM Weigel, Kent A Avendaño, Santiago |
author_sort | Long, Nanye |
collection | PubMed |
description | Multi-category classification methods were used to detect SNP-mortality associations in broilers. The objective was to select a subset of whole genome SNPs associated with chick mortality. This was done by categorizing mortality rates and using a filter-wrapper feature selection procedure in each of the classification methods evaluated. Different numbers of categories (2, 3, 4, 5 and 10) and three classification algorithms (naïve Bayes classifiers, Bayesian networks and neural networks) were compared, using early and late chick mortality rates in low and high hygiene environments. Evaluation of SNPs selected by each classification method was done by predicted residual sum of squares and a significance test-related metric. A naïve Bayes classifier, coupled with discretization into two or three categories generated the SNP subset with greatest predictive ability. Further, an alternative categorization scheme, which used only two extreme portions of the empirical distribution of mortality rates, was considered. This scheme selected SNPs with greater predictive ability than those chosen by the methods described previously. Use of extreme samples seems to enhance the ability of feature selection procedures to select influential SNPs in genetic association studies. |
format | Online Article Text |
id | pubmed-3225888 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32258882011-11-30 Comparison of classification methods for detecting associations between SNPs and chick mortality Long, Nanye Gianola, Daniel Rosa, Guilherme JM Weigel, Kent A Avendaño, Santiago Genet Sel Evol Research Multi-category classification methods were used to detect SNP-mortality associations in broilers. The objective was to select a subset of whole genome SNPs associated with chick mortality. This was done by categorizing mortality rates and using a filter-wrapper feature selection procedure in each of the classification methods evaluated. Different numbers of categories (2, 3, 4, 5 and 10) and three classification algorithms (naïve Bayes classifiers, Bayesian networks and neural networks) were compared, using early and late chick mortality rates in low and high hygiene environments. Evaluation of SNPs selected by each classification method was done by predicted residual sum of squares and a significance test-related metric. A naïve Bayes classifier, coupled with discretization into two or three categories generated the SNP subset with greatest predictive ability. Further, an alternative categorization scheme, which used only two extreme portions of the empirical distribution of mortality rates, was considered. This scheme selected SNPs with greater predictive ability than those chosen by the methods described previously. Use of extreme samples seems to enhance the ability of feature selection procedures to select influential SNPs in genetic association studies. BioMed Central 2009-01-23 /pmc/articles/PMC3225888/ /pubmed/19284707 http://dx.doi.org/10.1186/1297-9686-41-18 Text en Copyright ©2009 Long et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Long, Nanye Gianola, Daniel Rosa, Guilherme JM Weigel, Kent A Avendaño, Santiago Comparison of classification methods for detecting associations between SNPs and chick mortality |
title | Comparison of classification methods for detecting associations between SNPs and chick mortality |
title_full | Comparison of classification methods for detecting associations between SNPs and chick mortality |
title_fullStr | Comparison of classification methods for detecting associations between SNPs and chick mortality |
title_full_unstemmed | Comparison of classification methods for detecting associations between SNPs and chick mortality |
title_short | Comparison of classification methods for detecting associations between SNPs and chick mortality |
title_sort | comparison of classification methods for detecting associations between snps and chick mortality |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3225888/ https://www.ncbi.nlm.nih.gov/pubmed/19284707 http://dx.doi.org/10.1186/1297-9686-41-18 |
work_keys_str_mv | AT longnanye comparisonofclassificationmethodsfordetectingassociationsbetweensnpsandchickmortality AT gianoladaniel comparisonofclassificationmethodsfordetectingassociationsbetweensnpsandchickmortality AT rosaguilhermejm comparisonofclassificationmethodsfordetectingassociationsbetweensnpsandchickmortality AT weigelkenta comparisonofclassificationmethodsfordetectingassociationsbetweensnpsandchickmortality AT avendanosantiago comparisonofclassificationmethodsfordetectingassociationsbetweensnpsandchickmortality |