Cargando…

Sensitivity analysis based on the random forest machine learning algorithm identifies candidate genes for regulation of innate and adaptive immune response of chicken

Two categories of immune responses—innate and adaptive immunity—have both polygenic backgrounds and a significant environmental component. The goal of the reported study was to define candidate genes and mutations for the immune traits of interest in chickens using machine learning–based sensitivity...

Descripción completa

Detalles Bibliográficos
Autores principales: Polewko-Klim, Aneta, Lesiński, Wojciech, Golińska, Agnieszka Kitlas, Mnich, Krzysztof, Siwek, Maria, Rudnicki, Witold R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7704721/
https://www.ncbi.nlm.nih.gov/pubmed/33248550
http://dx.doi.org/10.1016/j.psj.2020.08.059
_version_ 1783616852558086144
author Polewko-Klim, Aneta
Lesiński, Wojciech
Golińska, Agnieszka Kitlas
Mnich, Krzysztof
Siwek, Maria
Rudnicki, Witold R.
author_facet Polewko-Klim, Aneta
Lesiński, Wojciech
Golińska, Agnieszka Kitlas
Mnich, Krzysztof
Siwek, Maria
Rudnicki, Witold R.
author_sort Polewko-Klim, Aneta
collection PubMed
description Two categories of immune responses—innate and adaptive immunity—have both polygenic backgrounds and a significant environmental component. The goal of the reported study was to define candidate genes and mutations for the immune traits of interest in chickens using machine learning–based sensitivity analysis for single-nucleotide polymorphisms (SNPs) located in candidate genes defined in quantitative trait loci regions. Here the adaptive immunity is represented by the specific antibody response toward keyhole limpet hemocyanin (KLH), whereas the innate immunity was represented by natural antibodies toward lipopolysaccharide (LPS) and lipoteichoic acid (LTA). The analysis consisted of 3 basic steps: an identification of candidate SNPs via feature selection, an optimisation of the feature set using recursive feature elimination, and finally a gene-level sensitivity analysis for final selection of models. The predictive model based on 5 genes (MAPK8IP3 CRLF3, UNC13D, ILR9, and PRCKB) explains 14.9% of variance for KLH adaptive response. The models obtained for LTA and LPS use more genes and have lower predictive power, explaining respectively 7.8 and 4.5% of total variance. In comparison, the linear models built on genes identified by a standard statistical analysis explain 1.5, 0.5, and 0.3% of variance for KLH, LTA, and LPS response, respectively. The present study shows that machine learning methods applied to systems with a complex interaction network can discover phenotype-genotype associations with much higher sensitivity than traditional statistical models. It adds contribution to evidence suggesting a role of MAPK8IP3 in the adaptive immune response. It also indicates that CRLF3 is involved in this process as well. Both findings need additional verification.
format Online
Article
Text
id pubmed-7704721
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-77047212020-12-08 Sensitivity analysis based on the random forest machine learning algorithm identifies candidate genes for regulation of innate and adaptive immune response of chicken Polewko-Klim, Aneta Lesiński, Wojciech Golińska, Agnieszka Kitlas Mnich, Krzysztof Siwek, Maria Rudnicki, Witold R. Poult Sci Genetics and Molecular Biology Two categories of immune responses—innate and adaptive immunity—have both polygenic backgrounds and a significant environmental component. The goal of the reported study was to define candidate genes and mutations for the immune traits of interest in chickens using machine learning–based sensitivity analysis for single-nucleotide polymorphisms (SNPs) located in candidate genes defined in quantitative trait loci regions. Here the adaptive immunity is represented by the specific antibody response toward keyhole limpet hemocyanin (KLH), whereas the innate immunity was represented by natural antibodies toward lipopolysaccharide (LPS) and lipoteichoic acid (LTA). The analysis consisted of 3 basic steps: an identification of candidate SNPs via feature selection, an optimisation of the feature set using recursive feature elimination, and finally a gene-level sensitivity analysis for final selection of models. The predictive model based on 5 genes (MAPK8IP3 CRLF3, UNC13D, ILR9, and PRCKB) explains 14.9% of variance for KLH adaptive response. The models obtained for LTA and LPS use more genes and have lower predictive power, explaining respectively 7.8 and 4.5% of total variance. In comparison, the linear models built on genes identified by a standard statistical analysis explain 1.5, 0.5, and 0.3% of variance for KLH, LTA, and LPS response, respectively. The present study shows that machine learning methods applied to systems with a complex interaction network can discover phenotype-genotype associations with much higher sensitivity than traditional statistical models. It adds contribution to evidence suggesting a role of MAPK8IP3 in the adaptive immune response. It also indicates that CRLF3 is involved in this process as well. Both findings need additional verification. Elsevier 2020-09-12 /pmc/articles/PMC7704721/ /pubmed/33248550 http://dx.doi.org/10.1016/j.psj.2020.08.059 Text en © 2020 Published by Elsevier Inc. on behalf of Poultry Science Association Inc. http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Genetics and Molecular Biology
Polewko-Klim, Aneta
Lesiński, Wojciech
Golińska, Agnieszka Kitlas
Mnich, Krzysztof
Siwek, Maria
Rudnicki, Witold R.
Sensitivity analysis based on the random forest machine learning algorithm identifies candidate genes for regulation of innate and adaptive immune response of chicken
title Sensitivity analysis based on the random forest machine learning algorithm identifies candidate genes for regulation of innate and adaptive immune response of chicken
title_full Sensitivity analysis based on the random forest machine learning algorithm identifies candidate genes for regulation of innate and adaptive immune response of chicken
title_fullStr Sensitivity analysis based on the random forest machine learning algorithm identifies candidate genes for regulation of innate and adaptive immune response of chicken
title_full_unstemmed Sensitivity analysis based on the random forest machine learning algorithm identifies candidate genes for regulation of innate and adaptive immune response of chicken
title_short Sensitivity analysis based on the random forest machine learning algorithm identifies candidate genes for regulation of innate and adaptive immune response of chicken
title_sort sensitivity analysis based on the random forest machine learning algorithm identifies candidate genes for regulation of innate and adaptive immune response of chicken
topic Genetics and Molecular Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7704721/
https://www.ncbi.nlm.nih.gov/pubmed/33248550
http://dx.doi.org/10.1016/j.psj.2020.08.059
work_keys_str_mv AT polewkoklimaneta sensitivityanalysisbasedontherandomforestmachinelearningalgorithmidentifiescandidategenesforregulationofinnateandadaptiveimmuneresponseofchicken
AT lesinskiwojciech sensitivityanalysisbasedontherandomforestmachinelearningalgorithmidentifiescandidategenesforregulationofinnateandadaptiveimmuneresponseofchicken
AT golinskaagnieszkakitlas sensitivityanalysisbasedontherandomforestmachinelearningalgorithmidentifiescandidategenesforregulationofinnateandadaptiveimmuneresponseofchicken
AT mnichkrzysztof sensitivityanalysisbasedontherandomforestmachinelearningalgorithmidentifiescandidategenesforregulationofinnateandadaptiveimmuneresponseofchicken
AT siwekmaria sensitivityanalysisbasedontherandomforestmachinelearningalgorithmidentifiescandidategenesforregulationofinnateandadaptiveimmuneresponseofchicken
AT rudnickiwitoldr sensitivityanalysisbasedontherandomforestmachinelearningalgorithmidentifiescandidategenesforregulationofinnateandadaptiveimmuneresponseofchicken