Multiobjective grammar-based genetic programming applied to the study of asthma and allergy epidemiology

BACKGROUND: The prevalence of asthma and allergies has increased in recent decades, making them a serious global health problem. They are complex diseases with strong contextual influence, so advanced machine learning tools such as genetic programming could be important for understanding the caus...

Bibliographic Details
Main Authors: Veiga, Rafael V., Barbosa, Helio J. C., Bernardino, Heder S., Freitas, João M., Feitosa, Caroline A., Matos, Sheila M. A., Alcântara-Neves, Neuza M., Barreto, Maurício L.
Format: Online Article Text
Language: English
Published: BioMed Central 2018
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6047363/
https://www.ncbi.nlm.nih.gov/pubmed/29940834
http://dx.doi.org/10.1186/s12859-018-2233-z
author Veiga, Rafael V.
Barbosa, Helio J. C.
Bernardino, Heder S.
Freitas, João M.
Feitosa, Caroline A.
Matos, Sheila M. A.
Alcântara-Neves, Neuza M.
Barreto, Maurício L.
collection PubMed
description BACKGROUND: The prevalence of asthma and allergies has increased in recent decades, making them a serious global health problem. They are complex diseases with strong contextual influence, so advanced machine learning tools such as genetic programming could be important for understanding the causal mechanisms that explain these conditions. Here, we applied multiobjective grammar-based genetic programming (MGGP) to a dataset of 1047 subjects. The dataset contains information on environmental, psychosocial, socioeconomic, nutritional and infectious factors collected from participating children. The objective of this work is to generate models that explain the occurrence of asthma and of two markers of allergy: the presence of IgE antibody against common allergens, and skin prick test (SPT) positivity for common allergens. RESULTS: The average accuracy of the models for asthma was higher with MGGP than with C4.5. For IgE, it was higher with MGGP than with both logistic regression and C4.5. MGGP reached accuracy levels similar to those of random forest (RF) but, unlike RF, was able to generate models that were easy to interpret. CONCLUSIONS: MGGP has shown that infections and psychosocial, nutritional, hygiene, and socioeconomic factors may be related in such an intricate way that the relationships could hardly be detected using traditional regression-based epidemiological techniques. The MGGP algorithm was implemented in C++ and is available in the repository: http://bitbucket.org/ciml-ufjf/ciml-lib. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2233-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6047363
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-6047363 2018-07-19 BMC Bioinformatics Methodology Article
BioMed Central 2018-06-26 /pmc/articles/PMC6047363/ /pubmed/29940834 http://dx.doi.org/10.1186/s12859-018-2233-z Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title Multiobjective grammar-based genetic programming applied to the study of asthma and allergy epidemiology
topic Methodology Article