Cargando…

GEP-EpiSeeker: a gene expression programming-based method for epistatic interaction detection in genome-wide association studies

BACKGROUND: Identification of epistatic interactions provides a systematic way for exploring associations among different single nucleotide polymorphism (SNP) and complex diseases. Although considerable progress has been made in epistasis detection, efficiently and accurately identifying epistatic i...

Descripción completa

Detalles Bibliográficos
Autores principales: Peng, Yu Zhong, Lin, Yanmei, Huang, Yiran, Li, Ying, Luo, Guangsheng, Liao, Jianping
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8686218/
https://www.ncbi.nlm.nih.gov/pubmed/34930147
http://dx.doi.org/10.1186/s12864-021-08207-8
_version_ 1784617971484721152
author Peng, Yu Zhong
Lin, Yanmei
Huang, Yiran
Li, Ying
Luo, Guangsheng
Liao, Jianping
author_facet Peng, Yu Zhong
Lin, Yanmei
Huang, Yiran
Li, Ying
Luo, Guangsheng
Liao, Jianping
author_sort Peng, Yu Zhong
collection PubMed
description BACKGROUND: Identification of epistatic interactions provides a systematic way for exploring associations among different single nucleotide polymorphism (SNP) and complex diseases. Although considerable progress has been made in epistasis detection, efficiently and accurately identifying epistatic interactions remains a challenge due to the intensive growth of measuring SNP combinations. RESULTS: In this work, we formulate the detection of epistatic interactions by a combinational optimization problem, and propose a novel evolutionary-based framework, called GEP-EpiSeeker, to detect epistatic interactions using Gene Expression Programming. In GEP-EpiSeeker, we propose several tailor-made chromosome rules to describe SNP combinations, and incorporate Bayesian network-based fitness evaluation into the evolution of tailor-made chromosomes to find suspected SNP combinations, and adopt the Chi-square test to identify optimal solutions from suspected SNP combinations. Moreover, to improve the convergence and accuracy of the algorithm, we design two genetic operators with multiple and adjacent mutations and an adaptive genetic manipulation method with fuzzy control to efficiently manipulate the evolution of tailor-made chromosomes. We compared GEP-EpiSeeker with state-of-the-art methods including BEAM, BOOST, AntEpiSeeker, MACOED, and EACO in terms of power, recall, precision and F1-score on the GWAS datasets of 12 DME disease models and 10 DNME disease models. Our experimental results show that GEP-EpiSeeker outperforms comparative methods. CONCLUSIONS: Here we presented a novel method named GEP-EpiSeeker, based on the Gene Expression Programming algorithm, to identify epistatic interactions in Genome-wide Association Studies. The results indicate that GEP-EpiSeeker could be a promising alternative to the existing methods in epistasis detection and will provide a new way for accurately identifying epistasis.
format Online
Article
Text
id pubmed-8686218
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-86862182021-12-20 GEP-EpiSeeker: a gene expression programming-based method for epistatic interaction detection in genome-wide association studies Peng, Yu Zhong Lin, Yanmei Huang, Yiran Li, Ying Luo, Guangsheng Liao, Jianping BMC Genomics Research BACKGROUND: Identification of epistatic interactions provides a systematic way for exploring associations among different single nucleotide polymorphism (SNP) and complex diseases. Although considerable progress has been made in epistasis detection, efficiently and accurately identifying epistatic interactions remains a challenge due to the intensive growth of measuring SNP combinations. RESULTS: In this work, we formulate the detection of epistatic interactions by a combinational optimization problem, and propose a novel evolutionary-based framework, called GEP-EpiSeeker, to detect epistatic interactions using Gene Expression Programming. In GEP-EpiSeeker, we propose several tailor-made chromosome rules to describe SNP combinations, and incorporate Bayesian network-based fitness evaluation into the evolution of tailor-made chromosomes to find suspected SNP combinations, and adopt the Chi-square test to identify optimal solutions from suspected SNP combinations. Moreover, to improve the convergence and accuracy of the algorithm, we design two genetic operators with multiple and adjacent mutations and an adaptive genetic manipulation method with fuzzy control to efficiently manipulate the evolution of tailor-made chromosomes. We compared GEP-EpiSeeker with state-of-the-art methods including BEAM, BOOST, AntEpiSeeker, MACOED, and EACO in terms of power, recall, precision and F1-score on the GWAS datasets of 12 DME disease models and 10 DNME disease models. Our experimental results show that GEP-EpiSeeker outperforms comparative methods. CONCLUSIONS: Here we presented a novel method named GEP-EpiSeeker, based on the Gene Expression Programming algorithm, to identify epistatic interactions in Genome-wide Association Studies. The results indicate that GEP-EpiSeeker could be a promising alternative to the existing methods in epistasis detection and will provide a new way for accurately identifying epistasis. BioMed Central 2021-12-20 /pmc/articles/PMC8686218/ /pubmed/34930147 http://dx.doi.org/10.1186/s12864-021-08207-8 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Peng, Yu Zhong
Lin, Yanmei
Huang, Yiran
Li, Ying
Luo, Guangsheng
Liao, Jianping
GEP-EpiSeeker: a gene expression programming-based method for epistatic interaction detection in genome-wide association studies
title GEP-EpiSeeker: a gene expression programming-based method for epistatic interaction detection in genome-wide association studies
title_full GEP-EpiSeeker: a gene expression programming-based method for epistatic interaction detection in genome-wide association studies
title_fullStr GEP-EpiSeeker: a gene expression programming-based method for epistatic interaction detection in genome-wide association studies
title_full_unstemmed GEP-EpiSeeker: a gene expression programming-based method for epistatic interaction detection in genome-wide association studies
title_short GEP-EpiSeeker: a gene expression programming-based method for epistatic interaction detection in genome-wide association studies
title_sort gep-episeeker: a gene expression programming-based method for epistatic interaction detection in genome-wide association studies
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8686218/
https://www.ncbi.nlm.nih.gov/pubmed/34930147
http://dx.doi.org/10.1186/s12864-021-08207-8
work_keys_str_mv AT pengyuzhong gepepiseekerageneexpressionprogrammingbasedmethodforepistaticinteractiondetectioningenomewideassociationstudies
AT linyanmei gepepiseekerageneexpressionprogrammingbasedmethodforepistaticinteractiondetectioningenomewideassociationstudies
AT huangyiran gepepiseekerageneexpressionprogrammingbasedmethodforepistaticinteractiondetectioningenomewideassociationstudies
AT liying gepepiseekerageneexpressionprogrammingbasedmethodforepistaticinteractiondetectioningenomewideassociationstudies
AT luoguangsheng gepepiseekerageneexpressionprogrammingbasedmethodforepistaticinteractiondetectioningenomewideassociationstudies
AT liaojianping gepepiseekerageneexpressionprogrammingbasedmethodforepistaticinteractiondetectioningenomewideassociationstudies