Cargando…
GEP-EpiSeeker: a gene expression programming-based method for epistatic interaction detection in genome-wide association studies
BACKGROUND: Identification of epistatic interactions provides a systematic way for exploring associations among different single nucleotide polymorphism (SNP) and complex diseases. Although considerable progress has been made in epistasis detection, efficiently and accurately identifying epistatic i...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8686218/ https://www.ncbi.nlm.nih.gov/pubmed/34930147 http://dx.doi.org/10.1186/s12864-021-08207-8 |
_version_ | 1784617971484721152 |
---|---|
author | Peng, Yu Zhong Lin, Yanmei Huang, Yiran Li, Ying Luo, Guangsheng Liao, Jianping |
author_facet | Peng, Yu Zhong Lin, Yanmei Huang, Yiran Li, Ying Luo, Guangsheng Liao, Jianping |
author_sort | Peng, Yu Zhong |
collection | PubMed |
description | BACKGROUND: Identification of epistatic interactions provides a systematic way for exploring associations among different single nucleotide polymorphism (SNP) and complex diseases. Although considerable progress has been made in epistasis detection, efficiently and accurately identifying epistatic interactions remains a challenge due to the intensive growth of measuring SNP combinations. RESULTS: In this work, we formulate the detection of epistatic interactions by a combinational optimization problem, and propose a novel evolutionary-based framework, called GEP-EpiSeeker, to detect epistatic interactions using Gene Expression Programming. In GEP-EpiSeeker, we propose several tailor-made chromosome rules to describe SNP combinations, and incorporate Bayesian network-based fitness evaluation into the evolution of tailor-made chromosomes to find suspected SNP combinations, and adopt the Chi-square test to identify optimal solutions from suspected SNP combinations. Moreover, to improve the convergence and accuracy of the algorithm, we design two genetic operators with multiple and adjacent mutations and an adaptive genetic manipulation method with fuzzy control to efficiently manipulate the evolution of tailor-made chromosomes. We compared GEP-EpiSeeker with state-of-the-art methods including BEAM, BOOST, AntEpiSeeker, MACOED, and EACO in terms of power, recall, precision and F1-score on the GWAS datasets of 12 DME disease models and 10 DNME disease models. Our experimental results show that GEP-EpiSeeker outperforms comparative methods. CONCLUSIONS: Here we presented a novel method named GEP-EpiSeeker, based on the Gene Expression Programming algorithm, to identify epistatic interactions in Genome-wide Association Studies. The results indicate that GEP-EpiSeeker could be a promising alternative to the existing methods in epistasis detection and will provide a new way for accurately identifying epistasis. |
format | Online Article Text |
id | pubmed-8686218 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-86862182021-12-20 GEP-EpiSeeker: a gene expression programming-based method for epistatic interaction detection in genome-wide association studies Peng, Yu Zhong Lin, Yanmei Huang, Yiran Li, Ying Luo, Guangsheng Liao, Jianping BMC Genomics Research BACKGROUND: Identification of epistatic interactions provides a systematic way for exploring associations among different single nucleotide polymorphism (SNP) and complex diseases. Although considerable progress has been made in epistasis detection, efficiently and accurately identifying epistatic interactions remains a challenge due to the intensive growth of measuring SNP combinations. RESULTS: In this work, we formulate the detection of epistatic interactions by a combinational optimization problem, and propose a novel evolutionary-based framework, called GEP-EpiSeeker, to detect epistatic interactions using Gene Expression Programming. In GEP-EpiSeeker, we propose several tailor-made chromosome rules to describe SNP combinations, and incorporate Bayesian network-based fitness evaluation into the evolution of tailor-made chromosomes to find suspected SNP combinations, and adopt the Chi-square test to identify optimal solutions from suspected SNP combinations. Moreover, to improve the convergence and accuracy of the algorithm, we design two genetic operators with multiple and adjacent mutations and an adaptive genetic manipulation method with fuzzy control to efficiently manipulate the evolution of tailor-made chromosomes. We compared GEP-EpiSeeker with state-of-the-art methods including BEAM, BOOST, AntEpiSeeker, MACOED, and EACO in terms of power, recall, precision and F1-score on the GWAS datasets of 12 DME disease models and 10 DNME disease models. Our experimental results show that GEP-EpiSeeker outperforms comparative methods. CONCLUSIONS: Here we presented a novel method named GEP-EpiSeeker, based on the Gene Expression Programming algorithm, to identify epistatic interactions in Genome-wide Association Studies. The results indicate that GEP-EpiSeeker could be a promising alternative to the existing methods in epistasis detection and will provide a new way for accurately identifying epistasis. BioMed Central 2021-12-20 /pmc/articles/PMC8686218/ /pubmed/34930147 http://dx.doi.org/10.1186/s12864-021-08207-8 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Peng, Yu Zhong Lin, Yanmei Huang, Yiran Li, Ying Luo, Guangsheng Liao, Jianping GEP-EpiSeeker: a gene expression programming-based method for epistatic interaction detection in genome-wide association studies |
title | GEP-EpiSeeker: a gene expression programming-based method for epistatic interaction detection in genome-wide association studies |
title_full | GEP-EpiSeeker: a gene expression programming-based method for epistatic interaction detection in genome-wide association studies |
title_fullStr | GEP-EpiSeeker: a gene expression programming-based method for epistatic interaction detection in genome-wide association studies |
title_full_unstemmed | GEP-EpiSeeker: a gene expression programming-based method for epistatic interaction detection in genome-wide association studies |
title_short | GEP-EpiSeeker: a gene expression programming-based method for epistatic interaction detection in genome-wide association studies |
title_sort | gep-episeeker: a gene expression programming-based method for epistatic interaction detection in genome-wide association studies |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8686218/ https://www.ncbi.nlm.nih.gov/pubmed/34930147 http://dx.doi.org/10.1186/s12864-021-08207-8 |
work_keys_str_mv | AT pengyuzhong gepepiseekerageneexpressionprogrammingbasedmethodforepistaticinteractiondetectioningenomewideassociationstudies AT linyanmei gepepiseekerageneexpressionprogrammingbasedmethodforepistaticinteractiondetectioningenomewideassociationstudies AT huangyiran gepepiseekerageneexpressionprogrammingbasedmethodforepistaticinteractiondetectioningenomewideassociationstudies AT liying gepepiseekerageneexpressionprogrammingbasedmethodforepistaticinteractiondetectioningenomewideassociationstudies AT luoguangsheng gepepiseekerageneexpressionprogrammingbasedmethodforepistaticinteractiondetectioningenomewideassociationstudies AT liaojianping gepepiseekerageneexpressionprogrammingbasedmethodforepistaticinteractiondetectioningenomewideassociationstudies |