Cargando…

Robustification of GWAS to explore effective SNPs addressing the challenges of hidden population stratification and polygenic effects

Genome-wide association studies (GWAS) play a vital role in identifying important genes those is associated with the phenotypic variations of living organisms. There are several statistical methods for GWAS including the linear mixed model (LMM) which is popular for addressing the challenges of hidd...

Descripción completa

Detalles Bibliográficos
Autores principales: Akond, Zobaer, Ahsan, Md. Asif, Alam, Munirul, Mollah, Md. Nurul Haque
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8219685/
https://www.ncbi.nlm.nih.gov/pubmed/34158546
http://dx.doi.org/10.1038/s41598-021-90774-7
_version_ 1783710988410814464
author Akond, Zobaer
Ahsan, Md. Asif
Alam, Munirul
Mollah, Md. Nurul Haque
author_facet Akond, Zobaer
Ahsan, Md. Asif
Alam, Munirul
Mollah, Md. Nurul Haque
author_sort Akond, Zobaer
collection PubMed
description Genome-wide association studies (GWAS) play a vital role in identifying important genes those is associated with the phenotypic variations of living organisms. There are several statistical methods for GWAS including the linear mixed model (LMM) which is popular for addressing the challenges of hidden population stratification and polygenic effects. However, most of these methods including LMM are sensitive to phenotypic outliers that may lead the misleading results. To overcome this problem, in this paper, we proposed a way to robustify the LMM approach for reducing the influence of outlying observations using the β-divergence method. The performance of the proposed method was investigated using both synthetic and real data analysis. Simulation results showed that the proposed method performs better than both linear regression model (LRM) and LMM approaches in terms of powers and false discovery rates in presence of phenotypic outliers. On the other hand, the proposed method performed almost similar to LMM approach but much better than LRM approach in absence of outliers. In the case of real data analysis, our proposed method identified 11 SNPs that are significantly associated with the rice flowering time. Among the identified candidate SNPs, some were involved in seed development and flowering time pathways, and some were connected with flower and other developmental processes. These identified candidate SNPs could assist rice breeding programs effectively. Thus, our findings highlighted the importance of robust GWAS in identifying candidate genes.
format Online
Article
Text
id pubmed-8219685
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-82196852021-06-24 Robustification of GWAS to explore effective SNPs addressing the challenges of hidden population stratification and polygenic effects Akond, Zobaer Ahsan, Md. Asif Alam, Munirul Mollah, Md. Nurul Haque Sci Rep Article Genome-wide association studies (GWAS) play a vital role in identifying important genes those is associated with the phenotypic variations of living organisms. There are several statistical methods for GWAS including the linear mixed model (LMM) which is popular for addressing the challenges of hidden population stratification and polygenic effects. However, most of these methods including LMM are sensitive to phenotypic outliers that may lead the misleading results. To overcome this problem, in this paper, we proposed a way to robustify the LMM approach for reducing the influence of outlying observations using the β-divergence method. The performance of the proposed method was investigated using both synthetic and real data analysis. Simulation results showed that the proposed method performs better than both linear regression model (LRM) and LMM approaches in terms of powers and false discovery rates in presence of phenotypic outliers. On the other hand, the proposed method performed almost similar to LMM approach but much better than LRM approach in absence of outliers. In the case of real data analysis, our proposed method identified 11 SNPs that are significantly associated with the rice flowering time. Among the identified candidate SNPs, some were involved in seed development and flowering time pathways, and some were connected with flower and other developmental processes. These identified candidate SNPs could assist rice breeding programs effectively. Thus, our findings highlighted the importance of robust GWAS in identifying candidate genes. Nature Publishing Group UK 2021-06-22 /pmc/articles/PMC8219685/ /pubmed/34158546 http://dx.doi.org/10.1038/s41598-021-90774-7 Text en © The Author(s) 2021, corrected publication 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Akond, Zobaer
Ahsan, Md. Asif
Alam, Munirul
Mollah, Md. Nurul Haque
Robustification of GWAS to explore effective SNPs addressing the challenges of hidden population stratification and polygenic effects
title Robustification of GWAS to explore effective SNPs addressing the challenges of hidden population stratification and polygenic effects
title_full Robustification of GWAS to explore effective SNPs addressing the challenges of hidden population stratification and polygenic effects
title_fullStr Robustification of GWAS to explore effective SNPs addressing the challenges of hidden population stratification and polygenic effects
title_full_unstemmed Robustification of GWAS to explore effective SNPs addressing the challenges of hidden population stratification and polygenic effects
title_short Robustification of GWAS to explore effective SNPs addressing the challenges of hidden population stratification and polygenic effects
title_sort robustification of gwas to explore effective snps addressing the challenges of hidden population stratification and polygenic effects
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8219685/
https://www.ncbi.nlm.nih.gov/pubmed/34158546
http://dx.doi.org/10.1038/s41598-021-90774-7
work_keys_str_mv AT akondzobaer robustificationofgwastoexploreeffectivesnpsaddressingthechallengesofhiddenpopulationstratificationandpolygeniceffects
AT ahsanmdasif robustificationofgwastoexploreeffectivesnpsaddressingthechallengesofhiddenpopulationstratificationandpolygeniceffects
AT alammunirul robustificationofgwastoexploreeffectivesnpsaddressingthechallengesofhiddenpopulationstratificationandpolygeniceffects
AT mollahmdnurulhaque robustificationofgwastoexploreeffectivesnpsaddressingthechallengesofhiddenpopulationstratificationandpolygeniceffects