Cargando…
Robustification of GWAS to explore effective SNPs addressing the challenges of hidden population stratification and polygenic effects
Genome-wide association studies (GWAS) play a vital role in identifying important genes those is associated with the phenotypic variations of living organisms. There are several statistical methods for GWAS including the linear mixed model (LMM) which is popular for addressing the challenges of hidd...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8219685/ https://www.ncbi.nlm.nih.gov/pubmed/34158546 http://dx.doi.org/10.1038/s41598-021-90774-7 |
_version_ | 1783710988410814464 |
---|---|
author | Akond, Zobaer Ahsan, Md. Asif Alam, Munirul Mollah, Md. Nurul Haque |
author_facet | Akond, Zobaer Ahsan, Md. Asif Alam, Munirul Mollah, Md. Nurul Haque |
author_sort | Akond, Zobaer |
collection | PubMed |
description | Genome-wide association studies (GWAS) play a vital role in identifying important genes those is associated with the phenotypic variations of living organisms. There are several statistical methods for GWAS including the linear mixed model (LMM) which is popular for addressing the challenges of hidden population stratification and polygenic effects. However, most of these methods including LMM are sensitive to phenotypic outliers that may lead the misleading results. To overcome this problem, in this paper, we proposed a way to robustify the LMM approach for reducing the influence of outlying observations using the β-divergence method. The performance of the proposed method was investigated using both synthetic and real data analysis. Simulation results showed that the proposed method performs better than both linear regression model (LRM) and LMM approaches in terms of powers and false discovery rates in presence of phenotypic outliers. On the other hand, the proposed method performed almost similar to LMM approach but much better than LRM approach in absence of outliers. In the case of real data analysis, our proposed method identified 11 SNPs that are significantly associated with the rice flowering time. Among the identified candidate SNPs, some were involved in seed development and flowering time pathways, and some were connected with flower and other developmental processes. These identified candidate SNPs could assist rice breeding programs effectively. Thus, our findings highlighted the importance of robust GWAS in identifying candidate genes. |
format | Online Article Text |
id | pubmed-8219685 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-82196852021-06-24 Robustification of GWAS to explore effective SNPs addressing the challenges of hidden population stratification and polygenic effects Akond, Zobaer Ahsan, Md. Asif Alam, Munirul Mollah, Md. Nurul Haque Sci Rep Article Genome-wide association studies (GWAS) play a vital role in identifying important genes those is associated with the phenotypic variations of living organisms. There are several statistical methods for GWAS including the linear mixed model (LMM) which is popular for addressing the challenges of hidden population stratification and polygenic effects. However, most of these methods including LMM are sensitive to phenotypic outliers that may lead the misleading results. To overcome this problem, in this paper, we proposed a way to robustify the LMM approach for reducing the influence of outlying observations using the β-divergence method. The performance of the proposed method was investigated using both synthetic and real data analysis. Simulation results showed that the proposed method performs better than both linear regression model (LRM) and LMM approaches in terms of powers and false discovery rates in presence of phenotypic outliers. On the other hand, the proposed method performed almost similar to LMM approach but much better than LRM approach in absence of outliers. In the case of real data analysis, our proposed method identified 11 SNPs that are significantly associated with the rice flowering time. Among the identified candidate SNPs, some were involved in seed development and flowering time pathways, and some were connected with flower and other developmental processes. These identified candidate SNPs could assist rice breeding programs effectively. Thus, our findings highlighted the importance of robust GWAS in identifying candidate genes. Nature Publishing Group UK 2021-06-22 /pmc/articles/PMC8219685/ /pubmed/34158546 http://dx.doi.org/10.1038/s41598-021-90774-7 Text en © The Author(s) 2021, corrected publication 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Akond, Zobaer Ahsan, Md. Asif Alam, Munirul Mollah, Md. Nurul Haque Robustification of GWAS to explore effective SNPs addressing the challenges of hidden population stratification and polygenic effects |
title | Robustification of GWAS to explore effective SNPs addressing the challenges of hidden population stratification and polygenic effects |
title_full | Robustification of GWAS to explore effective SNPs addressing the challenges of hidden population stratification and polygenic effects |
title_fullStr | Robustification of GWAS to explore effective SNPs addressing the challenges of hidden population stratification and polygenic effects |
title_full_unstemmed | Robustification of GWAS to explore effective SNPs addressing the challenges of hidden population stratification and polygenic effects |
title_short | Robustification of GWAS to explore effective SNPs addressing the challenges of hidden population stratification and polygenic effects |
title_sort | robustification of gwas to explore effective snps addressing the challenges of hidden population stratification and polygenic effects |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8219685/ https://www.ncbi.nlm.nih.gov/pubmed/34158546 http://dx.doi.org/10.1038/s41598-021-90774-7 |
work_keys_str_mv | AT akondzobaer robustificationofgwastoexploreeffectivesnpsaddressingthechallengesofhiddenpopulationstratificationandpolygeniceffects AT ahsanmdasif robustificationofgwastoexploreeffectivesnpsaddressingthechallengesofhiddenpopulationstratificationandpolygeniceffects AT alammunirul robustificationofgwastoexploreeffectivesnpsaddressingthechallengesofhiddenpopulationstratificationandpolygeniceffects AT mollahmdnurulhaque robustificationofgwastoexploreeffectivesnpsaddressingthechallengesofhiddenpopulationstratificationandpolygeniceffects |