Cargando…
MultiGWAS: An integrative tool for Genome Wide Association Studies in tetraploid organisms
The genome‐wide association studies (GWASs) are essential to determine the genetic bases of either ecological or economic phenotypic variation across individuals within populations of the model and nonmodel organisms. For this research question, the GWAS replication testing different parameters and...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8216910/ https://www.ncbi.nlm.nih.gov/pubmed/34188823 http://dx.doi.org/10.1002/ece3.7572 |
_version_ | 1783710516164689920 |
---|---|
author | Garreta, Luis Cerón‐Souza, Ivania Palacio, Manfred Ricardo Reyes‐Herrera, Paula H. |
author_facet | Garreta, Luis Cerón‐Souza, Ivania Palacio, Manfred Ricardo Reyes‐Herrera, Paula H. |
author_sort | Garreta, Luis |
collection | PubMed |
description | The genome‐wide association studies (GWASs) are essential to determine the genetic bases of either ecological or economic phenotypic variation across individuals within populations of the model and nonmodel organisms. For this research question, the GWAS replication testing different parameters and models to validate the results' reproducibility is common. However, straightforward methodologies that manage both replication and tetraploid data are still missing. To solve this problem, we designed the MultiGWAS, a tool that does GWAS for diploid and tetraploid organisms by executing in parallel four software packages, two designed for polyploid data (GWASpoly and SHEsis) and two designed for diploid data (GAPIT and TASSEL). MultiGWAS has several advantages. It runs either in the command line or in a graphical interface; it manages different genotype formats, including VCF. Moreover, it allows control for population structure, relatedness, and several quality control checks on genotype data. Besides, MultiGWAS can test for additive and dominant gene action models, and, through a proprietary scoring function, select the best model to report its associations. Finally, it generates several reports that facilitate identifying false associations from both the significant and the best‐ranked association Single Nucleotide Polymorphisms (SNPs) among the four software packages. We tested MultiGWAS with public tetraploid potato data for tuber shape and several simulated data under both additive and dominant models. These tests demonstrated that MultiGWAS is better at detecting reliable associations than using each of the four software packages individually. Moreover, the parallel analysis of polyploid and diploid software that only offers MultiGWAS demonstrates its utility in understanding the best genetic model behind the SNP association in tetraploid organisms. Therefore, MultiGWAS probed to be an excellent alternative for wrapping GWAS replication in diploid and tetraploid organisms in a single analysis environment. |
format | Online Article Text |
id | pubmed-8216910 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-82169102021-06-28 MultiGWAS: An integrative tool for Genome Wide Association Studies in tetraploid organisms Garreta, Luis Cerón‐Souza, Ivania Palacio, Manfred Ricardo Reyes‐Herrera, Paula H. Ecol Evol Original Research The genome‐wide association studies (GWASs) are essential to determine the genetic bases of either ecological or economic phenotypic variation across individuals within populations of the model and nonmodel organisms. For this research question, the GWAS replication testing different parameters and models to validate the results' reproducibility is common. However, straightforward methodologies that manage both replication and tetraploid data are still missing. To solve this problem, we designed the MultiGWAS, a tool that does GWAS for diploid and tetraploid organisms by executing in parallel four software packages, two designed for polyploid data (GWASpoly and SHEsis) and two designed for diploid data (GAPIT and TASSEL). MultiGWAS has several advantages. It runs either in the command line or in a graphical interface; it manages different genotype formats, including VCF. Moreover, it allows control for population structure, relatedness, and several quality control checks on genotype data. Besides, MultiGWAS can test for additive and dominant gene action models, and, through a proprietary scoring function, select the best model to report its associations. Finally, it generates several reports that facilitate identifying false associations from both the significant and the best‐ranked association Single Nucleotide Polymorphisms (SNPs) among the four software packages. We tested MultiGWAS with public tetraploid potato data for tuber shape and several simulated data under both additive and dominant models. These tests demonstrated that MultiGWAS is better at detecting reliable associations than using each of the four software packages individually. Moreover, the parallel analysis of polyploid and diploid software that only offers MultiGWAS demonstrates its utility in understanding the best genetic model behind the SNP association in tetraploid organisms. Therefore, MultiGWAS probed to be an excellent alternative for wrapping GWAS replication in diploid and tetraploid organisms in a single analysis environment. John Wiley and Sons Inc. 2021-05-12 /pmc/articles/PMC8216910/ /pubmed/34188823 http://dx.doi.org/10.1002/ece3.7572 Text en © 2021 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Research Garreta, Luis Cerón‐Souza, Ivania Palacio, Manfred Ricardo Reyes‐Herrera, Paula H. MultiGWAS: An integrative tool for Genome Wide Association Studies in tetraploid organisms |
title | MultiGWAS: An integrative tool for Genome Wide Association Studies in tetraploid organisms |
title_full | MultiGWAS: An integrative tool for Genome Wide Association Studies in tetraploid organisms |
title_fullStr | MultiGWAS: An integrative tool for Genome Wide Association Studies in tetraploid organisms |
title_full_unstemmed | MultiGWAS: An integrative tool for Genome Wide Association Studies in tetraploid organisms |
title_short | MultiGWAS: An integrative tool for Genome Wide Association Studies in tetraploid organisms |
title_sort | multigwas: an integrative tool for genome wide association studies in tetraploid organisms |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8216910/ https://www.ncbi.nlm.nih.gov/pubmed/34188823 http://dx.doi.org/10.1002/ece3.7572 |
work_keys_str_mv | AT garretaluis multigwasanintegrativetoolforgenomewideassociationstudiesintetraploidorganisms AT ceronsouzaivania multigwasanintegrativetoolforgenomewideassociationstudiesintetraploidorganisms AT palaciomanfredricardo multigwasanintegrativetoolforgenomewideassociationstudiesintetraploidorganisms AT reyesherrerapaulah multigwasanintegrativetoolforgenomewideassociationstudiesintetraploidorganisms |