Cargando…

MultiGWAS: An integrative tool for Genome Wide Association Studies in tetraploid organisms

The genome‐wide association studies (GWASs) are essential to determine the genetic bases of either ecological or economic phenotypic variation across individuals within populations of the model and nonmodel organisms. For this research question, the GWAS replication testing different parameters and...

Descripción completa

Detalles Bibliográficos
Autores principales: Garreta, Luis, Cerón‐Souza, Ivania, Palacio, Manfred Ricardo, Reyes‐Herrera, Paula H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8216910/
https://www.ncbi.nlm.nih.gov/pubmed/34188823
http://dx.doi.org/10.1002/ece3.7572
_version_ 1783710516164689920
author Garreta, Luis
Cerón‐Souza, Ivania
Palacio, Manfred Ricardo
Reyes‐Herrera, Paula H.
author_facet Garreta, Luis
Cerón‐Souza, Ivania
Palacio, Manfred Ricardo
Reyes‐Herrera, Paula H.
author_sort Garreta, Luis
collection PubMed
description The genome‐wide association studies (GWASs) are essential to determine the genetic bases of either ecological or economic phenotypic variation across individuals within populations of the model and nonmodel organisms. For this research question, the GWAS replication testing different parameters and models to validate the results' reproducibility is common. However, straightforward methodologies that manage both replication and tetraploid data are still missing. To solve this problem, we designed the MultiGWAS, a tool that does GWAS for diploid and tetraploid organisms by executing in parallel four software packages, two designed for polyploid data (GWASpoly and SHEsis) and two designed for diploid data (GAPIT and TASSEL). MultiGWAS has several advantages. It runs either in the command line or in a graphical interface; it manages different genotype formats, including VCF. Moreover, it allows control for population structure, relatedness, and several quality control checks on genotype data. Besides, MultiGWAS can test for additive and dominant gene action models, and, through a proprietary scoring function, select the best model to report its associations. Finally, it generates several reports that facilitate identifying false associations from both the significant and the best‐ranked association Single Nucleotide Polymorphisms (SNPs) among the four software packages. We tested MultiGWAS with public tetraploid potato data for tuber shape and several simulated data under both additive and dominant models. These tests demonstrated that MultiGWAS is better at detecting reliable associations than using each of the four software packages individually. Moreover, the parallel analysis of polyploid and diploid software that only offers MultiGWAS demonstrates its utility in understanding the best genetic model behind the SNP association in tetraploid organisms. Therefore, MultiGWAS probed to be an excellent alternative for wrapping GWAS replication in diploid and tetraploid organisms in a single analysis environment.
format Online
Article
Text
id pubmed-8216910
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-82169102021-06-28 MultiGWAS: An integrative tool for Genome Wide Association Studies in tetraploid organisms Garreta, Luis Cerón‐Souza, Ivania Palacio, Manfred Ricardo Reyes‐Herrera, Paula H. Ecol Evol Original Research The genome‐wide association studies (GWASs) are essential to determine the genetic bases of either ecological or economic phenotypic variation across individuals within populations of the model and nonmodel organisms. For this research question, the GWAS replication testing different parameters and models to validate the results' reproducibility is common. However, straightforward methodologies that manage both replication and tetraploid data are still missing. To solve this problem, we designed the MultiGWAS, a tool that does GWAS for diploid and tetraploid organisms by executing in parallel four software packages, two designed for polyploid data (GWASpoly and SHEsis) and two designed for diploid data (GAPIT and TASSEL). MultiGWAS has several advantages. It runs either in the command line or in a graphical interface; it manages different genotype formats, including VCF. Moreover, it allows control for population structure, relatedness, and several quality control checks on genotype data. Besides, MultiGWAS can test for additive and dominant gene action models, and, through a proprietary scoring function, select the best model to report its associations. Finally, it generates several reports that facilitate identifying false associations from both the significant and the best‐ranked association Single Nucleotide Polymorphisms (SNPs) among the four software packages. We tested MultiGWAS with public tetraploid potato data for tuber shape and several simulated data under both additive and dominant models. These tests demonstrated that MultiGWAS is better at detecting reliable associations than using each of the four software packages individually. Moreover, the parallel analysis of polyploid and diploid software that only offers MultiGWAS demonstrates its utility in understanding the best genetic model behind the SNP association in tetraploid organisms. Therefore, MultiGWAS probed to be an excellent alternative for wrapping GWAS replication in diploid and tetraploid organisms in a single analysis environment. John Wiley and Sons Inc. 2021-05-12 /pmc/articles/PMC8216910/ /pubmed/34188823 http://dx.doi.org/10.1002/ece3.7572 Text en © 2021 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Research
Garreta, Luis
Cerón‐Souza, Ivania
Palacio, Manfred Ricardo
Reyes‐Herrera, Paula H.
MultiGWAS: An integrative tool for Genome Wide Association Studies in tetraploid organisms
title MultiGWAS: An integrative tool for Genome Wide Association Studies in tetraploid organisms
title_full MultiGWAS: An integrative tool for Genome Wide Association Studies in tetraploid organisms
title_fullStr MultiGWAS: An integrative tool for Genome Wide Association Studies in tetraploid organisms
title_full_unstemmed MultiGWAS: An integrative tool for Genome Wide Association Studies in tetraploid organisms
title_short MultiGWAS: An integrative tool for Genome Wide Association Studies in tetraploid organisms
title_sort multigwas: an integrative tool for genome wide association studies in tetraploid organisms
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8216910/
https://www.ncbi.nlm.nih.gov/pubmed/34188823
http://dx.doi.org/10.1002/ece3.7572
work_keys_str_mv AT garretaluis multigwasanintegrativetoolforgenomewideassociationstudiesintetraploidorganisms
AT ceronsouzaivania multigwasanintegrativetoolforgenomewideassociationstudiesintetraploidorganisms
AT palaciomanfredricardo multigwasanintegrativetoolforgenomewideassociationstudiesintetraploidorganisms
AT reyesherrerapaulah multigwasanintegrativetoolforgenomewideassociationstudiesintetraploidorganisms