Cargando…
Leveraging Mann–Whitney U test on large-scale genetic variation data for analysing malaria genetic markers
BACKGROUND: The malaria risk analysis of multiple populations is crucial and of great importance whilst compressing limitations. However, the exponential growth in diversity and accumulation of genetic variation data obtained from malaria-infected patients through Genome-Wide Association Studies ope...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8905822/ https://www.ncbi.nlm.nih.gov/pubmed/35264165 http://dx.doi.org/10.1186/s12936-022-04104-x |
_version_ | 1784665273934020608 |
---|---|
author | Tai, Kah Yee Dhaliwal, Jasbir Balasubramaniam, Vinod |
author_facet | Tai, Kah Yee Dhaliwal, Jasbir Balasubramaniam, Vinod |
author_sort | Tai, Kah Yee |
collection | PubMed |
description | BACKGROUND: The malaria risk analysis of multiple populations is crucial and of great importance whilst compressing limitations. However, the exponential growth in diversity and accumulation of genetic variation data obtained from malaria-infected patients through Genome-Wide Association Studies opens up unprecedented opportunities to explore the significant differences between genetic markers (risk factors), particularly in the resistance or susceptibility of populations to malaria risk. Thus, this study proposes using statistical tests to analyse large-scale genetic variation data, comprising 20,854 samples from 11 populations within three continents: Africa, Oceania, and Asia. METHODS: Even though statistical tests have been utilized to conduct case–control studies since the 1950s to link risk factors to a particular disease, several challenges faced, including the choice of data (ordinal vs. non-ordinal) and test (parametric vs. non-parametric). This study overcomes these challenges by adopting the Mann–Whitney U test to analyse large-scale genetic variation data; to explore the statistical significance of markers between populations; and to further identify the highly differentiated markers. RESULTS: The findings of this study revealed a significant difference in the genetic markers between populations (p < 0.01) in all the case groups and most control groups. However, for the highly differentiated genetic markers, a significant difference (p < 0.01) was present for most genetic markers with varying p-values between the populations in the case and control groups. Moreover, several genetic markers were observed to have very significant differences (p < 0.001) across all populations, while others exist between certain specific populations. Also, several genetic markers have no significant differences between populations. CONCLUSIONS: These findings further support that the genetic markers contribute differently between populations towards malaria resistance or susceptibility, thus showing differences in the likelihood of malaria infection. In addition, this study demonstrated the robustness of the Mann–Whitney U test in analysing genetic markers in large-scale genetic variation data, thereby indicating an alternative method to explore genetic markers in other complex diseases. The findings hold great promise for genetic markers analysis, and the pipeline emphasized in this study can fully be reproduced to analyse new data. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12936-022-04104-x. |
format | Online Article Text |
id | pubmed-8905822 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-89058222022-03-18 Leveraging Mann–Whitney U test on large-scale genetic variation data for analysing malaria genetic markers Tai, Kah Yee Dhaliwal, Jasbir Balasubramaniam, Vinod Malar J Research BACKGROUND: The malaria risk analysis of multiple populations is crucial and of great importance whilst compressing limitations. However, the exponential growth in diversity and accumulation of genetic variation data obtained from malaria-infected patients through Genome-Wide Association Studies opens up unprecedented opportunities to explore the significant differences between genetic markers (risk factors), particularly in the resistance or susceptibility of populations to malaria risk. Thus, this study proposes using statistical tests to analyse large-scale genetic variation data, comprising 20,854 samples from 11 populations within three continents: Africa, Oceania, and Asia. METHODS: Even though statistical tests have been utilized to conduct case–control studies since the 1950s to link risk factors to a particular disease, several challenges faced, including the choice of data (ordinal vs. non-ordinal) and test (parametric vs. non-parametric). This study overcomes these challenges by adopting the Mann–Whitney U test to analyse large-scale genetic variation data; to explore the statistical significance of markers between populations; and to further identify the highly differentiated markers. RESULTS: The findings of this study revealed a significant difference in the genetic markers between populations (p < 0.01) in all the case groups and most control groups. However, for the highly differentiated genetic markers, a significant difference (p < 0.01) was present for most genetic markers with varying p-values between the populations in the case and control groups. Moreover, several genetic markers were observed to have very significant differences (p < 0.001) across all populations, while others exist between certain specific populations. Also, several genetic markers have no significant differences between populations. CONCLUSIONS: These findings further support that the genetic markers contribute differently between populations towards malaria resistance or susceptibility, thus showing differences in the likelihood of malaria infection. In addition, this study demonstrated the robustness of the Mann–Whitney U test in analysing genetic markers in large-scale genetic variation data, thereby indicating an alternative method to explore genetic markers in other complex diseases. The findings hold great promise for genetic markers analysis, and the pipeline emphasized in this study can fully be reproduced to analyse new data. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12936-022-04104-x. BioMed Central 2022-03-09 /pmc/articles/PMC8905822/ /pubmed/35264165 http://dx.doi.org/10.1186/s12936-022-04104-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Tai, Kah Yee Dhaliwal, Jasbir Balasubramaniam, Vinod Leveraging Mann–Whitney U test on large-scale genetic variation data for analysing malaria genetic markers |
title | Leveraging Mann–Whitney U test on large-scale genetic variation data for analysing malaria genetic markers |
title_full | Leveraging Mann–Whitney U test on large-scale genetic variation data for analysing malaria genetic markers |
title_fullStr | Leveraging Mann–Whitney U test on large-scale genetic variation data for analysing malaria genetic markers |
title_full_unstemmed | Leveraging Mann–Whitney U test on large-scale genetic variation data for analysing malaria genetic markers |
title_short | Leveraging Mann–Whitney U test on large-scale genetic variation data for analysing malaria genetic markers |
title_sort | leveraging mann–whitney u test on large-scale genetic variation data for analysing malaria genetic markers |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8905822/ https://www.ncbi.nlm.nih.gov/pubmed/35264165 http://dx.doi.org/10.1186/s12936-022-04104-x |
work_keys_str_mv | AT taikahyee leveragingmannwhitneyutestonlargescalegeneticvariationdataforanalysingmalariageneticmarkers AT dhaliwaljasbir leveragingmannwhitneyutestonlargescalegeneticvariationdataforanalysingmalariageneticmarkers AT balasubramaniamvinod leveragingmannwhitneyutestonlargescalegeneticvariationdataforanalysingmalariageneticmarkers |