Cargando…

Leveraging Mann–Whitney U test on large-scale genetic variation data for analysing malaria genetic markers

BACKGROUND: The malaria risk analysis of multiple populations is crucial and of great importance whilst compressing limitations. However, the exponential growth in diversity and accumulation of genetic variation data obtained from malaria-infected patients through Genome-Wide Association Studies ope...

Descripción completa

Detalles Bibliográficos
Autores principales: Tai, Kah Yee, Dhaliwal, Jasbir, Balasubramaniam, Vinod
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8905822/
https://www.ncbi.nlm.nih.gov/pubmed/35264165
http://dx.doi.org/10.1186/s12936-022-04104-x
_version_ 1784665273934020608
author Tai, Kah Yee
Dhaliwal, Jasbir
Balasubramaniam, Vinod
author_facet Tai, Kah Yee
Dhaliwal, Jasbir
Balasubramaniam, Vinod
author_sort Tai, Kah Yee
collection PubMed
description BACKGROUND: The malaria risk analysis of multiple populations is crucial and of great importance whilst compressing limitations. However, the exponential growth in diversity and accumulation of genetic variation data obtained from malaria-infected patients through Genome-Wide Association Studies opens up unprecedented opportunities to explore the significant differences between genetic markers (risk factors), particularly in the resistance or susceptibility of populations to malaria risk. Thus, this study proposes using statistical tests to analyse large-scale genetic variation data, comprising 20,854 samples from 11 populations within three continents: Africa, Oceania, and Asia. METHODS: Even though statistical tests have been utilized to conduct case–control studies since the 1950s to link risk factors to a particular disease, several challenges faced, including the choice of data (ordinal vs. non-ordinal) and test (parametric vs. non-parametric). This study overcomes these challenges by adopting the Mann–Whitney U test to analyse large-scale genetic variation data; to explore the statistical significance of markers between populations; and to further identify the highly differentiated markers. RESULTS: The findings of this study revealed a significant difference in the genetic markers between populations (p < 0.01) in all the case groups and most control groups. However, for the highly differentiated genetic markers, a significant difference (p < 0.01) was present for most genetic markers with varying p-values between the populations in the case and control groups. Moreover, several genetic markers were observed to have very significant differences (p < 0.001) across all populations, while others exist between certain specific populations. Also, several genetic markers have no significant differences between populations. CONCLUSIONS: These findings further support that the genetic markers contribute differently between populations towards malaria resistance or susceptibility, thus showing differences in the likelihood of malaria infection. In addition, this study demonstrated the robustness of the Mann–Whitney U test in analysing genetic markers in large-scale genetic variation data, thereby indicating an alternative method to explore genetic markers in other complex diseases. The findings hold great promise for genetic markers analysis, and the pipeline emphasized in this study can fully be reproduced to analyse new data. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12936-022-04104-x.
format Online
Article
Text
id pubmed-8905822
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-89058222022-03-18 Leveraging Mann–Whitney U test on large-scale genetic variation data for analysing malaria genetic markers Tai, Kah Yee Dhaliwal, Jasbir Balasubramaniam, Vinod Malar J Research BACKGROUND: The malaria risk analysis of multiple populations is crucial and of great importance whilst compressing limitations. However, the exponential growth in diversity and accumulation of genetic variation data obtained from malaria-infected patients through Genome-Wide Association Studies opens up unprecedented opportunities to explore the significant differences between genetic markers (risk factors), particularly in the resistance or susceptibility of populations to malaria risk. Thus, this study proposes using statistical tests to analyse large-scale genetic variation data, comprising 20,854 samples from 11 populations within three continents: Africa, Oceania, and Asia. METHODS: Even though statistical tests have been utilized to conduct case–control studies since the 1950s to link risk factors to a particular disease, several challenges faced, including the choice of data (ordinal vs. non-ordinal) and test (parametric vs. non-parametric). This study overcomes these challenges by adopting the Mann–Whitney U test to analyse large-scale genetic variation data; to explore the statistical significance of markers between populations; and to further identify the highly differentiated markers. RESULTS: The findings of this study revealed a significant difference in the genetic markers between populations (p < 0.01) in all the case groups and most control groups. However, for the highly differentiated genetic markers, a significant difference (p < 0.01) was present for most genetic markers with varying p-values between the populations in the case and control groups. Moreover, several genetic markers were observed to have very significant differences (p < 0.001) across all populations, while others exist between certain specific populations. Also, several genetic markers have no significant differences between populations. CONCLUSIONS: These findings further support that the genetic markers contribute differently between populations towards malaria resistance or susceptibility, thus showing differences in the likelihood of malaria infection. In addition, this study demonstrated the robustness of the Mann–Whitney U test in analysing genetic markers in large-scale genetic variation data, thereby indicating an alternative method to explore genetic markers in other complex diseases. The findings hold great promise for genetic markers analysis, and the pipeline emphasized in this study can fully be reproduced to analyse new data. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12936-022-04104-x. BioMed Central 2022-03-09 /pmc/articles/PMC8905822/ /pubmed/35264165 http://dx.doi.org/10.1186/s12936-022-04104-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Tai, Kah Yee
Dhaliwal, Jasbir
Balasubramaniam, Vinod
Leveraging Mann–Whitney U test on large-scale genetic variation data for analysing malaria genetic markers
title Leveraging Mann–Whitney U test on large-scale genetic variation data for analysing malaria genetic markers
title_full Leveraging Mann–Whitney U test on large-scale genetic variation data for analysing malaria genetic markers
title_fullStr Leveraging Mann–Whitney U test on large-scale genetic variation data for analysing malaria genetic markers
title_full_unstemmed Leveraging Mann–Whitney U test on large-scale genetic variation data for analysing malaria genetic markers
title_short Leveraging Mann–Whitney U test on large-scale genetic variation data for analysing malaria genetic markers
title_sort leveraging mann–whitney u test on large-scale genetic variation data for analysing malaria genetic markers
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8905822/
https://www.ncbi.nlm.nih.gov/pubmed/35264165
http://dx.doi.org/10.1186/s12936-022-04104-x
work_keys_str_mv AT taikahyee leveragingmannwhitneyutestonlargescalegeneticvariationdataforanalysingmalariageneticmarkers
AT dhaliwaljasbir leveragingmannwhitneyutestonlargescalegeneticvariationdataforanalysingmalariageneticmarkers
AT balasubramaniamvinod leveragingmannwhitneyutestonlargescalegeneticvariationdataforanalysingmalariageneticmarkers