Cargando…

Nonparametric approaches for population structure analysis

The analysis of population structure has many applications in medical and population genetic research. Such analysis is used to provide clear insight into the underlying genetic population substructure and is a crucial prerequisite for any analysis of genetic data. The analysis involves grouping ind...

Descripción completa

Detalles Bibliográficos
Autores principales: Alhusain, Luluah, Hafez, Alaaeldin M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5944014/
https://www.ncbi.nlm.nih.gov/pubmed/29743099
http://dx.doi.org/10.1186/s40246-018-0156-4
_version_ 1783321740721520640
author Alhusain, Luluah
Hafez, Alaaeldin M.
author_facet Alhusain, Luluah
Hafez, Alaaeldin M.
author_sort Alhusain, Luluah
collection PubMed
description The analysis of population structure has many applications in medical and population genetic research. Such analysis is used to provide clear insight into the underlying genetic population substructure and is a crucial prerequisite for any analysis of genetic data. The analysis involves grouping individuals into subpopulations based on shared genetic variations. The most widely used markers to study the variation of DNA sequences between populations are single nucleotide polymorphisms. Data preprocessing is a necessary step to assess the quality of the data and to determine which markers or individuals can reasonably be included in the analysis. After preprocessing, several methods can be utilized to uncover population substructure, which can be categorized into two broad approaches: parametric and nonparametric. Parametric approaches use statistical models to infer population structure and assign individuals into subpopulations. However, these approaches suffer from many drawbacks that make them impractical for large datasets. In contrast, nonparametric approaches do not suffer from these drawbacks, making them more viable than parametric approaches for analyzing large datasets. Consequently, nonparametric approaches are increasingly used to reveal population substructure. Thus, this paper reviews and discusses the nonparametric approaches that are available for population structure analysis along with some implications to resolve challenges.
format Online
Article
Text
id pubmed-5944014
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-59440142018-05-14 Nonparametric approaches for population structure analysis Alhusain, Luluah Hafez, Alaaeldin M. Hum Genomics Review The analysis of population structure has many applications in medical and population genetic research. Such analysis is used to provide clear insight into the underlying genetic population substructure and is a crucial prerequisite for any analysis of genetic data. The analysis involves grouping individuals into subpopulations based on shared genetic variations. The most widely used markers to study the variation of DNA sequences between populations are single nucleotide polymorphisms. Data preprocessing is a necessary step to assess the quality of the data and to determine which markers or individuals can reasonably be included in the analysis. After preprocessing, several methods can be utilized to uncover population substructure, which can be categorized into two broad approaches: parametric and nonparametric. Parametric approaches use statistical models to infer population structure and assign individuals into subpopulations. However, these approaches suffer from many drawbacks that make them impractical for large datasets. In contrast, nonparametric approaches do not suffer from these drawbacks, making them more viable than parametric approaches for analyzing large datasets. Consequently, nonparametric approaches are increasingly used to reveal population substructure. Thus, this paper reviews and discusses the nonparametric approaches that are available for population structure analysis along with some implications to resolve challenges. BioMed Central 2018-05-09 /pmc/articles/PMC5944014/ /pubmed/29743099 http://dx.doi.org/10.1186/s40246-018-0156-4 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Review
Alhusain, Luluah
Hafez, Alaaeldin M.
Nonparametric approaches for population structure analysis
title Nonparametric approaches for population structure analysis
title_full Nonparametric approaches for population structure analysis
title_fullStr Nonparametric approaches for population structure analysis
title_full_unstemmed Nonparametric approaches for population structure analysis
title_short Nonparametric approaches for population structure analysis
title_sort nonparametric approaches for population structure analysis
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5944014/
https://www.ncbi.nlm.nih.gov/pubmed/29743099
http://dx.doi.org/10.1186/s40246-018-0156-4
work_keys_str_mv AT alhusainluluah nonparametricapproachesforpopulationstructureanalysis
AT hafezalaaeldinm nonparametricapproachesforpopulationstructureanalysis