Cargando…

Supervised Machine Learning for Population Genetics: A New Paradigm

As population genomic datasets grow in size, researchers are faced with the daunting task of making sense of a flood of information. To keep pace with this explosion of data, computational methodologies for population genetic inference are rapidly being developed to best utilize genomic sequence dat...

Descripción completa

Detalles Bibliográficos
Autores principales: Schrider, Daniel R., Kern, Andrew D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5905713/
https://www.ncbi.nlm.nih.gov/pubmed/29331490
http://dx.doi.org/10.1016/j.tig.2017.12.005
_version_ 1783315301500190720
author Schrider, Daniel R.
Kern, Andrew D.
author_facet Schrider, Daniel R.
Kern, Andrew D.
author_sort Schrider, Daniel R.
collection PubMed
description As population genomic datasets grow in size, researchers are faced with the daunting task of making sense of a flood of information. To keep pace with this explosion of data, computational methodologies for population genetic inference are rapidly being developed to best utilize genomic sequence data. In this review we discuss a new paradigm that has emerged in computational population genomics: that of supervised machine learning (ML). We review the fundamentals of ML, discuss recent applications of supervised ML to population genetics that outperform competing methods, and describe promising future directions in this area. Ultimately, we argue that supervised ML is an important and underutilized tool that has considerable potential for the world of evolutionary genomics.
format Online
Article
Text
id pubmed-5905713
institution National Center for Biotechnology Information
language English
publishDate 2018
record_format MEDLINE/PubMed
spelling pubmed-59057132018-04-18 Supervised Machine Learning for Population Genetics: A New Paradigm Schrider, Daniel R. Kern, Andrew D. Trends Genet Article As population genomic datasets grow in size, researchers are faced with the daunting task of making sense of a flood of information. To keep pace with this explosion of data, computational methodologies for population genetic inference are rapidly being developed to best utilize genomic sequence data. In this review we discuss a new paradigm that has emerged in computational population genomics: that of supervised machine learning (ML). We review the fundamentals of ML, discuss recent applications of supervised ML to population genetics that outperform competing methods, and describe promising future directions in this area. Ultimately, we argue that supervised ML is an important and underutilized tool that has considerable potential for the world of evolutionary genomics. 2018-01-10 2018-04 /pmc/articles/PMC5905713/ /pubmed/29331490 http://dx.doi.org/10.1016/j.tig.2017.12.005 Text en http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Schrider, Daniel R.
Kern, Andrew D.
Supervised Machine Learning for Population Genetics: A New Paradigm
title Supervised Machine Learning for Population Genetics: A New Paradigm
title_full Supervised Machine Learning for Population Genetics: A New Paradigm
title_fullStr Supervised Machine Learning for Population Genetics: A New Paradigm
title_full_unstemmed Supervised Machine Learning for Population Genetics: A New Paradigm
title_short Supervised Machine Learning for Population Genetics: A New Paradigm
title_sort supervised machine learning for population genetics: a new paradigm
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5905713/
https://www.ncbi.nlm.nih.gov/pubmed/29331490
http://dx.doi.org/10.1016/j.tig.2017.12.005
work_keys_str_mv AT schriderdanielr supervisedmachinelearningforpopulationgeneticsanewparadigm
AT kernandrewd supervisedmachinelearningforpopulationgeneticsanewparadigm