
selscan: An Efficient Multithreaded Program to Perform EHH-Based Scans for Positive Selection

Haplotype-based scans to detect natural selection are useful to identify recent or ongoing positive selection in genomes. As both real and simulated genomic data sets grow larger, spanning thousands of samples and millions of markers, there is a need for a fast and efficient implementation of these...


Bibliographic Details
Main Authors: Szpiech, Zachary A.; Hernandez, Ryan D.
Format: Online Article Text
Language: English
Published: Oxford University Press 2014
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4166924/
https://www.ncbi.nlm.nih.gov/pubmed/25015648
http://dx.doi.org/10.1093/molbev/msu211
author Szpiech, Zachary A.
Hernandez, Ryan D.
collection PubMed
description Haplotype-based scans to detect natural selection are useful to identify recent or ongoing positive selection in genomes. As both real and simulated genomic data sets grow larger, spanning thousands of samples and millions of markers, there is a need for a fast and efficient implementation of these scans for general use. Here, we present selscan, an efficient multithreaded application that implements Extended Haplotype Homozygosity (EHH), Integrated Haplotype Score (iHS), and Cross-population EHH (XPEHH). selscan accepts phased genotypes in multiple formats, including TPED, and performs extremely well on both simulated and real data and over an order of magnitude faster than existing available implementations. It calculates iHS on chromosome 22 (22,147 loci) across 204 CEU haplotypes in 353 s on one thread (33 s on 16 threads) and calculates XPEHH for the same data relative to 210 YRI haplotypes in 578 s on one thread (52 s on 16 threads). Source code and binaries (Windows, OSX, and Linux) are available at https://github.com/szpiech/selscan.
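The timings quoted in the description can be turned into a parallel speedup and efficiency with simple arithmetic. The sketch below is only an illustration of that calculation: the dictionary layout and the function names `speedup` and `efficiency` are hypothetical and are not part of selscan.

```python
# Wall-clock times in seconds, copied from the description above.
# The structure and names here are illustrative assumptions, not selscan output.
TIMINGS = {
    "iHS": {"one_thread": 353, "sixteen_threads": 33},
    "XPEHH": {"one_thread": 578, "sixteen_threads": 52},
}
THREADS = 16


def speedup(serial_s, parallel_s):
    """Ratio of single-thread time to multi-thread time."""
    return serial_s / parallel_s


def efficiency(serial_s, parallel_s, threads):
    """Speedup divided by thread count (1.0 would be perfect scaling)."""
    return speedup(serial_s, parallel_s) / threads


for name, t in TIMINGS.items():
    s = speedup(t["one_thread"], t["sixteen_threads"])
    e = efficiency(t["one_thread"], t["sixteen_threads"], THREADS)
    print(f"{name}: speedup {s:.1f}x, efficiency {e:.0%}")
```

On these figures the 16-thread runs are roughly 10.7 times (iHS) and 11.1 times (XPEHH) faster than the single-thread runs, which corresponds to about two thirds of ideal linear scaling.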
format Online
Article
Text
id pubmed-4166924
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-41669242014-09-22 selscan: An Efficient Multithreaded Program to Perform EHH-Based Scans for Positive Selection Szpiech, Zachary A. Hernandez, Ryan D. Mol Biol Evol Resources Haplotype-based scans to detect natural selection are useful to identify recent or ongoing positive selection in genomes. As both real and simulated genomic data sets grow larger, spanning thousands of samples and millions of markers, there is a need for a fast and efficient implementation of these scans for general use. Here, we present selscan, an efficient multithreaded application that implements Extended Haplotype Homozygosity (EHH), Integrated Haplotype Score (iHS), and Cross-population EHH (XPEHH). selscan accepts phased genotypes in multiple formats, including TPED, and performs extremely well on both simulated and real data and over an order of magnitude faster than existing available implementations. It calculates iHS on chromosome 22 (22,147 loci) across 204 CEU haplotypes in 353 s on one thread (33 s on 16 threads) and calculates XPEHH for the same data relative to 210 YRI haplotypes in 578 s on one thread (52 s on 16 threads). Source code and binaries (Windows, OSX, and Linux) are available at https://github.com/szpiech/selscan. Oxford University Press 2014-10 2014-07-10 /pmc/articles/PMC4166924/ /pubmed/25015648 http://dx.doi.org/10.1093/molbev/msu211 Text en © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
title selscan: An Efficient Multithreaded Program to Perform EHH-Based Scans for Positive Selection
topic Resources