selscan: An Efficient Multithreaded Program to Perform EHH-Based Scans for Positive Selection
Haplotype-based scans to detect natural selection are useful to identify recent or ongoing positive selection in genomes. As both real and simulated genomic data sets grow larger, spanning thousands of samples and millions of markers, there is a need for a fast and efficient implementation of these...
Main authors: | Szpiech, Zachary A.; Hernandez, Ryan D. |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Oxford University Press 2014 |
Subjects: | Resources |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4166924/ https://www.ncbi.nlm.nih.gov/pubmed/25015648 http://dx.doi.org/10.1093/molbev/msu211 |
_version_ | 1782335346497814528 |
---|---|
author | Szpiech, Zachary A. Hernandez, Ryan D. |
author_facet | Szpiech, Zachary A. Hernandez, Ryan D. |
author_sort | Szpiech, Zachary A. |
collection | PubMed |
description | Haplotype-based scans to detect natural selection are useful to identify recent or ongoing positive selection in genomes. As both real and simulated genomic data sets grow larger, spanning thousands of samples and millions of markers, there is a need for a fast and efficient implementation of these scans for general use. Here, we present selscan, an efficient multithreaded application that implements Extended Haplotype Homozygosity (EHH), Integrated Haplotype Score (iHS), and Cross-population EHH (XPEHH). selscan accepts phased genotypes in multiple formats, including TPED, and performs extremely well on both simulated and real data and over an order of magnitude faster than existing available implementations. It calculates iHS on chromosome 22 (22,147 loci) across 204 CEU haplotypes in 353 s on one thread (33 s on 16 threads) and calculates XPEHH for the same data relative to 210 YRI haplotypes in 578 s on one thread (52 s on 16 threads). Source code and binaries (Windows, OSX, and Linux) are available at https://github.com/szpiech/selscan. |
format | Online Article Text |
id | pubmed-4166924 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-41669242014-09-22 selscan: An Efficient Multithreaded Program to Perform EHH-Based Scans for Positive Selection Szpiech, Zachary A. Hernandez, Ryan D. Mol Biol Evol Resources Haplotype-based scans to detect natural selection are useful to identify recent or ongoing positive selection in genomes. As both real and simulated genomic data sets grow larger, spanning thousands of samples and millions of markers, there is a need for a fast and efficient implementation of these scans for general use. Here, we present selscan, an efficient multithreaded application that implements Extended Haplotype Homozygosity (EHH), Integrated Haplotype Score (iHS), and Cross-population EHH (XPEHH). selscan accepts phased genotypes in multiple formats, including TPED, and performs extremely well on both simulated and real data and over an order of magnitude faster than existing available implementations. It calculates iHS on chromosome 22 (22,147 loci) across 204 CEU haplotypes in 353 s on one thread (33 s on 16 threads) and calculates XPEHH for the same data relative to 210 YRI haplotypes in 578 s on one thread (52 s on 16 threads). Source code and binaries (Windows, OSX, and Linux) are available at https://github.com/szpiech/selscan. Oxford University Press 2014-10 2014-07-10 /pmc/articles/PMC4166924/ /pubmed/25015648 http://dx.doi.org/10.1093/molbev/msu211 Text en © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Resources Szpiech, Zachary A. Hernandez, Ryan D. selscan: An Efficient Multithreaded Program to Perform EHH-Based Scans for Positive Selection |
title | selscan: An Efficient Multithreaded Program to Perform EHH-Based Scans for Positive Selection |
title_full | selscan: An Efficient Multithreaded Program to Perform EHH-Based Scans for Positive Selection |
title_fullStr | selscan: An Efficient Multithreaded Program to Perform EHH-Based Scans for Positive Selection |
title_full_unstemmed | selscan: An Efficient Multithreaded Program to Perform EHH-Based Scans for Positive Selection |
title_short | selscan: An Efficient Multithreaded Program to Perform EHH-Based Scans for Positive Selection |
title_sort | selscan: an efficient multithreaded program to perform ehh-based scans for positive selection |
topic | Resources |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4166924/ https://www.ncbi.nlm.nih.gov/pubmed/25015648 http://dx.doi.org/10.1093/molbev/msu211 |
work_keys_str_mv | AT szpiechzacharya selscananefficientmultithreadedprogramtoperformehhbasedscansforpositiveselection AT hernandezryand selscananefficientmultithreadedprogramtoperformehhbasedscansforpositiveselection |
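
The description field reports wall-clock times for selscan on one thread and on 16 threads. As a rough reading aid, the sketch below derives the implied speedup and parallel efficiency from those four figures. The function names and the `timings` dictionary are illustrative assumptions, not part of selscan or of this record; only the timing values and the thread count come from the abstract.

```python
# Speedup and parallel efficiency implied by the timings quoted in the abstract.
# Helper names are hypothetical; only the numbers come from the record.

def speedup(t_single: float, t_multi: float) -> float:
    """Ratio of single-thread time to multi-thread time."""
    return t_single / t_multi


def parallel_efficiency(t_single: float, t_multi: float, threads: int) -> float:
    """Speedup divided by the number of threads (1.0 would be ideal scaling)."""
    return speedup(t_single, t_multi) / threads


THREADS = 16

# (seconds on 1 thread, seconds on 16 threads), as reported in the abstract
timings = {
    "iHS": (353.0, 33.0),
    "XPEHH": (578.0, 52.0),
}

results = {
    name: (speedup(t1, tn), parallel_efficiency(t1, tn, THREADS))
    for name, (t1, tn) in timings.items()
}

for name, (s, e) in results.items():
    print(f"{name}: speedup {s:.1f}x, efficiency {e:.0%} on {THREADS} threads")
```

Both statistics come out at roughly an 11-fold speedup on 16 threads, which is about two thirds of ideal linear scaling.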