Cargando…
PosiGene: automated and easy-to-use pipeline for genome-wide detection of positively selected genes
Many comparative genomics studies aim to find the genetic basis of species-specific phenotypic traits. A prevailing strategy is to search genome-wide for genes that evolved under positive selection based on the non-synonymous to synonymous substitution ratio. However, incongruent results largely due...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5499814/ https://www.ncbi.nlm.nih.gov/pubmed/28334822 http://dx.doi.org/10.1093/nar/gkx179 |
_version_ | 1783248535142006784 |
---|---|
author | Sahm, Arne Bens, Martin Platzer, Matthias Szafranski, Karol |
author_facet | Sahm, Arne Bens, Martin Platzer, Matthias Szafranski, Karol |
author_sort | Sahm, Arne |
collection | PubMed |
description | Many comparative genomics studies aim to find the genetic basis of species-specific phenotypic traits. A prevailing strategy is to search genome-wide for genes that evolved under positive selection based on the non-synonymous to synonymous substitution ratio. However, incongruent results largely due to high false positive rates indicate the need for standardization of quality criteria and software tools. Main challenges are the ortholog and isoform assignment, the high sensitivity of the statistical models to alignment errors and the imperative to parallelize large parts of the software. We developed the software tool PosiGene that (i) detects positively selected genes (PSGs) on genome-scale, (ii) allows analysis of specific evolutionary branches, (iii) can be used in arbitrary species contexts and (iv) offers visualization of the results for further manual validation and biological interpretation. We exemplify PosiGene's performance using simulated and real data. In the simulated data approach, we determined a false positive rate <1%. With real data, we found that 68.4% of the PSGs detected by PosiGene, were shared by at least one previous study that used the same set of species. PosiGene is a user-friendly, reliable tool for reproducible genome-wide identification of PSGs and freely available at https://github.com/gengit/PosiGene. |
format | Online Article Text |
id | pubmed-5499814 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-54998142017-07-12 PosiGene: automated and easy-to-use pipeline for genome-wide detection of positively selected genes Sahm, Arne Bens, Martin Platzer, Matthias Szafranski, Karol Nucleic Acids Res Methods Online Many comparative genomics studies aim to find the genetic basis of species-specific phenotypic traits. A prevailing strategy is to search genome-wide for genes that evolved under positive selection based on the non-synonymous to synonymous substitution ratio. However, incongruent results largely due to high false positive rates indicate the need for standardization of quality criteria and software tools. Main challenges are the ortholog and isoform assignment, the high sensitivity of the statistical models to alignment errors and the imperative to parallelize large parts of the software. We developed the software tool PosiGene that (i) detects positively selected genes (PSGs) on genome-scale, (ii) allows analysis of specific evolutionary branches, (iii) can be used in arbitrary species contexts and (iv) offers visualization of the results for further manual validation and biological interpretation. We exemplify PosiGene's performance using simulated and real data. In the simulated data approach, we determined a false positive rate <1%. With real data, we found that 68.4% of the PSGs detected by PosiGene, were shared by at least one previous study that used the same set of species. PosiGene is a user-friendly, reliable tool for reproducible genome-wide identification of PSGs and freely available at https://github.com/gengit/PosiGene. Oxford University Press 2017-06-20 2017-03-15 /pmc/articles/PMC5499814/ /pubmed/28334822 http://dx.doi.org/10.1093/nar/gkx179 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Sahm, Arne Bens, Martin Platzer, Matthias Szafranski, Karol PosiGene: automated and easy-to-use pipeline for genome-wide detection of positively selected genes |
title | PosiGene: automated and easy-to-use pipeline for genome-wide detection of positively selected genes |
title_full | PosiGene: automated and easy-to-use pipeline for genome-wide detection of positively selected genes |
title_fullStr | PosiGene: automated and easy-to-use pipeline for genome-wide detection of positively selected genes |
title_full_unstemmed | PosiGene: automated and easy-to-use pipeline for genome-wide detection of positively selected genes |
title_short | PosiGene: automated and easy-to-use pipeline for genome-wide detection of positively selected genes |
title_sort | posigene: automated and easy-to-use pipeline for genome-wide detection of positively selected genes |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5499814/ https://www.ncbi.nlm.nih.gov/pubmed/28334822 http://dx.doi.org/10.1093/nar/gkx179 |
work_keys_str_mv | AT sahmarne posigeneautomatedandeasytousepipelineforgenomewidedetectionofpositivelyselectedgenes AT bensmartin posigeneautomatedandeasytousepipelineforgenomewidedetectionofpositivelyselectedgenes AT platzermatthias posigeneautomatedandeasytousepipelineforgenomewidedetectionofpositivelyselectedgenes AT szafranskikarol posigeneautomatedandeasytousepipelineforgenomewidedetectionofpositivelyselectedgenes |