
PosiGene: automated and easy-to-use pipeline for genome-wide detection of positively selected genes

Many comparative genomics studies aim to find the genetic basis of species-specific phenotypic traits. A prevailing strategy is to search genome-wide for genes that evolved under positive selection based on the non-synonymous to synonymous substitution ratio. However, incongruent results largely due...

Bibliographic Details
Main authors: Sahm, Arne; Bens, Martin; Platzer, Matthias; Szafranski, Karol
Format: Online Article Text
Language: English
Published: Oxford University Press 2017
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5499814/
https://www.ncbi.nlm.nih.gov/pubmed/28334822
http://dx.doi.org/10.1093/nar/gkx179
author Sahm, Arne
Bens, Martin
Platzer, Matthias
Szafranski, Karol
collection PubMed
description Many comparative genomics studies aim to find the genetic basis of species-specific phenotypic traits. A prevailing strategy is to search genome-wide for genes that evolved under positive selection based on the non-synonymous to synonymous substitution ratio. However, incongruent results largely due to high false positive rates indicate the need for standardization of quality criteria and software tools. Main challenges are the ortholog and isoform assignment, the high sensitivity of the statistical models to alignment errors and the imperative to parallelize large parts of the software. We developed the software tool PosiGene that (i) detects positively selected genes (PSGs) on genome-scale, (ii) allows analysis of specific evolutionary branches, (iii) can be used in arbitrary species contexts and (iv) offers visualization of the results for further manual validation and biological interpretation. We exemplify PosiGene's performance using simulated and real data. In the simulated data approach, we determined a false positive rate <1%. With real data, we found that 68.4% of the PSGs detected by PosiGene were shared by at least one previous study that used the same set of species. PosiGene is a user-friendly, reliable tool for reproducible genome-wide identification of PSGs and freely available at https://github.com/gengit/PosiGene.
format Online
Article
Text
id pubmed-5499814
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-54998142017-07-12 PosiGene: automated and easy-to-use pipeline for genome-wide detection of positively selected genes Sahm, Arne Bens, Martin Platzer, Matthias Szafranski, Karol Nucleic Acids Res Methods Online Many comparative genomics studies aim to find the genetic basis of species-specific phenotypic traits. A prevailing strategy is to search genome-wide for genes that evolved under positive selection based on the non-synonymous to synonymous substitution ratio. However, incongruent results largely due to high false positive rates indicate the need for standardization of quality criteria and software tools. Main challenges are the ortholog and isoform assignment, the high sensitivity of the statistical models to alignment errors and the imperative to parallelize large parts of the software. We developed the software tool PosiGene that (i) detects positively selected genes (PSGs) on genome-scale, (ii) allows analysis of specific evolutionary branches, (iii) can be used in arbitrary species contexts and (iv) offers visualization of the results for further manual validation and biological interpretation. We exemplify PosiGene's performance using simulated and real data. In the simulated data approach, we determined a false positive rate <1%. With real data, we found that 68.4% of the PSGs detected by PosiGene, were shared by at least one previous study that used the same set of species. PosiGene is a user-friendly, reliable tool for reproducible genome-wide identification of PSGs and freely available at https://github.com/gengit/PosiGene. Oxford University Press 2017-06-20 2017-03-15 /pmc/articles/PMC5499814/ /pubmed/28334822 http://dx.doi.org/10.1093/nar/gkx179 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. 
http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Sahm, Arne
Bens, Martin
Platzer, Matthias
Szafranski, Karol
PosiGene: automated and easy-to-use pipeline for genome-wide detection of positively selected genes
title PosiGene: automated and easy-to-use pipeline for genome-wide detection of positively selected genes
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5499814/
https://www.ncbi.nlm.nih.gov/pubmed/28334822
http://dx.doi.org/10.1093/nar/gkx179