Cargando…
A computational pipeline to discover highly phylogenetically informative genes in sequenced genomes: application to Saccharomyces cerevisiae natural strains
The quest for genes representing genetic relationships of strains or individuals within populations and their evolutionary history is acquiring a novel dimension of complexity with the advancement of next-generation sequencing (NGS) technologies. In fact, sequencing an entire genome uncovers genetic...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3351171/ https://www.ncbi.nlm.nih.gov/pubmed/22266652 http://dx.doi.org/10.1093/nar/gks005 |
_version_ | 1782232740453679104 |
---|---|
author | Ramazzotti, Matteo Berná, Luisa Stefanini, Irene Cavalieri, Duccio |
author_facet | Ramazzotti, Matteo Berná, Luisa Stefanini, Irene Cavalieri, Duccio |
author_sort | Ramazzotti, Matteo |
collection | PubMed |
description | The quest for genes representing genetic relationships of strains or individuals within populations and their evolutionary history is acquiring a novel dimension of complexity with the advancement of next-generation sequencing (NGS) technologies. In fact, sequencing an entire genome uncovers genetic variation in coding and non-coding regions and offers the possibility of studying Saccharomyces cerevisiae populations at the strain level. Nevertheless, the disadvantageous cost-benefit ratio (the amount of details disclosed by NGS against the time-expensive and expertise-demanding data assembly process) still precludes the application of these techniques to the routinely assignment of yeast strains, making the selection of the most reliable molecular markers greatly desirable. In this work we propose an original computational approach to discover genes that can be used as a descriptor of the population structure. We found 13 genes whose variability can be used to recapitulate the phylogeny obtained from genome-wide sequences. The same approach that we prove to be successful in yeasts can be generalized to any other population of individuals given the availability of high-quality genomic sequences and of a clear population structure to be targeted. |
format | Online Article Text |
id | pubmed-3351171 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-33511712012-05-14 A computational pipeline to discover highly phylogenetically informative genes in sequenced genomes: application to Saccharomyces cerevisiae natural strains Ramazzotti, Matteo Berná, Luisa Stefanini, Irene Cavalieri, Duccio Nucleic Acids Res Computational Biology The quest for genes representing genetic relationships of strains or individuals within populations and their evolutionary history is acquiring a novel dimension of complexity with the advancement of next-generation sequencing (NGS) technologies. In fact, sequencing an entire genome uncovers genetic variation in coding and non-coding regions and offers the possibility of studying Saccharomyces cerevisiae populations at the strain level. Nevertheless, the disadvantageous cost-benefit ratio (the amount of details disclosed by NGS against the time-expensive and expertise-demanding data assembly process) still precludes the application of these techniques to the routinely assignment of yeast strains, making the selection of the most reliable molecular markers greatly desirable. In this work we propose an original computational approach to discover genes that can be used as a descriptor of the population structure. We found 13 genes whose variability can be used to recapitulate the phylogeny obtained from genome-wide sequences. The same approach that we prove to be successful in yeasts can be generalized to any other population of individuals given the availability of high-quality genomic sequences and of a clear population structure to be targeted. Oxford University Press 2012-05 2012-01-20 /pmc/articles/PMC3351171/ /pubmed/22266652 http://dx.doi.org/10.1093/nar/gks005 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Computational Biology Ramazzotti, Matteo Berná, Luisa Stefanini, Irene Cavalieri, Duccio A computational pipeline to discover highly phylogenetically informative genes in sequenced genomes: application to Saccharomyces cerevisiae natural strains |
title | A computational pipeline to discover highly phylogenetically informative genes in sequenced genomes: application to Saccharomyces cerevisiae natural strains |
title_full | A computational pipeline to discover highly phylogenetically informative genes in sequenced genomes: application to Saccharomyces cerevisiae natural strains |
title_fullStr | A computational pipeline to discover highly phylogenetically informative genes in sequenced genomes: application to Saccharomyces cerevisiae natural strains |
title_full_unstemmed | A computational pipeline to discover highly phylogenetically informative genes in sequenced genomes: application to Saccharomyces cerevisiae natural strains |
title_short | A computational pipeline to discover highly phylogenetically informative genes in sequenced genomes: application to Saccharomyces cerevisiae natural strains |
title_sort | computational pipeline to discover highly phylogenetically informative genes in sequenced genomes: application to saccharomyces cerevisiae natural strains |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3351171/ https://www.ncbi.nlm.nih.gov/pubmed/22266652 http://dx.doi.org/10.1093/nar/gks005 |
work_keys_str_mv | AT ramazzottimatteo acomputationalpipelinetodiscoverhighlyphylogeneticallyinformativegenesinsequencedgenomesapplicationtosaccharomycescerevisiaenaturalstrains AT bernaluisa acomputationalpipelinetodiscoverhighlyphylogeneticallyinformativegenesinsequencedgenomesapplicationtosaccharomycescerevisiaenaturalstrains AT stefaniniirene acomputationalpipelinetodiscoverhighlyphylogeneticallyinformativegenesinsequencedgenomesapplicationtosaccharomycescerevisiaenaturalstrains AT cavalieriduccio acomputationalpipelinetodiscoverhighlyphylogeneticallyinformativegenesinsequencedgenomesapplicationtosaccharomycescerevisiaenaturalstrains AT ramazzottimatteo computationalpipelinetodiscoverhighlyphylogeneticallyinformativegenesinsequencedgenomesapplicationtosaccharomycescerevisiaenaturalstrains AT bernaluisa computationalpipelinetodiscoverhighlyphylogeneticallyinformativegenesinsequencedgenomesapplicationtosaccharomycescerevisiaenaturalstrains AT stefaniniirene computationalpipelinetodiscoverhighlyphylogeneticallyinformativegenesinsequencedgenomesapplicationtosaccharomycescerevisiaenaturalstrains AT cavalieriduccio computationalpipelinetodiscoverhighlyphylogeneticallyinformativegenesinsequencedgenomesapplicationtosaccharomycescerevisiaenaturalstrains |