Cargando…

A computational pipeline to discover highly phylogenetically informative genes in sequenced genomes: application to Saccharomyces cerevisiae natural strains

The quest for genes representing genetic relationships of strains or individuals within populations and their evolutionary history is acquiring a novel dimension of complexity with the advancement of next-generation sequencing (NGS) technologies. In fact, sequencing an entire genome uncovers genetic...

Descripción completa

Detalles Bibliográficos
Autores principales: Ramazzotti, Matteo, Berná, Luisa, Stefanini, Irene, Cavalieri, Duccio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3351171/
https://www.ncbi.nlm.nih.gov/pubmed/22266652
http://dx.doi.org/10.1093/nar/gks005
_version_ 1782232740453679104
author Ramazzotti, Matteo
Berná, Luisa
Stefanini, Irene
Cavalieri, Duccio
author_facet Ramazzotti, Matteo
Berná, Luisa
Stefanini, Irene
Cavalieri, Duccio
author_sort Ramazzotti, Matteo
collection PubMed
description The quest for genes representing genetic relationships of strains or individuals within populations and their evolutionary history is acquiring a novel dimension of complexity with the advancement of next-generation sequencing (NGS) technologies. In fact, sequencing an entire genome uncovers genetic variation in coding and non-coding regions and offers the possibility of studying Saccharomyces cerevisiae populations at the strain level. Nevertheless, the disadvantageous cost-benefit ratio (the amount of details disclosed by NGS against the time-expensive and expertise-demanding data assembly process) still precludes the application of these techniques to the routinely assignment of yeast strains, making the selection of the most reliable molecular markers greatly desirable. In this work we propose an original computational approach to discover genes that can be used as a descriptor of the population structure. We found 13 genes whose variability can be used to recapitulate the phylogeny obtained from genome-wide sequences. The same approach that we prove to be successful in yeasts can be generalized to any other population of individuals given the availability of high-quality genomic sequences and of a clear population structure to be targeted.
format Online
Article
Text
id pubmed-3351171
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-33511712012-05-14 A computational pipeline to discover highly phylogenetically informative genes in sequenced genomes: application to Saccharomyces cerevisiae natural strains Ramazzotti, Matteo Berná, Luisa Stefanini, Irene Cavalieri, Duccio Nucleic Acids Res Computational Biology The quest for genes representing genetic relationships of strains or individuals within populations and their evolutionary history is acquiring a novel dimension of complexity with the advancement of next-generation sequencing (NGS) technologies. In fact, sequencing an entire genome uncovers genetic variation in coding and non-coding regions and offers the possibility of studying Saccharomyces cerevisiae populations at the strain level. Nevertheless, the disadvantageous cost-benefit ratio (the amount of details disclosed by NGS against the time-expensive and expertise-demanding data assembly process) still precludes the application of these techniques to the routinely assignment of yeast strains, making the selection of the most reliable molecular markers greatly desirable. In this work we propose an original computational approach to discover genes that can be used as a descriptor of the population structure. We found 13 genes whose variability can be used to recapitulate the phylogeny obtained from genome-wide sequences. The same approach that we prove to be successful in yeasts can be generalized to any other population of individuals given the availability of high-quality genomic sequences and of a clear population structure to be targeted. Oxford University Press 2012-05 2012-01-20 /pmc/articles/PMC3351171/ /pubmed/22266652 http://dx.doi.org/10.1093/nar/gks005 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Ramazzotti, Matteo
Berná, Luisa
Stefanini, Irene
Cavalieri, Duccio
A computational pipeline to discover highly phylogenetically informative genes in sequenced genomes: application to Saccharomyces cerevisiae natural strains
title A computational pipeline to discover highly phylogenetically informative genes in sequenced genomes: application to Saccharomyces cerevisiae natural strains
title_full A computational pipeline to discover highly phylogenetically informative genes in sequenced genomes: application to Saccharomyces cerevisiae natural strains
title_fullStr A computational pipeline to discover highly phylogenetically informative genes in sequenced genomes: application to Saccharomyces cerevisiae natural strains
title_full_unstemmed A computational pipeline to discover highly phylogenetically informative genes in sequenced genomes: application to Saccharomyces cerevisiae natural strains
title_short A computational pipeline to discover highly phylogenetically informative genes in sequenced genomes: application to Saccharomyces cerevisiae natural strains
title_sort computational pipeline to discover highly phylogenetically informative genes in sequenced genomes: application to saccharomyces cerevisiae natural strains
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3351171/
https://www.ncbi.nlm.nih.gov/pubmed/22266652
http://dx.doi.org/10.1093/nar/gks005
work_keys_str_mv AT ramazzottimatteo acomputationalpipelinetodiscoverhighlyphylogeneticallyinformativegenesinsequencedgenomesapplicationtosaccharomycescerevisiaenaturalstrains
AT bernaluisa acomputationalpipelinetodiscoverhighlyphylogeneticallyinformativegenesinsequencedgenomesapplicationtosaccharomycescerevisiaenaturalstrains
AT stefaniniirene acomputationalpipelinetodiscoverhighlyphylogeneticallyinformativegenesinsequencedgenomesapplicationtosaccharomycescerevisiaenaturalstrains
AT cavalieriduccio acomputationalpipelinetodiscoverhighlyphylogeneticallyinformativegenesinsequencedgenomesapplicationtosaccharomycescerevisiaenaturalstrains
AT ramazzottimatteo computationalpipelinetodiscoverhighlyphylogeneticallyinformativegenesinsequencedgenomesapplicationtosaccharomycescerevisiaenaturalstrains
AT bernaluisa computationalpipelinetodiscoverhighlyphylogeneticallyinformativegenesinsequencedgenomesapplicationtosaccharomycescerevisiaenaturalstrains
AT stefaniniirene computationalpipelinetodiscoverhighlyphylogeneticallyinformativegenesinsequencedgenomesapplicationtosaccharomycescerevisiaenaturalstrains
AT cavalieriduccio computationalpipelinetodiscoverhighlyphylogeneticallyinformativegenesinsequencedgenomesapplicationtosaccharomycescerevisiaenaturalstrains