Cargando…

AGAPE (Automated Genome Analysis PipelinE) for Pan-Genome Analysis of Saccharomyces cerevisiae

The characterization and public release of genome sequences from thousands of organisms is expanding the scope for genetic variation studies. However, understanding the phenotypic consequences of genetic variation remains a challenge in eukaryotes due to the complexity of the genotype-phenotype map....

Descripción completa

Detalles Bibliográficos
Autores principales: Song, Giltae, Dickins, Benjamin J. A., Demeter, Janos, Engel, Stacia, Dunn, Barbara, Cherry, J. Michael
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4363492/
https://www.ncbi.nlm.nih.gov/pubmed/25781462
http://dx.doi.org/10.1371/journal.pone.0120671
_version_ 1782361917756538880
author Song, Giltae
Dickins, Benjamin J. A.
Demeter, Janos
Engel, Stacia
Dunn, Barbara
Cherry, J. Michael
author_facet Song, Giltae
Dickins, Benjamin J. A.
Demeter, Janos
Engel, Stacia
Dunn, Barbara
Cherry, J. Michael
author_sort Song, Giltae
collection PubMed
description The characterization and public release of genome sequences from thousands of organisms is expanding the scope for genetic variation studies. However, understanding the phenotypic consequences of genetic variation remains a challenge in eukaryotes due to the complexity of the genotype-phenotype map. One approach to this is the intensive study of model systems for which diverse sources of information can be accumulated and integrated. Saccharomyces cerevisiae is an extensively studied model organism, with well-known protein functions and thoroughly curated phenotype data. To develop and expand the available resources linking genomic variation with function in yeast, we aim to model the pan-genome of S. cerevisiae. To initiate the yeast pan-genome, we newly sequenced or re-sequenced the genomes of 25 strains that are commonly used in the yeast research community using advanced sequencing technology at high quality. We also developed a pipeline for automated pan-genome analysis, which integrates the steps of assembly, annotation, and variation calling. To assign strain-specific functional annotations, we identified genes that were not present in the reference genome. We classified these according to their presence or absence across strains and characterized each group of genes with known functional and phenotypic features. The functional roles of novel genes not found in the reference genome and associated with strains or groups of strains appear to be consistent with anticipated adaptations in specific lineages. As more S. cerevisiae strain genomes are released, our analysis can be used to collate genome data and relate it to lineage-specific patterns of genome evolution. Our new tool set will enhance our understanding of genomic and functional evolution in S. cerevisiae, and will be available to the yeast genetics and molecular biology community.
format Online
Article
Text
id pubmed-4363492
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-43634922015-03-23 AGAPE (Automated Genome Analysis PipelinE) for Pan-Genome Analysis of Saccharomyces cerevisiae Song, Giltae Dickins, Benjamin J. A. Demeter, Janos Engel, Stacia Dunn, Barbara Cherry, J. Michael PLoS One Research Article The characterization and public release of genome sequences from thousands of organisms is expanding the scope for genetic variation studies. However, understanding the phenotypic consequences of genetic variation remains a challenge in eukaryotes due to the complexity of the genotype-phenotype map. One approach to this is the intensive study of model systems for which diverse sources of information can be accumulated and integrated. Saccharomyces cerevisiae is an extensively studied model organism, with well-known protein functions and thoroughly curated phenotype data. To develop and expand the available resources linking genomic variation with function in yeast, we aim to model the pan-genome of S. cerevisiae. To initiate the yeast pan-genome, we newly sequenced or re-sequenced the genomes of 25 strains that are commonly used in the yeast research community using advanced sequencing technology at high quality. We also developed a pipeline for automated pan-genome analysis, which integrates the steps of assembly, annotation, and variation calling. To assign strain-specific functional annotations, we identified genes that were not present in the reference genome. We classified these according to their presence or absence across strains and characterized each group of genes with known functional and phenotypic features. The functional roles of novel genes not found in the reference genome and associated with strains or groups of strains appear to be consistent with anticipated adaptations in specific lineages. As more S. cerevisiae strain genomes are released, our analysis can be used to collate genome data and relate it to lineage-specific patterns of genome evolution. Our new tool set will enhance our understanding of genomic and functional evolution in S. cerevisiae, and will be available to the yeast genetics and molecular biology community. Public Library of Science 2015-03-17 /pmc/articles/PMC4363492/ /pubmed/25781462 http://dx.doi.org/10.1371/journal.pone.0120671 Text en © 2015 Song et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Song, Giltae
Dickins, Benjamin J. A.
Demeter, Janos
Engel, Stacia
Dunn, Barbara
Cherry, J. Michael
AGAPE (Automated Genome Analysis PipelinE) for Pan-Genome Analysis of Saccharomyces cerevisiae
title AGAPE (Automated Genome Analysis PipelinE) for Pan-Genome Analysis of Saccharomyces cerevisiae
title_full AGAPE (Automated Genome Analysis PipelinE) for Pan-Genome Analysis of Saccharomyces cerevisiae
title_fullStr AGAPE (Automated Genome Analysis PipelinE) for Pan-Genome Analysis of Saccharomyces cerevisiae
title_full_unstemmed AGAPE (Automated Genome Analysis PipelinE) for Pan-Genome Analysis of Saccharomyces cerevisiae
title_short AGAPE (Automated Genome Analysis PipelinE) for Pan-Genome Analysis of Saccharomyces cerevisiae
title_sort agape (automated genome analysis pipeline) for pan-genome analysis of saccharomyces cerevisiae
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4363492/
https://www.ncbi.nlm.nih.gov/pubmed/25781462
http://dx.doi.org/10.1371/journal.pone.0120671
work_keys_str_mv AT songgiltae agapeautomatedgenomeanalysispipelineforpangenomeanalysisofsaccharomycescerevisiae
AT dickinsbenjaminja agapeautomatedgenomeanalysispipelineforpangenomeanalysisofsaccharomycescerevisiae
AT demeterjanos agapeautomatedgenomeanalysispipelineforpangenomeanalysisofsaccharomycescerevisiae
AT engelstacia agapeautomatedgenomeanalysispipelineforpangenomeanalysisofsaccharomycescerevisiae
AT dunnbarbara agapeautomatedgenomeanalysispipelineforpangenomeanalysisofsaccharomycescerevisiae
AT cherryjmichael agapeautomatedgenomeanalysispipelineforpangenomeanalysisofsaccharomycescerevisiae