
Assembly of a phased diploid Candida albicans genome facilitates allele-specific measurements and provides a simple model for repeat and indel structure

BACKGROUND: Candida albicans is a ubiquitous opportunistic fungal pathogen that afflicts immunocompromised human hosts. With rare and transient exceptions the yeast is diploid, yet despite its clinical relevance the respective sequences of its two homologous chromosomes have not been completely reso...


Bibliographic Details
Main authors: Muzzey, Dale, Schwartz, Katja, Weissman, Jonathan S, Sherlock, Gavin
Format: Online Article Text
Language: English
Published: BioMed Central 2013
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4054093/
https://www.ncbi.nlm.nih.gov/pubmed/24025428
http://dx.doi.org/10.1186/gb-2013-14-9-r97
_version_ 1782320504055529472
author Muzzey, Dale
Schwartz, Katja
Weissman, Jonathan S
Sherlock, Gavin
author_sort Muzzey, Dale
collection PubMed
description BACKGROUND: Candida albicans is a ubiquitous opportunistic fungal pathogen that afflicts immunocompromised human hosts. With rare and transient exceptions the yeast is diploid, yet despite its clinical relevance the respective sequences of its two homologous chromosomes have not been completely resolved. RESULTS: We construct a phased diploid genome assembly by deep sequencing a standard laboratory wild-type strain and a panel of strains homozygous for particular chromosomes. The assembly has 700-fold coverage on average, allowing extensive revision and expansion of the number of known SNPs and indels. This phased genome significantly enhances the sensitivity and specificity of allele-specific expression measurements by enabling pooling and cross-validation of signal across multiple polymorphic sites. Additionally, the diploid assembly reveals pervasive and unexpected patterns in allelic differences between homologous chromosomes. Firstly, we see striking clustering of indels, concentrated primarily in the repeat sequences in promoters. Secondly, both indels and their repeat-sequence substrate are enriched near replication origins. Finally, we reveal an intimate link between repeat sequences and indels, which argues that repeat length is under selective pressure for most eukaryotes. This connection is described by a concise one-parameter model that explains repeat-sequence abundance in C. albicans as a function of the indel rate, and provides a general framework to interpret repeat abundance in species ranging from bacteria to humans. CONCLUSIONS: The phased genome assembly and insights into repeat plasticity will be valuable for better understanding allele-specific phenomena and genome evolution.
format Online
Article
Text
id pubmed-4054093
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-4054093 2014-06-12 Genome Biol Research BioMed Central 2013 2013-09-11 /pmc/articles/PMC4054093/ /pubmed/24025428 http://dx.doi.org/10.1186/gb-2013-14-9-r97 Text en Copyright © 2013 Muzzey et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Assembly of a phased diploid Candida albicans genome facilitates allele-specific measurements and provides a simple model for repeat and indel structure
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4054093/
https://www.ncbi.nlm.nih.gov/pubmed/24025428
http://dx.doi.org/10.1186/gb-2013-14-9-r97