Cargando…
Assembly of a phased diploid Candida albicans genome facilitates allele-specific measurements and provides a simple model for repeat and indel structure
BACKGROUND: Candida albicans is a ubiquitous opportunistic fungal pathogen that afflicts immunocompromised human hosts. With rare and transient exceptions the yeast is diploid, yet despite its clinical relevance the respective sequences of its two homologous chromosomes have not been completely reso...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4054093/ https://www.ncbi.nlm.nih.gov/pubmed/24025428 http://dx.doi.org/10.1186/gb-2013-14-9-r97 |
_version_ | 1782320504055529472 |
---|---|
author | Muzzey, Dale Schwartz, Katja Weissman, Jonathan S Sherlock, Gavin |
author_facet | Muzzey, Dale Schwartz, Katja Weissman, Jonathan S Sherlock, Gavin |
author_sort | Muzzey, Dale |
collection | PubMed |
description | BACKGROUND: Candida albicans is a ubiquitous opportunistic fungal pathogen that afflicts immunocompromised human hosts. With rare and transient exceptions the yeast is diploid, yet despite its clinical relevance the respective sequences of its two homologous chromosomes have not been completely resolved. RESULTS: We construct a phased diploid genome assembly by deep sequencing a standard laboratory wild-type strain and a panel of strains homozygous for particular chromosomes. The assembly has 700-fold coverage on average, allowing extensive revision and expansion of the number of known SNPs and indels. This phased genome significantly enhances the sensitivity and specificity of allele-specific expression measurements by enabling pooling and cross-validation of signal across multiple polymorphic sites. Additionally, the diploid assembly reveals pervasive and unexpected patterns in allelic differences between homologous chromosomes. Firstly, we see striking clustering of indels, concentrated primarily in the repeat sequences in promoters. Secondly, both indels and their repeat-sequence substrate are enriched near replication origins. Finally, we reveal an intimate link between repeat sequences and indels, which argues that repeat length is under selective pressure for most eukaryotes. This connection is described by a concise one-parameter model that explains repeat-sequence abundance in C. albicans as a function of the indel rate, and provides a general framework to interpret repeat abundance in species ranging from bacteria to humans. CONCLUSIONS: The phased genome assembly and insights into repeat plasticity will be valuable for better understanding allele-specific phenomena and genome evolution. |
format | Online Article Text |
id | pubmed-4054093 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40540932014-06-12 Assembly of a phased diploid Candida albicans genome facilitates allele-specific measurements and provides a simple model for repeat and indel structure Muzzey, Dale Schwartz, Katja Weissman, Jonathan S Sherlock, Gavin Genome Biol Research BACKGROUND: Candida albicans is a ubiquitous opportunistic fungal pathogen that afflicts immunocompromised human hosts. With rare and transient exceptions the yeast is diploid, yet despite its clinical relevance the respective sequences of its two homologous chromosomes have not been completely resolved. RESULTS: We construct a phased diploid genome assembly by deep sequencing a standard laboratory wild-type strain and a panel of strains homozygous for particular chromosomes. The assembly has 700-fold coverage on average, allowing extensive revision and expansion of the number of known SNPs and indels. This phased genome significantly enhances the sensitivity and specificity of allele-specific expression measurements by enabling pooling and cross-validation of signal across multiple polymorphic sites. Additionally, the diploid assembly reveals pervasive and unexpected patterns in allelic differences between homologous chromosomes. Firstly, we see striking clustering of indels, concentrated primarily in the repeat sequences in promoters. Secondly, both indels and their repeat-sequence substrate are enriched near replication origins. Finally, we reveal an intimate link between repeat sequences and indels, which argues that repeat length is under selective pressure for most eukaryotes. This connection is described by a concise one-parameter model that explains repeat-sequence abundance in C. albicans as a function of the indel rate, and provides a general framework to interpret repeat abundance in species ranging from bacteria to humans. CONCLUSIONS: The phased genome assembly and insights into repeat plasticity will be valuable for better understanding allele-specific phenomena and genome evolution. BioMed Central 2013 2013-09-11 /pmc/articles/PMC4054093/ /pubmed/24025428 http://dx.doi.org/10.1186/gb-2013-14-9-r97 Text en Copyright © 2013 Muzzey et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Muzzey, Dale Schwartz, Katja Weissman, Jonathan S Sherlock, Gavin Assembly of a phased diploid Candida albicans genome facilitates allele-specific measurements and provides a simple model for repeat and indel structure |
title | Assembly of a phased diploid Candida albicans genome facilitates allele-specific measurements and provides a simple model for repeat and indel structure |
title_full | Assembly of a phased diploid Candida albicans genome facilitates allele-specific measurements and provides a simple model for repeat and indel structure |
title_fullStr | Assembly of a phased diploid Candida albicans genome facilitates allele-specific measurements and provides a simple model for repeat and indel structure |
title_full_unstemmed | Assembly of a phased diploid Candida albicans genome facilitates allele-specific measurements and provides a simple model for repeat and indel structure |
title_short | Assembly of a phased diploid Candida albicans genome facilitates allele-specific measurements and provides a simple model for repeat and indel structure |
title_sort | assembly of a phased diploid candida albicans genome facilitates allele-specific measurements and provides a simple model for repeat and indel structure |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4054093/ https://www.ncbi.nlm.nih.gov/pubmed/24025428 http://dx.doi.org/10.1186/gb-2013-14-9-r97 |
work_keys_str_mv | AT muzzeydale assemblyofaphaseddiploidcandidaalbicansgenomefacilitatesallelespecificmeasurementsandprovidesasimplemodelforrepeatandindelstructure AT schwartzkatja assemblyofaphaseddiploidcandidaalbicansgenomefacilitatesallelespecificmeasurementsandprovidesasimplemodelforrepeatandindelstructure AT weissmanjonathans assemblyofaphaseddiploidcandidaalbicansgenomefacilitatesallelespecificmeasurementsandprovidesasimplemodelforrepeatandindelstructure AT sherlockgavin assemblyofaphaseddiploidcandidaalbicansgenomefacilitatesallelespecificmeasurementsandprovidesasimplemodelforrepeatandindelstructure |