The Candida Genome Database (CGD): incorporation of Assembly 22, systematic identifiers and visualization of high throughput sequencing data
The Candida Genome Database (CGD, http://www.candidagenome.org/) is a freely available online resource that provides gene, protein and sequence information for multiple Candida species, along with web-based tools for accessing, analyzing and exploring these data. The mission of CGD is to facilitate...
Main authors: | Skrzypek, Marek S.; Binkley, Jonathan; Binkley, Gail; Miyasato, Stuart R.; Simison, Matt; Sherlock, Gavin |
---|---|
Format: | Online Article Text |
Language: | English |
Published: |
Oxford University Press
2017
|
Subjects: | Database Issue |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5210628/ https://www.ncbi.nlm.nih.gov/pubmed/27738138 http://dx.doi.org/10.1093/nar/gkw924 |
_version_ | 1782490922865393664 |
---|---|
author | Skrzypek, Marek S. Binkley, Jonathan Binkley, Gail Miyasato, Stuart R. Simison, Matt Sherlock, Gavin |
author_sort | Skrzypek, Marek S. |
collection | PubMed |
description | The Candida Genome Database (CGD, http://www.candidagenome.org/) is a freely available online resource that provides gene, protein and sequence information for multiple Candida species, along with web-based tools for accessing, analyzing and exploring these data. The mission of CGD is to facilitate and accelerate research into Candida pathogenesis and biology, by curating the scientific literature in real time, and connecting literature-derived annotations to the latest version of the genomic sequence and its annotations. Here, we report the incorporation into CGD of Assembly 22, the first chromosome-level, phased diploid assembly of the C. albicans genome, coupled with improvements that we have made to the assembly using additional available sequence data. We also report the creation of systematic identifiers for C. albicans genes and sequence features using a system similar to that adopted by the yeast community over two decades ago. Finally, we describe the incorporation of JBrowse into CGD, which allows online browsing of mapped high throughput sequencing data, and its implementation for several RNA-Seq data sets, as well as the whole genome sequencing data that was used in the construction of Assembly 22. |
format | Online Article Text |
id | pubmed-5210628 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-5210628 2017-01-05 The Candida Genome Database (CGD): incorporation of Assembly 22, systematic identifiers and visualization of high throughput sequencing data Skrzypek, Marek S. Binkley, Jonathan Binkley, Gail Miyasato, Stuart R. Simison, Matt Sherlock, Gavin Nucleic Acids Res Database Issue The Candida Genome Database (CGD, http://www.candidagenome.org/) is a freely available online resource that provides gene, protein and sequence information for multiple Candida species, along with web-based tools for accessing, analyzing and exploring these data. The mission of CGD is to facilitate and accelerate research into Candida pathogenesis and biology, by curating the scientific literature in real time, and connecting literature-derived annotations to the latest version of the genomic sequence and its annotations. Here, we report the incorporation into CGD of Assembly 22, the first chromosome-level, phased diploid assembly of the C. albicans genome, coupled with improvements that we have made to the assembly using additional available sequence data. We also report the creation of systematic identifiers for C. albicans genes and sequence features using a system similar to that adopted by the yeast community over two decades ago. Finally, we describe the incorporation of JBrowse into CGD, which allows online browsing of mapped high throughput sequencing data, and its implementation for several RNA-Seq data sets, as well as the whole genome sequencing data that was used in the construction of Assembly 22. Oxford University Press 2017-01-04 2016-10-13 /pmc/articles/PMC5210628/ /pubmed/27738138 http://dx.doi.org/10.1093/nar/gkw924 Text en © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. 
http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Database Issue Skrzypek, Marek S. Binkley, Jonathan Binkley, Gail Miyasato, Stuart R. Simison, Matt Sherlock, Gavin The Candida Genome Database (CGD): incorporation of Assembly 22, systematic identifiers and visualization of high throughput sequencing data |
title | The Candida Genome Database (CGD): incorporation of Assembly 22, systematic identifiers and visualization of high throughput sequencing data |
topic | Database Issue |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5210628/ https://www.ncbi.nlm.nih.gov/pubmed/27738138 http://dx.doi.org/10.1093/nar/gkw924 |