Cargando…

De novo genome assembly of Camptotheca acuminata, a natural source of the anti-cancer compound camptothecin

Camptotheca acuminata is 1 of a limited number of species that produce camptothecin, a pentacyclic quinoline alkaloid with anti-cancer activity due to its ability to inhibit DNA topoisomerase. While transcriptome studies have been performed previously with various camptothecin-producing species, no...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Dongyan, Hamilton, John P., Pham, Gina M., Crisovan, Emily, Wiegert-Rininger, Krystle, Vaillancourt, Brieanne, DellaPenna, Dean, Buell, C. Robin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5737489/
https://www.ncbi.nlm.nih.gov/pubmed/28922823
http://dx.doi.org/10.1093/gigascience/gix065
_version_ 1783287527656914944
author Zhao, Dongyan
Hamilton, John P.
Pham, Gina M.
Crisovan, Emily
Wiegert-Rininger, Krystle
Vaillancourt, Brieanne
DellaPenna, Dean
Buell, C. Robin
author_facet Zhao, Dongyan
Hamilton, John P.
Pham, Gina M.
Crisovan, Emily
Wiegert-Rininger, Krystle
Vaillancourt, Brieanne
DellaPenna, Dean
Buell, C. Robin
author_sort Zhao, Dongyan
collection PubMed
description Camptotheca acuminata is 1 of a limited number of species that produce camptothecin, a pentacyclic quinoline alkaloid with anti-cancer activity due to its ability to inhibit DNA topoisomerase. While transcriptome studies have been performed previously with various camptothecin-producing species, no genome sequence for a camptothecin-producing species is available to date. We generated a high-quality de novo genome assembly for C. acuminata representing 403 174 860 bp on 1394 scaffolds with an N50 scaffold size of 1752 kbp. Quality assessments of the assembly revealed robust representation of the genome sequence including genic regions. Using a novel genome annotation method, we annotated 31 825 genes encoding 40 332 gene models. Based on sequence identity and orthology with validated genes from Catharanthus roseus as well as Pfam searches, we identified candidate orthologs for genes potentially involved in camptothecin biosynthesis. Extensive gene duplication including tandem duplication was widespread in the C. acuminata genome, with 2571 genes belonging to 997 tandem duplicated gene clusters. To our knowledge, this is the first genome sequence for a camptothecin-producing species, and access to the C. acuminata genome will permit not only discovery of genes encoding the camptothecin biosynthetic pathway but also reagents that can be used for heterologous expression of camptothecin and camptothecin analogs with novel pharmaceutical applications.
format Online
Article
Text
id pubmed-5737489
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-57374892018-01-09 De novo genome assembly of Camptotheca acuminata, a natural source of the anti-cancer compound camptothecin Zhao, Dongyan Hamilton, John P. Pham, Gina M. Crisovan, Emily Wiegert-Rininger, Krystle Vaillancourt, Brieanne DellaPenna, Dean Buell, C. Robin Gigascience Data Note Camptotheca acuminata is 1 of a limited number of species that produce camptothecin, a pentacyclic quinoline alkaloid with anti-cancer activity due to its ability to inhibit DNA topoisomerase. While transcriptome studies have been performed previously with various camptothecin-producing species, no genome sequence for a camptothecin-producing species is available to date. We generated a high-quality de novo genome assembly for C. acuminata representing 403 174 860 bp on 1394 scaffolds with an N50 scaffold size of 1752 kbp. Quality assessments of the assembly revealed robust representation of the genome sequence including genic regions. Using a novel genome annotation method, we annotated 31 825 genes encoding 40 332 gene models. Based on sequence identity and orthology with validated genes from Catharanthus roseus as well as Pfam searches, we identified candidate orthologs for genes potentially involved in camptothecin biosynthesis. Extensive gene duplication including tandem duplication was widespread in the C. acuminata genome, with 2571 genes belonging to 997 tandem duplicated gene clusters. To our knowledge, this is the first genome sequence for a camptothecin-producing species, and access to the C. acuminata genome will permit not only discovery of genes encoding the camptothecin biosynthetic pathway but also reagents that can be used for heterologous expression of camptothecin and camptothecin analogs with novel pharmaceutical applications. Oxford University Press 2017-07-24 /pmc/articles/PMC5737489/ /pubmed/28922823 http://dx.doi.org/10.1093/gigascience/gix065 Text en © The Authors 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Data Note
Zhao, Dongyan
Hamilton, John P.
Pham, Gina M.
Crisovan, Emily
Wiegert-Rininger, Krystle
Vaillancourt, Brieanne
DellaPenna, Dean
Buell, C. Robin
De novo genome assembly of Camptotheca acuminata, a natural source of the anti-cancer compound camptothecin
title De novo genome assembly of Camptotheca acuminata, a natural source of the anti-cancer compound camptothecin
title_full De novo genome assembly of Camptotheca acuminata, a natural source of the anti-cancer compound camptothecin
title_fullStr De novo genome assembly of Camptotheca acuminata, a natural source of the anti-cancer compound camptothecin
title_full_unstemmed De novo genome assembly of Camptotheca acuminata, a natural source of the anti-cancer compound camptothecin
title_short De novo genome assembly of Camptotheca acuminata, a natural source of the anti-cancer compound camptothecin
title_sort de novo genome assembly of camptotheca acuminata, a natural source of the anti-cancer compound camptothecin
topic Data Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5737489/
https://www.ncbi.nlm.nih.gov/pubmed/28922823
http://dx.doi.org/10.1093/gigascience/gix065
work_keys_str_mv AT zhaodongyan denovogenomeassemblyofcamptothecaacuminataanaturalsourceoftheanticancercompoundcamptothecin
AT hamiltonjohnp denovogenomeassemblyofcamptothecaacuminataanaturalsourceoftheanticancercompoundcamptothecin
AT phamginam denovogenomeassemblyofcamptothecaacuminataanaturalsourceoftheanticancercompoundcamptothecin
AT crisovanemily denovogenomeassemblyofcamptothecaacuminataanaturalsourceoftheanticancercompoundcamptothecin
AT wiegertriningerkrystle denovogenomeassemblyofcamptothecaacuminataanaturalsourceoftheanticancercompoundcamptothecin
AT vaillancourtbrieanne denovogenomeassemblyofcamptothecaacuminataanaturalsourceoftheanticancercompoundcamptothecin
AT dellapennadean denovogenomeassemblyofcamptothecaacuminataanaturalsourceoftheanticancercompoundcamptothecin
AT buellcrobin denovogenomeassemblyofcamptothecaacuminataanaturalsourceoftheanticancercompoundcamptothecin