Cargando…

In silico hybridization enables transcriptomic illumination of the nature and evolution of Myxozoa

BACKGROUND: The Myxozoa, a group of oligocellular, obligate endoparasites, has long been poorly understood in an evolutionary context. Recent genome-level sequencing techniques such as RNA-seq have generated large amounts of myxozoan sequence data, providing valuable insight into their evolutionary...

Descripción completa

Detalles Bibliográficos
Autores principales: Foox, Jonathan, Ringuette, Maurice, Desser, Sherwin S., Siddall, Mark E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4619090/
https://www.ncbi.nlm.nih.gov/pubmed/26494377
http://dx.doi.org/10.1186/s12864-015-2039-6
_version_ 1782397038584922112
author Foox, Jonathan
Ringuette, Maurice
Desser, Sherwin S.
Siddall, Mark E.
author_facet Foox, Jonathan
Ringuette, Maurice
Desser, Sherwin S.
Siddall, Mark E.
author_sort Foox, Jonathan
collection PubMed
description BACKGROUND: The Myxozoa, a group of oligocellular, obligate endoparasites, has long been poorly understood in an evolutionary context. Recent genome-level sequencing techniques such as RNA-seq have generated large amounts of myxozoan sequence data, providing valuable insight into their evolutionary history. However, sequences from host tissue contamination are present in next-generation sequencing reactions of myxozoan tissue, and differentiating between the two has been inadequately addressed. In order to shed light on the genetic underpinnings of myxozoan biology, assembled contigs generated from these studies that derived from the myxozoan must be decoupled from transcripts derived from host tissue and other contamination. This study describes a pipeline for categorization of transcripts asmyxozoan based on similarity searching with known host and parasite sequences, explores the extent to which host contamination is present in previously existing myxozoan datasets, and implements this pipeline on a newly sequenced transcriptome of Myxobolus pendula, a parasite of the common creek chub gill arch. METHODS: The insilico hybridization pipeline uses iterative BLAST searching and database-driven e-value comparison to categorize transcripts as deriving from host, parasite, or other contamination. Functional genetic analysis of M. pendula was conducted using further BLAST searching, Hidden Markov Modeling, and sequence alignment and phylogenetic reconstruction. RESULTS: Three RNA libraries of encysted M. pendula plasmodia were sequenced and subjected to the method. Nearly half of the final set of contiguous assembly sequences (47.3 %) was identified as putative myxozoan transcripts. Putative contamination was also identified in at least 1/3(rd) of previously published myxozoan transcripts. The set of M. pendula transcripts was mined for a range of biologically insightful genes, including taxonomically restricted nematocyst structural proteins and nematocyst proteins identified through mass tandem spectrometry of other cnidarians. Several novel findings emerged, including a fourth myxozoan minicollagen gene, putative myxozoan toxin proteins,and extracellular matrix glycoproteins. CONCLUSIONS: This study serves as a model for the handling of next-generation myxozoan sequence. The need for careful categorization was demonstrated in both previous and new sets of myxozoan sequences. The final set of confidently assigned myxozoan transcripts can be mined for any biologically relevant gene or gene family without spurious misidentification of host contamination as a myxozoan homolog. As exemplified by M. pendula, the repertoire of myxozoan polar capsules may be more complex than previously thought, with an additional minicollagen homolog and putative expression of toxin proteins. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-2039-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4619090
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46190902015-10-25 In silico hybridization enables transcriptomic illumination of the nature and evolution of Myxozoa Foox, Jonathan Ringuette, Maurice Desser, Sherwin S. Siddall, Mark E. BMC Genomics Research Article BACKGROUND: The Myxozoa, a group of oligocellular, obligate endoparasites, has long been poorly understood in an evolutionary context. Recent genome-level sequencing techniques such as RNA-seq have generated large amounts of myxozoan sequence data, providing valuable insight into their evolutionary history. However, sequences from host tissue contamination are present in next-generation sequencing reactions of myxozoan tissue, and differentiating between the two has been inadequately addressed. In order to shed light on the genetic underpinnings of myxozoan biology, assembled contigs generated from these studies that derived from the myxozoan must be decoupled from transcripts derived from host tissue and other contamination. This study describes a pipeline for categorization of transcripts asmyxozoan based on similarity searching with known host and parasite sequences, explores the extent to which host contamination is present in previously existing myxozoan datasets, and implements this pipeline on a newly sequenced transcriptome of Myxobolus pendula, a parasite of the common creek chub gill arch. METHODS: The insilico hybridization pipeline uses iterative BLAST searching and database-driven e-value comparison to categorize transcripts as deriving from host, parasite, or other contamination. Functional genetic analysis of M. pendula was conducted using further BLAST searching, Hidden Markov Modeling, and sequence alignment and phylogenetic reconstruction. RESULTS: Three RNA libraries of encysted M. pendula plasmodia were sequenced and subjected to the method. Nearly half of the final set of contiguous assembly sequences (47.3 %) was identified as putative myxozoan transcripts. Putative contamination was also identified in at least 1/3(rd) of previously published myxozoan transcripts. The set of M. pendula transcripts was mined for a range of biologically insightful genes, including taxonomically restricted nematocyst structural proteins and nematocyst proteins identified through mass tandem spectrometry of other cnidarians. Several novel findings emerged, including a fourth myxozoan minicollagen gene, putative myxozoan toxin proteins,and extracellular matrix glycoproteins. CONCLUSIONS: This study serves as a model for the handling of next-generation myxozoan sequence. The need for careful categorization was demonstrated in both previous and new sets of myxozoan sequences. The final set of confidently assigned myxozoan transcripts can be mined for any biologically relevant gene or gene family without spurious misidentification of host contamination as a myxozoan homolog. As exemplified by M. pendula, the repertoire of myxozoan polar capsules may be more complex than previously thought, with an additional minicollagen homolog and putative expression of toxin proteins. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-2039-6) contains supplementary material, which is available to authorized users. BioMed Central 2015-10-23 /pmc/articles/PMC4619090/ /pubmed/26494377 http://dx.doi.org/10.1186/s12864-015-2039-6 Text en © Foox et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Foox, Jonathan
Ringuette, Maurice
Desser, Sherwin S.
Siddall, Mark E.
In silico hybridization enables transcriptomic illumination of the nature and evolution of Myxozoa
title In silico hybridization enables transcriptomic illumination of the nature and evolution of Myxozoa
title_full In silico hybridization enables transcriptomic illumination of the nature and evolution of Myxozoa
title_fullStr In silico hybridization enables transcriptomic illumination of the nature and evolution of Myxozoa
title_full_unstemmed In silico hybridization enables transcriptomic illumination of the nature and evolution of Myxozoa
title_short In silico hybridization enables transcriptomic illumination of the nature and evolution of Myxozoa
title_sort in silico hybridization enables transcriptomic illumination of the nature and evolution of myxozoa
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4619090/
https://www.ncbi.nlm.nih.gov/pubmed/26494377
http://dx.doi.org/10.1186/s12864-015-2039-6
work_keys_str_mv AT fooxjonathan insilicohybridizationenablestranscriptomicilluminationofthenatureandevolutionofmyxozoa
AT ringuettemaurice insilicohybridizationenablestranscriptomicilluminationofthenatureandevolutionofmyxozoa
AT dessersherwins insilicohybridizationenablestranscriptomicilluminationofthenatureandevolutionofmyxozoa
AT siddallmarke insilicohybridizationenablestranscriptomicilluminationofthenatureandevolutionofmyxozoa