Cargando…

Construction of a medicinal leech transcriptome database and its application to the identification of leech homologs of neural and innate immune genes

BACKGROUND: The medicinal leech, Hirudo medicinalis, is an important model system for the study of nervous system structure, function, development, regeneration and repair. It is also a unique species in being presently approved for use in medical procedures, such as clearing of pooled blood followi...

Descripción completa

Detalles Bibliográficos
Autores principales: Macagno, Eduardo R, Gaasterland, Terry, Edsall, Lee, Bafna, Vineet, Soares, Marcelo B, Scheetz, Todd, Casavant, Thomas, Da Silva, Corinne, Wincker, Patrick, Tasiemski, Aurélie, Salzet, Michel
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2996935/
https://www.ncbi.nlm.nih.gov/pubmed/20579359
http://dx.doi.org/10.1186/1471-2164-11-407
_version_ 1782193238174597120
author Macagno, Eduardo R
Gaasterland, Terry
Edsall, Lee
Bafna, Vineet
Soares, Marcelo B
Scheetz, Todd
Casavant, Thomas
Da Silva, Corinne
Wincker, Patrick
Tasiemski, Aurélie
Salzet, Michel
author_facet Macagno, Eduardo R
Gaasterland, Terry
Edsall, Lee
Bafna, Vineet
Soares, Marcelo B
Scheetz, Todd
Casavant, Thomas
Da Silva, Corinne
Wincker, Patrick
Tasiemski, Aurélie
Salzet, Michel
author_sort Macagno, Eduardo R
collection PubMed
description BACKGROUND: The medicinal leech, Hirudo medicinalis, is an important model system for the study of nervous system structure, function, development, regeneration and repair. It is also a unique species in being presently approved for use in medical procedures, such as clearing of pooled blood following certain surgical procedures. It is a current, and potentially also future, source of medically useful molecular factors, such as anticoagulants and antibacterial peptides, which may have evolved as a result of its parasitizing large mammals, including humans. Despite the broad focus of research on this system, little has been done at the genomic or transcriptomic levels and there is a paucity of openly available sequence data. To begin to address this problem, we constructed whole embryo and adult central nervous system (CNS) EST libraries and created a clustered sequence database of the Hirudo transcriptome that is available to the scientific community. RESULTS: A total of ~133,000 EST clones from two directionally-cloned cDNA libraries, one constructed from mRNA derived from whole embryos at several developmental stages and the other from adult CNS cords, were sequenced in one or both directions by three different groups: Genoscope (French National Sequencing Center), the University of Iowa Sequencing Facility and the DOE Joint Genome Institute. These were assembled using the phrap software package into 31,232 unique contigs and singletons, with an average length of 827 nt. The assembled transcripts were then translated in all six frames and compared to proteins in NCBI's non-redundant (NR) and to the Gene Ontology (GO) protein sequence databases, resulting in 15,565 matches to 11,236 proteins in NR and 13,935 matches to 8,073 proteins in GO. Searching the database for transcripts of genes homologous to those thought to be involved in the innate immune responses of vertebrates and other invertebrates yielded a set of nearly one hundred evolutionarily conserved sequences, representing all known pathways involved in these important functions. CONCLUSIONS: The sequences obtained for Hirudo transcripts represent the first major database of genes expressed in this important model system. Comparison of translated open reading frames (ORFs) with the other openly available leech datasets, the genome and transcriptome of Helobdella robusta, shows an average identity at the amino acid level of 58% in matched sequences. Interestingly, comparison with other available Lophotrochozoans shows similar high levels of amino acid identity, where sequences match, for example, 64% with Capitella capitata (a polychaete) and 56% with Aplysia californica (a mollusk), as well as 58% with Schistosoma mansoni (a platyhelminth). Phylogenetic comparisons of putative Hirudo innate immune response genes present within the Hirudo transcriptome database herein described show a strong resemblance to the corresponding mammalian genes, indicating that this important physiological response may have older origins than what has been previously proposed.
format Text
id pubmed-2996935
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29969352010-12-07 Construction of a medicinal leech transcriptome database and its application to the identification of leech homologs of neural and innate immune genes Macagno, Eduardo R Gaasterland, Terry Edsall, Lee Bafna, Vineet Soares, Marcelo B Scheetz, Todd Casavant, Thomas Da Silva, Corinne Wincker, Patrick Tasiemski, Aurélie Salzet, Michel BMC Genomics Research Article BACKGROUND: The medicinal leech, Hirudo medicinalis, is an important model system for the study of nervous system structure, function, development, regeneration and repair. It is also a unique species in being presently approved for use in medical procedures, such as clearing of pooled blood following certain surgical procedures. It is a current, and potentially also future, source of medically useful molecular factors, such as anticoagulants and antibacterial peptides, which may have evolved as a result of its parasitizing large mammals, including humans. Despite the broad focus of research on this system, little has been done at the genomic or transcriptomic levels and there is a paucity of openly available sequence data. To begin to address this problem, we constructed whole embryo and adult central nervous system (CNS) EST libraries and created a clustered sequence database of the Hirudo transcriptome that is available to the scientific community. RESULTS: A total of ~133,000 EST clones from two directionally-cloned cDNA libraries, one constructed from mRNA derived from whole embryos at several developmental stages and the other from adult CNS cords, were sequenced in one or both directions by three different groups: Genoscope (French National Sequencing Center), the University of Iowa Sequencing Facility and the DOE Joint Genome Institute. These were assembled using the phrap software package into 31,232 unique contigs and singletons, with an average length of 827 nt. The assembled transcripts were then translated in all six frames and compared to proteins in NCBI's non-redundant (NR) and to the Gene Ontology (GO) protein sequence databases, resulting in 15,565 matches to 11,236 proteins in NR and 13,935 matches to 8,073 proteins in GO. Searching the database for transcripts of genes homologous to those thought to be involved in the innate immune responses of vertebrates and other invertebrates yielded a set of nearly one hundred evolutionarily conserved sequences, representing all known pathways involved in these important functions. CONCLUSIONS: The sequences obtained for Hirudo transcripts represent the first major database of genes expressed in this important model system. Comparison of translated open reading frames (ORFs) with the other openly available leech datasets, the genome and transcriptome of Helobdella robusta, shows an average identity at the amino acid level of 58% in matched sequences. Interestingly, comparison with other available Lophotrochozoans shows similar high levels of amino acid identity, where sequences match, for example, 64% with Capitella capitata (a polychaete) and 56% with Aplysia californica (a mollusk), as well as 58% with Schistosoma mansoni (a platyhelminth). Phylogenetic comparisons of putative Hirudo innate immune response genes present within the Hirudo transcriptome database herein described show a strong resemblance to the corresponding mammalian genes, indicating that this important physiological response may have older origins than what has been previously proposed. BioMed Central 2010-06-25 /pmc/articles/PMC2996935/ /pubmed/20579359 http://dx.doi.org/10.1186/1471-2164-11-407 Text en Copyright ©2010 Macagno et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Macagno, Eduardo R
Gaasterland, Terry
Edsall, Lee
Bafna, Vineet
Soares, Marcelo B
Scheetz, Todd
Casavant, Thomas
Da Silva, Corinne
Wincker, Patrick
Tasiemski, Aurélie
Salzet, Michel
Construction of a medicinal leech transcriptome database and its application to the identification of leech homologs of neural and innate immune genes
title Construction of a medicinal leech transcriptome database and its application to the identification of leech homologs of neural and innate immune genes
title_full Construction of a medicinal leech transcriptome database and its application to the identification of leech homologs of neural and innate immune genes
title_fullStr Construction of a medicinal leech transcriptome database and its application to the identification of leech homologs of neural and innate immune genes
title_full_unstemmed Construction of a medicinal leech transcriptome database and its application to the identification of leech homologs of neural and innate immune genes
title_short Construction of a medicinal leech transcriptome database and its application to the identification of leech homologs of neural and innate immune genes
title_sort construction of a medicinal leech transcriptome database and its application to the identification of leech homologs of neural and innate immune genes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2996935/
https://www.ncbi.nlm.nih.gov/pubmed/20579359
http://dx.doi.org/10.1186/1471-2164-11-407
work_keys_str_mv AT macagnoeduardor constructionofamedicinalleechtranscriptomedatabaseanditsapplicationtotheidentificationofleechhomologsofneuralandinnateimmunegenes
AT gaasterlandterry constructionofamedicinalleechtranscriptomedatabaseanditsapplicationtotheidentificationofleechhomologsofneuralandinnateimmunegenes
AT edsalllee constructionofamedicinalleechtranscriptomedatabaseanditsapplicationtotheidentificationofleechhomologsofneuralandinnateimmunegenes
AT bafnavineet constructionofamedicinalleechtranscriptomedatabaseanditsapplicationtotheidentificationofleechhomologsofneuralandinnateimmunegenes
AT soaresmarcelob constructionofamedicinalleechtranscriptomedatabaseanditsapplicationtotheidentificationofleechhomologsofneuralandinnateimmunegenes
AT scheetztodd constructionofamedicinalleechtranscriptomedatabaseanditsapplicationtotheidentificationofleechhomologsofneuralandinnateimmunegenes
AT casavantthomas constructionofamedicinalleechtranscriptomedatabaseanditsapplicationtotheidentificationofleechhomologsofneuralandinnateimmunegenes
AT dasilvacorinne constructionofamedicinalleechtranscriptomedatabaseanditsapplicationtotheidentificationofleechhomologsofneuralandinnateimmunegenes
AT winckerpatrick constructionofamedicinalleechtranscriptomedatabaseanditsapplicationtotheidentificationofleechhomologsofneuralandinnateimmunegenes
AT tasiemskiaurelie constructionofamedicinalleechtranscriptomedatabaseanditsapplicationtotheidentificationofleechhomologsofneuralandinnateimmunegenes
AT salzetmichel constructionofamedicinalleechtranscriptomedatabaseanditsapplicationtotheidentificationofleechhomologsofneuralandinnateimmunegenes