Cargando…

Systematic interrogation of the Conus marmoreus venom duct transcriptome with ConoSorter reveals 158 novel conotoxins and 13 new gene superfamilies

BACKGROUND: Conopeptides, often generically referred to as conotoxins, are small neurotoxins found in the venom of predatory marine cone snails. These molecules are highly stable and are able to efficiently and selectively interact with a wide variety of heterologous receptors and channels, making t...

Descripción completa

Detalles Bibliográficos
Autores principales: Lavergne, Vincent, Dutertre, Sébastien, Jin, Ai-hua, Lewis, Richard J, Taft, Ryan J, Alewood, Paul F
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3853152/
https://www.ncbi.nlm.nih.gov/pubmed/24131469
http://dx.doi.org/10.1186/1471-2164-14-708
_version_ 1782478789364678656
author Lavergne, Vincent
Dutertre, Sébastien
Jin, Ai-hua
Lewis, Richard J
Taft, Ryan J
Alewood, Paul F
author_facet Lavergne, Vincent
Dutertre, Sébastien
Jin, Ai-hua
Lewis, Richard J
Taft, Ryan J
Alewood, Paul F
author_sort Lavergne, Vincent
collection PubMed
description BACKGROUND: Conopeptides, often generically referred to as conotoxins, are small neurotoxins found in the venom of predatory marine cone snails. These molecules are highly stable and are able to efficiently and selectively interact with a wide variety of heterologous receptors and channels, making them valuable pharmacological probes and potential drug leads. Recent advances in next-generation RNA sequencing and high-throughput proteomics have led to the generation of large data sets that require purpose-built and dedicated bioinformatics tools for efficient data mining. RESULTS: Here we describe ConoSorter, an algorithm that categorizes cDNA or protein sequences into conopeptide superfamilies and classes based on their signal, pro- and mature region sequence composition. ConoSorter also catalogues key sequence characteristics (including relative sequence frequency, length, number of cysteines, N-terminal hydrophobicity, sequence similarity score) and automatically searches the ConoServer database for known precursor sequences, facilitating identification of known and novel conopeptides. When applied to ConoServer and UniProtKB/Swiss-Prot databases, ConoSorter is able to recognize 100% of known conotoxin superfamilies and classes with a minimum species specificity of 99%. As a proof of concept, we performed a reanalysis of Conus marmoreus venom duct transcriptome and (i) correctly classified all sequences previously annotated, (ii) identified 158 novel precursor conopeptide transcripts, 106 of which were confirmed by protein mass spectrometry, and (iii) identified another 13 novel conotoxin gene superfamilies. CONCLUSIONS: Taken together, these findings indicate that ConoSorter is not only capable of robust classification of known conopeptides from large RNA data sets, but can also facilitate de novo identification of conopeptides which may have pharmaceutical importance.
format Online
Article
Text
id pubmed-3853152
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38531522013-12-18 Systematic interrogation of the Conus marmoreus venom duct transcriptome with ConoSorter reveals 158 novel conotoxins and 13 new gene superfamilies Lavergne, Vincent Dutertre, Sébastien Jin, Ai-hua Lewis, Richard J Taft, Ryan J Alewood, Paul F BMC Genomics Methodology Article BACKGROUND: Conopeptides, often generically referred to as conotoxins, are small neurotoxins found in the venom of predatory marine cone snails. These molecules are highly stable and are able to efficiently and selectively interact with a wide variety of heterologous receptors and channels, making them valuable pharmacological probes and potential drug leads. Recent advances in next-generation RNA sequencing and high-throughput proteomics have led to the generation of large data sets that require purpose-built and dedicated bioinformatics tools for efficient data mining. RESULTS: Here we describe ConoSorter, an algorithm that categorizes cDNA or protein sequences into conopeptide superfamilies and classes based on their signal, pro- and mature region sequence composition. ConoSorter also catalogues key sequence characteristics (including relative sequence frequency, length, number of cysteines, N-terminal hydrophobicity, sequence similarity score) and automatically searches the ConoServer database for known precursor sequences, facilitating identification of known and novel conopeptides. When applied to ConoServer and UniProtKB/Swiss-Prot databases, ConoSorter is able to recognize 100% of known conotoxin superfamilies and classes with a minimum species specificity of 99%. As a proof of concept, we performed a reanalysis of Conus marmoreus venom duct transcriptome and (i) correctly classified all sequences previously annotated, (ii) identified 158 novel precursor conopeptide transcripts, 106 of which were confirmed by protein mass spectrometry, and (iii) identified another 13 novel conotoxin gene superfamilies. CONCLUSIONS: Taken together, these findings indicate that ConoSorter is not only capable of robust classification of known conopeptides from large RNA data sets, but can also facilitate de novo identification of conopeptides which may have pharmaceutical importance. BioMed Central 2013-10-16 /pmc/articles/PMC3853152/ /pubmed/24131469 http://dx.doi.org/10.1186/1471-2164-14-708 Text en Copyright © 2013 Lavergne et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Lavergne, Vincent
Dutertre, Sébastien
Jin, Ai-hua
Lewis, Richard J
Taft, Ryan J
Alewood, Paul F
Systematic interrogation of the Conus marmoreus venom duct transcriptome with ConoSorter reveals 158 novel conotoxins and 13 new gene superfamilies
title Systematic interrogation of the Conus marmoreus venom duct transcriptome with ConoSorter reveals 158 novel conotoxins and 13 new gene superfamilies
title_full Systematic interrogation of the Conus marmoreus venom duct transcriptome with ConoSorter reveals 158 novel conotoxins and 13 new gene superfamilies
title_fullStr Systematic interrogation of the Conus marmoreus venom duct transcriptome with ConoSorter reveals 158 novel conotoxins and 13 new gene superfamilies
title_full_unstemmed Systematic interrogation of the Conus marmoreus venom duct transcriptome with ConoSorter reveals 158 novel conotoxins and 13 new gene superfamilies
title_short Systematic interrogation of the Conus marmoreus venom duct transcriptome with ConoSorter reveals 158 novel conotoxins and 13 new gene superfamilies
title_sort systematic interrogation of the conus marmoreus venom duct transcriptome with conosorter reveals 158 novel conotoxins and 13 new gene superfamilies
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3853152/
https://www.ncbi.nlm.nih.gov/pubmed/24131469
http://dx.doi.org/10.1186/1471-2164-14-708
work_keys_str_mv AT lavergnevincent systematicinterrogationoftheconusmarmoreusvenomducttranscriptomewithconosorterreveals158novelconotoxinsand13newgenesuperfamilies
AT dutertresebastien systematicinterrogationoftheconusmarmoreusvenomducttranscriptomewithconosorterreveals158novelconotoxinsand13newgenesuperfamilies
AT jinaihua systematicinterrogationoftheconusmarmoreusvenomducttranscriptomewithconosorterreveals158novelconotoxinsand13newgenesuperfamilies
AT lewisrichardj systematicinterrogationoftheconusmarmoreusvenomducttranscriptomewithconosorterreveals158novelconotoxinsand13newgenesuperfamilies
AT taftryanj systematicinterrogationoftheconusmarmoreusvenomducttranscriptomewithconosorterreveals158novelconotoxinsand13newgenesuperfamilies
AT alewoodpaulf systematicinterrogationoftheconusmarmoreusvenomducttranscriptomewithconosorterreveals158novelconotoxinsand13newgenesuperfamilies