Cargando…
Systematic interrogation of the Conus marmoreus venom duct transcriptome with ConoSorter reveals 158 novel conotoxins and 13 new gene superfamilies
BACKGROUND: Conopeptides, often generically referred to as conotoxins, are small neurotoxins found in the venom of predatory marine cone snails. These molecules are highly stable and are able to efficiently and selectively interact with a wide variety of heterologous receptors and channels, making t...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3853152/ https://www.ncbi.nlm.nih.gov/pubmed/24131469 http://dx.doi.org/10.1186/1471-2164-14-708 |
_version_ | 1782478789364678656 |
---|---|
author | Lavergne, Vincent Dutertre, Sébastien Jin, Ai-hua Lewis, Richard J Taft, Ryan J Alewood, Paul F |
author_facet | Lavergne, Vincent Dutertre, Sébastien Jin, Ai-hua Lewis, Richard J Taft, Ryan J Alewood, Paul F |
author_sort | Lavergne, Vincent |
collection | PubMed |
description | BACKGROUND: Conopeptides, often generically referred to as conotoxins, are small neurotoxins found in the venom of predatory marine cone snails. These molecules are highly stable and are able to efficiently and selectively interact with a wide variety of heterologous receptors and channels, making them valuable pharmacological probes and potential drug leads. Recent advances in next-generation RNA sequencing and high-throughput proteomics have led to the generation of large data sets that require purpose-built and dedicated bioinformatics tools for efficient data mining. RESULTS: Here we describe ConoSorter, an algorithm that categorizes cDNA or protein sequences into conopeptide superfamilies and classes based on their signal, pro- and mature region sequence composition. ConoSorter also catalogues key sequence characteristics (including relative sequence frequency, length, number of cysteines, N-terminal hydrophobicity, sequence similarity score) and automatically searches the ConoServer database for known precursor sequences, facilitating identification of known and novel conopeptides. When applied to ConoServer and UniProtKB/Swiss-Prot databases, ConoSorter is able to recognize 100% of known conotoxin superfamilies and classes with a minimum species specificity of 99%. As a proof of concept, we performed a reanalysis of Conus marmoreus venom duct transcriptome and (i) correctly classified all sequences previously annotated, (ii) identified 158 novel precursor conopeptide transcripts, 106 of which were confirmed by protein mass spectrometry, and (iii) identified another 13 novel conotoxin gene superfamilies. CONCLUSIONS: Taken together, these findings indicate that ConoSorter is not only capable of robust classification of known conopeptides from large RNA data sets, but can also facilitate de novo identification of conopeptides which may have pharmaceutical importance. |
format | Online Article Text |
id | pubmed-3853152 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-38531522013-12-18 Systematic interrogation of the Conus marmoreus venom duct transcriptome with ConoSorter reveals 158 novel conotoxins and 13 new gene superfamilies Lavergne, Vincent Dutertre, Sébastien Jin, Ai-hua Lewis, Richard J Taft, Ryan J Alewood, Paul F BMC Genomics Methodology Article BACKGROUND: Conopeptides, often generically referred to as conotoxins, are small neurotoxins found in the venom of predatory marine cone snails. These molecules are highly stable and are able to efficiently and selectively interact with a wide variety of heterologous receptors and channels, making them valuable pharmacological probes and potential drug leads. Recent advances in next-generation RNA sequencing and high-throughput proteomics have led to the generation of large data sets that require purpose-built and dedicated bioinformatics tools for efficient data mining. RESULTS: Here we describe ConoSorter, an algorithm that categorizes cDNA or protein sequences into conopeptide superfamilies and classes based on their signal, pro- and mature region sequence composition. ConoSorter also catalogues key sequence characteristics (including relative sequence frequency, length, number of cysteines, N-terminal hydrophobicity, sequence similarity score) and automatically searches the ConoServer database for known precursor sequences, facilitating identification of known and novel conopeptides. When applied to ConoServer and UniProtKB/Swiss-Prot databases, ConoSorter is able to recognize 100% of known conotoxin superfamilies and classes with a minimum species specificity of 99%. As a proof of concept, we performed a reanalysis of Conus marmoreus venom duct transcriptome and (i) correctly classified all sequences previously annotated, (ii) identified 158 novel precursor conopeptide transcripts, 106 of which were confirmed by protein mass spectrometry, and (iii) identified another 13 novel conotoxin gene superfamilies. CONCLUSIONS: Taken together, these findings indicate that ConoSorter is not only capable of robust classification of known conopeptides from large RNA data sets, but can also facilitate de novo identification of conopeptides which may have pharmaceutical importance. BioMed Central 2013-10-16 /pmc/articles/PMC3853152/ /pubmed/24131469 http://dx.doi.org/10.1186/1471-2164-14-708 Text en Copyright © 2013 Lavergne et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Lavergne, Vincent Dutertre, Sébastien Jin, Ai-hua Lewis, Richard J Taft, Ryan J Alewood, Paul F Systematic interrogation of the Conus marmoreus venom duct transcriptome with ConoSorter reveals 158 novel conotoxins and 13 new gene superfamilies |
title | Systematic interrogation of the Conus marmoreus venom duct transcriptome with ConoSorter reveals 158 novel conotoxins and 13 new gene superfamilies |
title_full | Systematic interrogation of the Conus marmoreus venom duct transcriptome with ConoSorter reveals 158 novel conotoxins and 13 new gene superfamilies |
title_fullStr | Systematic interrogation of the Conus marmoreus venom duct transcriptome with ConoSorter reveals 158 novel conotoxins and 13 new gene superfamilies |
title_full_unstemmed | Systematic interrogation of the Conus marmoreus venom duct transcriptome with ConoSorter reveals 158 novel conotoxins and 13 new gene superfamilies |
title_short | Systematic interrogation of the Conus marmoreus venom duct transcriptome with ConoSorter reveals 158 novel conotoxins and 13 new gene superfamilies |
title_sort | systematic interrogation of the conus marmoreus venom duct transcriptome with conosorter reveals 158 novel conotoxins and 13 new gene superfamilies |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3853152/ https://www.ncbi.nlm.nih.gov/pubmed/24131469 http://dx.doi.org/10.1186/1471-2164-14-708 |
work_keys_str_mv | AT lavergnevincent systematicinterrogationoftheconusmarmoreusvenomducttranscriptomewithconosorterreveals158novelconotoxinsand13newgenesuperfamilies AT dutertresebastien systematicinterrogationoftheconusmarmoreusvenomducttranscriptomewithconosorterreveals158novelconotoxinsand13newgenesuperfamilies AT jinaihua systematicinterrogationoftheconusmarmoreusvenomducttranscriptomewithconosorterreveals158novelconotoxinsand13newgenesuperfamilies AT lewisrichardj systematicinterrogationoftheconusmarmoreusvenomducttranscriptomewithconosorterreveals158novelconotoxinsand13newgenesuperfamilies AT taftryanj systematicinterrogationoftheconusmarmoreusvenomducttranscriptomewithconosorterreveals158novelconotoxinsand13newgenesuperfamilies AT alewoodpaulf systematicinterrogationoftheconusmarmoreusvenomducttranscriptomewithconosorterreveals158novelconotoxinsand13newgenesuperfamilies |