Cargando…

Discovery of Novel Conotoxin Candidates Using Machine Learning

Cone snails (genus Conus) are venomous marine snails that inject prey with a lethal cocktail of conotoxins, small, secreted, and cysteine-rich peptides. Given the diversity and often high affinity for their molecular targets, consisting of ion channels, receptors or transporters, many conotoxins hav...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Qing, Watkins, Maren, Robinson, Samuel D., Safavi-Hemami, Helena, Yandell, Mark
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6315676/
https://www.ncbi.nlm.nih.gov/pubmed/30513724
http://dx.doi.org/10.3390/toxins10120503
_version_ 1783384351243763712
author Li, Qing
Watkins, Maren
Robinson, Samuel D.
Safavi-Hemami, Helena
Yandell, Mark
author_facet Li, Qing
Watkins, Maren
Robinson, Samuel D.
Safavi-Hemami, Helena
Yandell, Mark
author_sort Li, Qing
collection PubMed
description Cone snails (genus Conus) are venomous marine snails that inject prey with a lethal cocktail of conotoxins, small, secreted, and cysteine-rich peptides. Given the diversity and often high affinity for their molecular targets, consisting of ion channels, receptors or transporters, many conotoxins have become invaluable pharmacological probes, drug leads, and therapeutics. Transcriptome sequencing of Conus venom glands followed by de novo assembly and homology-based toxin identification and annotation is currently the state-of-the-art for discovery of new conotoxins. However, homology-based search techniques, by definition, can only detect novel toxins that are homologous to previously reported conotoxins. To overcome these obstacles for discovery, we have created ConusPipe, a machine learning tool that utilizes prominent chemical characters of conotoxins to predict whether a certain transcript in a Conus transcriptome, which has no otherwise detectable homologs in current reference databases, is a putative conotoxin. By using ConusPipe on RNASeq data of 10 species, we report 5148 new putative conotoxin transcripts that have no homologues in current reference databases. 896 of these were identified by at least three out of four models used. These data significantly expand current publicly available conotoxin datasets and our approach provides a new computational avenue for the discovery of novel toxin families.
format Online
Article
Text
id pubmed-6315676
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-63156762019-01-11 Discovery of Novel Conotoxin Candidates Using Machine Learning Li, Qing Watkins, Maren Robinson, Samuel D. Safavi-Hemami, Helena Yandell, Mark Toxins (Basel) Article Cone snails (genus Conus) are venomous marine snails that inject prey with a lethal cocktail of conotoxins, small, secreted, and cysteine-rich peptides. Given the diversity and often high affinity for their molecular targets, consisting of ion channels, receptors or transporters, many conotoxins have become invaluable pharmacological probes, drug leads, and therapeutics. Transcriptome sequencing of Conus venom glands followed by de novo assembly and homology-based toxin identification and annotation is currently the state-of-the-art for discovery of new conotoxins. However, homology-based search techniques, by definition, can only detect novel toxins that are homologous to previously reported conotoxins. To overcome these obstacles for discovery, we have created ConusPipe, a machine learning tool that utilizes prominent chemical characters of conotoxins to predict whether a certain transcript in a Conus transcriptome, which has no otherwise detectable homologs in current reference databases, is a putative conotoxin. By using ConusPipe on RNASeq data of 10 species, we report 5148 new putative conotoxin transcripts that have no homologues in current reference databases. 896 of these were identified by at least three out of four models used. These data significantly expand current publicly available conotoxin datasets and our approach provides a new computational avenue for the discovery of novel toxin families. MDPI 2018-12-01 /pmc/articles/PMC6315676/ /pubmed/30513724 http://dx.doi.org/10.3390/toxins10120503 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Li, Qing
Watkins, Maren
Robinson, Samuel D.
Safavi-Hemami, Helena
Yandell, Mark
Discovery of Novel Conotoxin Candidates Using Machine Learning
title Discovery of Novel Conotoxin Candidates Using Machine Learning
title_full Discovery of Novel Conotoxin Candidates Using Machine Learning
title_fullStr Discovery of Novel Conotoxin Candidates Using Machine Learning
title_full_unstemmed Discovery of Novel Conotoxin Candidates Using Machine Learning
title_short Discovery of Novel Conotoxin Candidates Using Machine Learning
title_sort discovery of novel conotoxin candidates using machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6315676/
https://www.ncbi.nlm.nih.gov/pubmed/30513724
http://dx.doi.org/10.3390/toxins10120503
work_keys_str_mv AT liqing discoveryofnovelconotoxincandidatesusingmachinelearning
AT watkinsmaren discoveryofnovelconotoxincandidatesusingmachinelearning
AT robinsonsamueld discoveryofnovelconotoxincandidatesusingmachinelearning
AT safavihemamihelena discoveryofnovelconotoxincandidatesusingmachinelearning
AT yandellmark discoveryofnovelconotoxincandidatesusingmachinelearning