Cargando…
CASSIS and SMIPS: promoter-based prediction of secondary metabolite gene clusters in eukaryotic genomes
Motivation: Secondary metabolites (SM) are structurally diverse natural products of high pharmaceutical importance. Genes involved in their biosynthesis are often organized in clusters, i.e., are co-localized and co-expressed. In silico cluster prediction in eukaryotic genomes remains problematic ma...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4824125/ https://www.ncbi.nlm.nih.gov/pubmed/26656005 http://dx.doi.org/10.1093/bioinformatics/btv713 |
_version_ | 1782426046307500032 |
---|---|
author | Wolf, Thomas Shelest, Vladimir Nath, Neetika Shelest, Ekaterina |
author_facet | Wolf, Thomas Shelest, Vladimir Nath, Neetika Shelest, Ekaterina |
author_sort | Wolf, Thomas |
collection | PubMed |
description | Motivation: Secondary metabolites (SM) are structurally diverse natural products of high pharmaceutical importance. Genes involved in their biosynthesis are often organized in clusters, i.e., are co-localized and co-expressed. In silico cluster prediction in eukaryotic genomes remains problematic mainly due to the high variability of the clusters’ content and lack of other distinguishing sequence features. Results: We present Cluster Assignment by Islands of Sites (CASSIS), a method for SM cluster prediction in eukaryotic genomes, and Secondary Metabolites by InterProScan (SMIPS), a tool for genome-wide detection of SM key enzymes (‘anchor’ genes): polyketide synthases, non-ribosomal peptide synthetases and dimethylallyl tryptophan synthases. Unlike other tools based on protein similarity, CASSIS exploits the idea of co-regulation of the cluster genes, which assumes the existence of common regulatory patterns in the cluster promoters. The method searches for ‘islands’ of enriched cluster-specific motifs in the vicinity of anchor genes. It was validated in a series of cross-validation experiments and showed high sensitivity and specificity. Availability and implementation: CASSIS and SMIPS are freely available at https://sbi.hki-jena.de/cassis. Contact: thomas.wolf@leibniz-hki.de or ekaterina.shelest@leibniz-hki.de Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-4824125 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-48241252016-04-08 CASSIS and SMIPS: promoter-based prediction of secondary metabolite gene clusters in eukaryotic genomes Wolf, Thomas Shelest, Vladimir Nath, Neetika Shelest, Ekaterina Bioinformatics Original Papers Motivation: Secondary metabolites (SM) are structurally diverse natural products of high pharmaceutical importance. Genes involved in their biosynthesis are often organized in clusters, i.e., are co-localized and co-expressed. In silico cluster prediction in eukaryotic genomes remains problematic mainly due to the high variability of the clusters’ content and lack of other distinguishing sequence features. Results: We present Cluster Assignment by Islands of Sites (CASSIS), a method for SM cluster prediction in eukaryotic genomes, and Secondary Metabolites by InterProScan (SMIPS), a tool for genome-wide detection of SM key enzymes (‘anchor’ genes): polyketide synthases, non-ribosomal peptide synthetases and dimethylallyl tryptophan synthases. Unlike other tools based on protein similarity, CASSIS exploits the idea of co-regulation of the cluster genes, which assumes the existence of common regulatory patterns in the cluster promoters. The method searches for ‘islands’ of enriched cluster-specific motifs in the vicinity of anchor genes. It was validated in a series of cross-validation experiments and showed high sensitivity and specificity. Availability and implementation: CASSIS and SMIPS are freely available at https://sbi.hki-jena.de/cassis. Contact: thomas.wolf@leibniz-hki.de or ekaterina.shelest@leibniz-hki.de Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2016-04-15 2015-12-09 /pmc/articles/PMC4824125/ /pubmed/26656005 http://dx.doi.org/10.1093/bioinformatics/btv713 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Papers Wolf, Thomas Shelest, Vladimir Nath, Neetika Shelest, Ekaterina CASSIS and SMIPS: promoter-based prediction of secondary metabolite gene clusters in eukaryotic genomes |
title | CASSIS and SMIPS: promoter-based prediction of secondary metabolite gene clusters in eukaryotic genomes |
title_full | CASSIS and SMIPS: promoter-based prediction of secondary metabolite gene clusters in eukaryotic genomes |
title_fullStr | CASSIS and SMIPS: promoter-based prediction of secondary metabolite gene clusters in eukaryotic genomes |
title_full_unstemmed | CASSIS and SMIPS: promoter-based prediction of secondary metabolite gene clusters in eukaryotic genomes |
title_short | CASSIS and SMIPS: promoter-based prediction of secondary metabolite gene clusters in eukaryotic genomes |
title_sort | cassis and smips: promoter-based prediction of secondary metabolite gene clusters in eukaryotic genomes |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4824125/ https://www.ncbi.nlm.nih.gov/pubmed/26656005 http://dx.doi.org/10.1093/bioinformatics/btv713 |
work_keys_str_mv | AT wolfthomas cassisandsmipspromoterbasedpredictionofsecondarymetabolitegeneclustersineukaryoticgenomes AT shelestvladimir cassisandsmipspromoterbasedpredictionofsecondarymetabolitegeneclustersineukaryoticgenomes AT nathneetika cassisandsmipspromoterbasedpredictionofsecondarymetabolitegeneclustersineukaryoticgenomes AT shelestekaterina cassisandsmipspromoterbasedpredictionofsecondarymetabolitegeneclustersineukaryoticgenomes |