Cargando…
MIDDAS-M: Motif-Independent De Novo Detection of Secondary Metabolite Gene Clusters through the Integration of Genome Sequencing and Transcriptome Data
Many bioactive natural products are produced as “secondary metabolites” by plants, bacteria, and fungi. During the middle of the 20th century, several secondary metabolites from fungi revolutionized the pharmaceutical industry, for example, penicillin, lovastatin, and cyclosporine. They are generall...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3877130/ https://www.ncbi.nlm.nih.gov/pubmed/24391870 http://dx.doi.org/10.1371/journal.pone.0084028 |
_version_ | 1782297599331532800 |
---|---|
author | Umemura, Myco Koike, Hideaki Nagano, Nozomi Ishii, Tomoko Kawano, Jin Yamane, Noriko Kozone, Ikuko Horimoto, Katsuhisa Shin-ya, Kazuo Asai, Kiyoshi Yu, Jiujiang Bennett, Joan W. Machida, Masayuki |
author_facet | Umemura, Myco Koike, Hideaki Nagano, Nozomi Ishii, Tomoko Kawano, Jin Yamane, Noriko Kozone, Ikuko Horimoto, Katsuhisa Shin-ya, Kazuo Asai, Kiyoshi Yu, Jiujiang Bennett, Joan W. Machida, Masayuki |
author_sort | Umemura, Myco |
collection | PubMed |
description | Many bioactive natural products are produced as “secondary metabolites” by plants, bacteria, and fungi. During the middle of the 20th century, several secondary metabolites from fungi revolutionized the pharmaceutical industry, for example, penicillin, lovastatin, and cyclosporine. They are generally biosynthesized by enzymes encoded by clusters of coordinately regulated genes, and several motif-based methods have been developed to detect secondary metabolite biosynthetic (SMB) gene clusters using the sequence information of typical SMB core genes such as polyketide synthases (PKS) and non-ribosomal peptide synthetases (NRPS). However, no detection method exists for SMB gene clusters that are functional and do not include core SMB genes at present. To advance the exploration of SMB gene clusters, especially those without known core genes, we developed MIDDAS-M, a motif-independent de novo detection algorithm for SMB gene clusters. We integrated virtual gene cluster generation in an annotated genome sequence with highly sensitive scoring of the cooperative transcriptional regulation of cluster member genes. MIDDAS-M accurately predicted 38 SMB gene clusters that have been experimentally confirmed and/or predicted by other motif-based methods in 3 fungal strains. MIDDAS-M further identified a new SMB gene cluster for ustiloxin B, which was experimentally validated. Sequence analysis of the cluster genes indicated a novel mechanism for peptide biosynthesis independent of NRPS. Because it is fully computational and independent of empirical knowledge about SMB core genes, MIDDAS-M allows a large-scale, comprehensive analysis of SMB gene clusters, including those with novel biosynthetic mechanisms that do not contain any functionally characterized genes. |
format | Online Article Text |
id | pubmed-3877130 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-38771302014-01-03 MIDDAS-M: Motif-Independent De Novo Detection of Secondary Metabolite Gene Clusters through the Integration of Genome Sequencing and Transcriptome Data Umemura, Myco Koike, Hideaki Nagano, Nozomi Ishii, Tomoko Kawano, Jin Yamane, Noriko Kozone, Ikuko Horimoto, Katsuhisa Shin-ya, Kazuo Asai, Kiyoshi Yu, Jiujiang Bennett, Joan W. Machida, Masayuki PLoS One Research Article Many bioactive natural products are produced as “secondary metabolites” by plants, bacteria, and fungi. During the middle of the 20th century, several secondary metabolites from fungi revolutionized the pharmaceutical industry, for example, penicillin, lovastatin, and cyclosporine. They are generally biosynthesized by enzymes encoded by clusters of coordinately regulated genes, and several motif-based methods have been developed to detect secondary metabolite biosynthetic (SMB) gene clusters using the sequence information of typical SMB core genes such as polyketide synthases (PKS) and non-ribosomal peptide synthetases (NRPS). However, no detection method exists for SMB gene clusters that are functional and do not include core SMB genes at present. To advance the exploration of SMB gene clusters, especially those without known core genes, we developed MIDDAS-M, a motif-independent de novo detection algorithm for SMB gene clusters. We integrated virtual gene cluster generation in an annotated genome sequence with highly sensitive scoring of the cooperative transcriptional regulation of cluster member genes. MIDDAS-M accurately predicted 38 SMB gene clusters that have been experimentally confirmed and/or predicted by other motif-based methods in 3 fungal strains. MIDDAS-M further identified a new SMB gene cluster for ustiloxin B, which was experimentally validated. Sequence analysis of the cluster genes indicated a novel mechanism for peptide biosynthesis independent of NRPS. Because it is fully computational and independent of empirical knowledge about SMB core genes, MIDDAS-M allows a large-scale, comprehensive analysis of SMB gene clusters, including those with novel biosynthetic mechanisms that do not contain any functionally characterized genes. Public Library of Science 2013-12-31 /pmc/articles/PMC3877130/ /pubmed/24391870 http://dx.doi.org/10.1371/journal.pone.0084028 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. |
spellingShingle | Research Article Umemura, Myco Koike, Hideaki Nagano, Nozomi Ishii, Tomoko Kawano, Jin Yamane, Noriko Kozone, Ikuko Horimoto, Katsuhisa Shin-ya, Kazuo Asai, Kiyoshi Yu, Jiujiang Bennett, Joan W. Machida, Masayuki MIDDAS-M: Motif-Independent De Novo Detection of Secondary Metabolite Gene Clusters through the Integration of Genome Sequencing and Transcriptome Data |
title | MIDDAS-M: Motif-Independent De Novo Detection of Secondary Metabolite Gene Clusters through the Integration of Genome Sequencing and Transcriptome Data |
title_full | MIDDAS-M: Motif-Independent De Novo Detection of Secondary Metabolite Gene Clusters through the Integration of Genome Sequencing and Transcriptome Data |
title_fullStr | MIDDAS-M: Motif-Independent De Novo Detection of Secondary Metabolite Gene Clusters through the Integration of Genome Sequencing and Transcriptome Data |
title_full_unstemmed | MIDDAS-M: Motif-Independent De Novo Detection of Secondary Metabolite Gene Clusters through the Integration of Genome Sequencing and Transcriptome Data |
title_short | MIDDAS-M: Motif-Independent De Novo Detection of Secondary Metabolite Gene Clusters through the Integration of Genome Sequencing and Transcriptome Data |
title_sort | middas-m: motif-independent de novo detection of secondary metabolite gene clusters through the integration of genome sequencing and transcriptome data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3877130/ https://www.ncbi.nlm.nih.gov/pubmed/24391870 http://dx.doi.org/10.1371/journal.pone.0084028 |
work_keys_str_mv | AT umemuramyco middasmmotifindependentdenovodetectionofsecondarymetabolitegeneclustersthroughtheintegrationofgenomesequencingandtranscriptomedata AT koikehideaki middasmmotifindependentdenovodetectionofsecondarymetabolitegeneclustersthroughtheintegrationofgenomesequencingandtranscriptomedata AT naganonozomi middasmmotifindependentdenovodetectionofsecondarymetabolitegeneclustersthroughtheintegrationofgenomesequencingandtranscriptomedata AT ishiitomoko middasmmotifindependentdenovodetectionofsecondarymetabolitegeneclustersthroughtheintegrationofgenomesequencingandtranscriptomedata AT kawanojin middasmmotifindependentdenovodetectionofsecondarymetabolitegeneclustersthroughtheintegrationofgenomesequencingandtranscriptomedata AT yamanenoriko middasmmotifindependentdenovodetectionofsecondarymetabolitegeneclustersthroughtheintegrationofgenomesequencingandtranscriptomedata AT kozoneikuko middasmmotifindependentdenovodetectionofsecondarymetabolitegeneclustersthroughtheintegrationofgenomesequencingandtranscriptomedata AT horimotokatsuhisa middasmmotifindependentdenovodetectionofsecondarymetabolitegeneclustersthroughtheintegrationofgenomesequencingandtranscriptomedata AT shinyakazuo middasmmotifindependentdenovodetectionofsecondarymetabolitegeneclustersthroughtheintegrationofgenomesequencingandtranscriptomedata AT asaikiyoshi middasmmotifindependentdenovodetectionofsecondarymetabolitegeneclustersthroughtheintegrationofgenomesequencingandtranscriptomedata AT yujiujiang middasmmotifindependentdenovodetectionofsecondarymetabolitegeneclustersthroughtheintegrationofgenomesequencingandtranscriptomedata AT bennettjoanw middasmmotifindependentdenovodetectionofsecondarymetabolitegeneclustersthroughtheintegrationofgenomesequencingandtranscriptomedata AT machidamasayuki middasmmotifindependentdenovodetectionofsecondarymetabolitegeneclustersthroughtheintegrationofgenomesequencingandtranscriptomedata |