Cargando…

CusProSe: a customizable protein annotation software with an application to the prediction of fungal secondary metabolism genes

We report here a new application, CustomProteinSearch (CusProSe), whose purpose is to help users to search for proteins of interest based on their domain composition. The application is customizable. It consists of two independent tools, IterHMMBuild and ProSeCDA. IterHMMBuild allows the iterative c...

Descripción completa

Detalles Bibliográficos
Autores principales: Oliveira, Leonor, Chevrollier, Nicolas, Dallery, Jean-Felix, O’Connell, Richard J., Lebrun, Marc-Henri, Viaud, Muriel, Lespinet, Olivier
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9876896/
https://www.ncbi.nlm.nih.gov/pubmed/36697464
http://dx.doi.org/10.1038/s41598-023-27813-y
_version_ 1784878264230084608
author Oliveira, Leonor
Chevrollier, Nicolas
Dallery, Jean-Felix
O’Connell, Richard J.
Lebrun, Marc-Henri
Viaud, Muriel
Lespinet, Olivier
author_facet Oliveira, Leonor
Chevrollier, Nicolas
Dallery, Jean-Felix
O’Connell, Richard J.
Lebrun, Marc-Henri
Viaud, Muriel
Lespinet, Olivier
author_sort Oliveira, Leonor
collection PubMed
description We report here a new application, CustomProteinSearch (CusProSe), whose purpose is to help users to search for proteins of interest based on their domain composition. The application is customizable. It consists of two independent tools, IterHMMBuild and ProSeCDA. IterHMMBuild allows the iterative construction of Hidden Markov Model (HMM) profiles for conserved domains of selected protein sequences, while ProSeCDA scans a proteome of interest against an HMM profile database, and annotates identified proteins using user-defined rules. CusProSe was successfully used to identify, in fungal genomes, genes encoding key enzyme families involved in secondary metabolism, such as polyketide synthases (PKS), non-ribosomal peptide synthetases (NRPS), hybrid PKS-NRPS and dimethylallyl tryptophan synthases (DMATS), as well as to characterize distinct terpene synthases (TS) sub-families. The highly configurable characteristics of this application makes it a generic tool, which allows the user to refine the function of predicted proteins, to extend detection to new enzymes families, and may also be applied to biological systems other than fungi and to other proteins than those involved in secondary metabolism.
format Online
Article
Text
id pubmed-9876896
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-98768962023-01-27 CusProSe: a customizable protein annotation software with an application to the prediction of fungal secondary metabolism genes Oliveira, Leonor Chevrollier, Nicolas Dallery, Jean-Felix O’Connell, Richard J. Lebrun, Marc-Henri Viaud, Muriel Lespinet, Olivier Sci Rep Article We report here a new application, CustomProteinSearch (CusProSe), whose purpose is to help users to search for proteins of interest based on their domain composition. The application is customizable. It consists of two independent tools, IterHMMBuild and ProSeCDA. IterHMMBuild allows the iterative construction of Hidden Markov Model (HMM) profiles for conserved domains of selected protein sequences, while ProSeCDA scans a proteome of interest against an HMM profile database, and annotates identified proteins using user-defined rules. CusProSe was successfully used to identify, in fungal genomes, genes encoding key enzyme families involved in secondary metabolism, such as polyketide synthases (PKS), non-ribosomal peptide synthetases (NRPS), hybrid PKS-NRPS and dimethylallyl tryptophan synthases (DMATS), as well as to characterize distinct terpene synthases (TS) sub-families. The highly configurable characteristics of this application makes it a generic tool, which allows the user to refine the function of predicted proteins, to extend detection to new enzymes families, and may also be applied to biological systems other than fungi and to other proteins than those involved in secondary metabolism. Nature Publishing Group UK 2023-01-25 /pmc/articles/PMC9876896/ /pubmed/36697464 http://dx.doi.org/10.1038/s41598-023-27813-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Oliveira, Leonor
Chevrollier, Nicolas
Dallery, Jean-Felix
O’Connell, Richard J.
Lebrun, Marc-Henri
Viaud, Muriel
Lespinet, Olivier
CusProSe: a customizable protein annotation software with an application to the prediction of fungal secondary metabolism genes
title CusProSe: a customizable protein annotation software with an application to the prediction of fungal secondary metabolism genes
title_full CusProSe: a customizable protein annotation software with an application to the prediction of fungal secondary metabolism genes
title_fullStr CusProSe: a customizable protein annotation software with an application to the prediction of fungal secondary metabolism genes
title_full_unstemmed CusProSe: a customizable protein annotation software with an application to the prediction of fungal secondary metabolism genes
title_short CusProSe: a customizable protein annotation software with an application to the prediction of fungal secondary metabolism genes
title_sort cusprose: a customizable protein annotation software with an application to the prediction of fungal secondary metabolism genes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9876896/
https://www.ncbi.nlm.nih.gov/pubmed/36697464
http://dx.doi.org/10.1038/s41598-023-27813-y
work_keys_str_mv AT oliveiraleonor cusproseacustomizableproteinannotationsoftwarewithanapplicationtothepredictionoffungalsecondarymetabolismgenes
AT chevrolliernicolas cusproseacustomizableproteinannotationsoftwarewithanapplicationtothepredictionoffungalsecondarymetabolismgenes
AT dalleryjeanfelix cusproseacustomizableproteinannotationsoftwarewithanapplicationtothepredictionoffungalsecondarymetabolismgenes
AT oconnellrichardj cusproseacustomizableproteinannotationsoftwarewithanapplicationtothepredictionoffungalsecondarymetabolismgenes
AT lebrunmarchenri cusproseacustomizableproteinannotationsoftwarewithanapplicationtothepredictionoffungalsecondarymetabolismgenes
AT viaudmuriel cusproseacustomizableproteinannotationsoftwarewithanapplicationtothepredictionoffungalsecondarymetabolismgenes
AT lespinetolivier cusproseacustomizableproteinannotationsoftwarewithanapplicationtothepredictionoffungalsecondarymetabolismgenes