Cargando…

Macrel: antimicrobial peptide screening in genomes and metagenomes

MOTIVATION: Antimicrobial peptides (AMPs) have the potential to tackle multidrug-resistant pathogens in both clinical and non-clinical contexts. The recent growth in the availability of genomes and metagenomes provides an opportunity for in silico prediction of novel AMP molecules. However, due to t...

Descripción completa

Detalles Bibliográficos
Autores principales: Santos-Júnior, Célio Dias, Pan, Shaojun, Zhao, Xing-Ming, Coelho, Luis Pedro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7751412/
https://www.ncbi.nlm.nih.gov/pubmed/33384902
http://dx.doi.org/10.7717/peerj.10555
_version_ 1783625660623749120
author Santos-Júnior, Célio Dias
Pan, Shaojun
Zhao, Xing-Ming
Coelho, Luis Pedro
author_facet Santos-Júnior, Célio Dias
Pan, Shaojun
Zhao, Xing-Ming
Coelho, Luis Pedro
author_sort Santos-Júnior, Célio Dias
collection PubMed
description MOTIVATION: Antimicrobial peptides (AMPs) have the potential to tackle multidrug-resistant pathogens in both clinical and non-clinical contexts. The recent growth in the availability of genomes and metagenomes provides an opportunity for in silico prediction of novel AMP molecules. However, due to the small size of these peptides, standard gene prospection methods cannot be applied in this domain and alternative approaches are necessary. In particular, standard gene prediction methods have low precision for short peptides, and functional classification by homology results in low recall. RESULTS: Here, we present Macrel (for metagenomic AMP classification and retrieval), which is an end-to-end pipeline for the prospection of high-quality AMP candidates from (meta)genomes. For this, we introduce a novel set of 22 peptide features. These were used to build classifiers which perform similarly to the state-of-the-art in the prediction of both antimicrobial and hemolytic activity of peptides, but with enhanced precision (using standard benchmarks as well as a stricter testing regime). We demonstrate that Macrel recovers high-quality AMP candidates using realistic simulations and real data. AVAILABILITY: Macrel is implemented in Python 3. It is available as open source at https://github.com/BigDataBiology/macrel and through bioconda. Classification of peptides or prediction of AMPs in contigs can also be performed on the webserver: https://big-data-biology.org/software/macrel.
format Online
Article
Text
id pubmed-7751412
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-77514122020-12-30 Macrel: antimicrobial peptide screening in genomes and metagenomes Santos-Júnior, Célio Dias Pan, Shaojun Zhao, Xing-Ming Coelho, Luis Pedro PeerJ Bioinformatics MOTIVATION: Antimicrobial peptides (AMPs) have the potential to tackle multidrug-resistant pathogens in both clinical and non-clinical contexts. The recent growth in the availability of genomes and metagenomes provides an opportunity for in silico prediction of novel AMP molecules. However, due to the small size of these peptides, standard gene prospection methods cannot be applied in this domain and alternative approaches are necessary. In particular, standard gene prediction methods have low precision for short peptides, and functional classification by homology results in low recall. RESULTS: Here, we present Macrel (for metagenomic AMP classification and retrieval), which is an end-to-end pipeline for the prospection of high-quality AMP candidates from (meta)genomes. For this, we introduce a novel set of 22 peptide features. These were used to build classifiers which perform similarly to the state-of-the-art in the prediction of both antimicrobial and hemolytic activity of peptides, but with enhanced precision (using standard benchmarks as well as a stricter testing regime). We demonstrate that Macrel recovers high-quality AMP candidates using realistic simulations and real data. AVAILABILITY: Macrel is implemented in Python 3. It is available as open source at https://github.com/BigDataBiology/macrel and through bioconda. Classification of peptides or prediction of AMPs in contigs can also be performed on the webserver: https://big-data-biology.org/software/macrel. PeerJ Inc. 2020-12-18 /pmc/articles/PMC7751412/ /pubmed/33384902 http://dx.doi.org/10.7717/peerj.10555 Text en © 2020 Santos-Júnior et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Santos-Júnior, Célio Dias
Pan, Shaojun
Zhao, Xing-Ming
Coelho, Luis Pedro
Macrel: antimicrobial peptide screening in genomes and metagenomes
title Macrel: antimicrobial peptide screening in genomes and metagenomes
title_full Macrel: antimicrobial peptide screening in genomes and metagenomes
title_fullStr Macrel: antimicrobial peptide screening in genomes and metagenomes
title_full_unstemmed Macrel: antimicrobial peptide screening in genomes and metagenomes
title_short Macrel: antimicrobial peptide screening in genomes and metagenomes
title_sort macrel: antimicrobial peptide screening in genomes and metagenomes
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7751412/
https://www.ncbi.nlm.nih.gov/pubmed/33384902
http://dx.doi.org/10.7717/peerj.10555
work_keys_str_mv AT santosjuniorceliodias macrelantimicrobialpeptidescreeningingenomesandmetagenomes
AT panshaojun macrelantimicrobialpeptidescreeningingenomesandmetagenomes
AT zhaoxingming macrelantimicrobialpeptidescreeningingenomesandmetagenomes
AT coelholuispedro macrelantimicrobialpeptidescreeningingenomesandmetagenomes