Cargando…

Identification of Macrocyclic Peptide Families from Combinatorial Libraries Containing Noncanonical Amino Acids Using Cheminformatics and Bioinformatics Inspired Clustering

[Image: see text] In the past decade, macrocyclic peptides gained increasing interest as a new therapeutic modality to tackle intracellular and extracellular therapeutic targets that had been previously classified as “undruggable”. Several technological advances have made discovering macrocyclic pep...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lee, Man-Ling, Farag, Sherif, Del Cid, Joselyn S., Bashore, Charlene, Hallenbeck, Kenneth K., Gobbi, Alberto, Cunningham, Christian N.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	American Chemical Society 2023
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10278063/ https://www.ncbi.nlm.nih.gov/pubmed/37220419 http://dx.doi.org/10.1021/acschembio.3c00159

_version_	1785060405844901888
author	Lee, Man-Ling Farag, Sherif Del Cid, Joselyn S. Bashore, Charlene Hallenbeck, Kenneth K. Gobbi, Alberto Cunningham, Christian N.
author_facet	Lee, Man-Ling Farag, Sherif Del Cid, Joselyn S. Bashore, Charlene Hallenbeck, Kenneth K. Gobbi, Alberto Cunningham, Christian N.
author_sort	Lee, Man-Ling
collection	PubMed
description	[Image: see text] In the past decade, macrocyclic peptides gained increasing interest as a new therapeutic modality to tackle intracellular and extracellular therapeutic targets that had been previously classified as “undruggable”. Several technological advances have made discovering macrocyclic peptides against these targets possible: 1) the inclusion of noncanonical amino acids (NCAAs) into mRNA display, 2) increased availability of next generation sequencing (NGS), and 3) improvements in rapid peptide synthesis platforms. This type of directed-evolution based screening can produce large numbers of potential hit sequences given that DNA sequencing is the functional output of this platform. The current standard for selecting hit peptides from these selections for downstream follow-up relies on the frequency counting and sorting of unique peptide sequences which can result in the generation of false negatives due to technical reasons including low translation efficiency or other experimental factors. To overcome our inability to detect weakly enriched peptide sequences among our large data sets, we wanted to develop a clustering method that would enable the identification of peptide families. Unfortunately, utilizing traditional clustering algorithms, such as ClustalW, is not possible for this technology due to the incorporation of NCAAs in these libraries. Therefore, we developed a new atomistic clustering method with a Pairwise Aligned Peptide (PAP) chemical similarity metric to perform sequence alignments and identify macrocyclic peptide families. With this method, low enriched peptides, including isolated sequences (singletons), can now be clustered into families providing a comprehensive analysis of NGS data resulting from macrocycle discovery selections. Additionally, upon identification of a hit peptide with the desired activity, this clustering algorithm can be used to identify derivatives from the initial data set for structure–activity relationship (SAR) analysis without requiring additional selection experiments.
format	Online Article Text
id	pubmed-10278063
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	American Chemical Society
record_format	MEDLINE/PubMed
spelling	pubmed-102780632023-06-20 Identification of Macrocyclic Peptide Families from Combinatorial Libraries Containing Noncanonical Amino Acids Using Cheminformatics and Bioinformatics Inspired Clustering Lee, Man-Ling Farag, Sherif Del Cid, Joselyn S. Bashore, Charlene Hallenbeck, Kenneth K. Gobbi, Alberto Cunningham, Christian N. ACS Chem Biol [Image: see text] In the past decade, macrocyclic peptides gained increasing interest as a new therapeutic modality to tackle intracellular and extracellular therapeutic targets that had been previously classified as “undruggable”. Several technological advances have made discovering macrocyclic peptides against these targets possible: 1) the inclusion of noncanonical amino acids (NCAAs) into mRNA display, 2) increased availability of next generation sequencing (NGS), and 3) improvements in rapid peptide synthesis platforms. This type of directed-evolution based screening can produce large numbers of potential hit sequences given that DNA sequencing is the functional output of this platform. The current standard for selecting hit peptides from these selections for downstream follow-up relies on the frequency counting and sorting of unique peptide sequences which can result in the generation of false negatives due to technical reasons including low translation efficiency or other experimental factors. To overcome our inability to detect weakly enriched peptide sequences among our large data sets, we wanted to develop a clustering method that would enable the identification of peptide families. Unfortunately, utilizing traditional clustering algorithms, such as ClustalW, is not possible for this technology due to the incorporation of NCAAs in these libraries. Therefore, we developed a new atomistic clustering method with a Pairwise Aligned Peptide (PAP) chemical similarity metric to perform sequence alignments and identify macrocyclic peptide families. With this method, low enriched peptides, including isolated sequences (singletons), can now be clustered into families providing a comprehensive analysis of NGS data resulting from macrocycle discovery selections. Additionally, upon identification of a hit peptide with the desired activity, this clustering algorithm can be used to identify derivatives from the initial data set for structure–activity relationship (SAR) analysis without requiring additional selection experiments. American Chemical Society 2023-05-23 /pmc/articles/PMC10278063/ /pubmed/37220419 http://dx.doi.org/10.1021/acschembio.3c00159 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle	Lee, Man-Ling Farag, Sherif Del Cid, Joselyn S. Bashore, Charlene Hallenbeck, Kenneth K. Gobbi, Alberto Cunningham, Christian N. Identification of Macrocyclic Peptide Families from Combinatorial Libraries Containing Noncanonical Amino Acids Using Cheminformatics and Bioinformatics Inspired Clustering
title	Identification of Macrocyclic Peptide Families from Combinatorial Libraries Containing Noncanonical Amino Acids Using Cheminformatics and Bioinformatics Inspired Clustering
title_full	Identification of Macrocyclic Peptide Families from Combinatorial Libraries Containing Noncanonical Amino Acids Using Cheminformatics and Bioinformatics Inspired Clustering
title_fullStr	Identification of Macrocyclic Peptide Families from Combinatorial Libraries Containing Noncanonical Amino Acids Using Cheminformatics and Bioinformatics Inspired Clustering
title_full_unstemmed	Identification of Macrocyclic Peptide Families from Combinatorial Libraries Containing Noncanonical Amino Acids Using Cheminformatics and Bioinformatics Inspired Clustering
title_short	Identification of Macrocyclic Peptide Families from Combinatorial Libraries Containing Noncanonical Amino Acids Using Cheminformatics and Bioinformatics Inspired Clustering
title_sort	identification of macrocyclic peptide families from combinatorial libraries containing noncanonical amino acids using cheminformatics and bioinformatics inspired clustering
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10278063/ https://www.ncbi.nlm.nih.gov/pubmed/37220419 http://dx.doi.org/10.1021/acschembio.3c00159
work_keys_str_mv	AT leemanling identificationofmacrocyclicpeptidefamiliesfromcombinatoriallibrariescontainingnoncanonicalaminoacidsusingcheminformaticsandbioinformaticsinspiredclustering AT faragsherif identificationofmacrocyclicpeptidefamiliesfromcombinatoriallibrariescontainingnoncanonicalaminoacidsusingcheminformaticsandbioinformaticsinspiredclustering AT delcidjoselyns identificationofmacrocyclicpeptidefamiliesfromcombinatoriallibrariescontainingnoncanonicalaminoacidsusingcheminformaticsandbioinformaticsinspiredclustering AT bashorecharlene identificationofmacrocyclicpeptidefamiliesfromcombinatoriallibrariescontainingnoncanonicalaminoacidsusingcheminformaticsandbioinformaticsinspiredclustering AT hallenbeckkennethk identificationofmacrocyclicpeptidefamiliesfromcombinatoriallibrariescontainingnoncanonicalaminoacidsusingcheminformaticsandbioinformaticsinspiredclustering AT gobbialberto identificationofmacrocyclicpeptidefamiliesfromcombinatoriallibrariescontainingnoncanonicalaminoacidsusingcheminformaticsandbioinformaticsinspiredclustering AT cunninghamchristiann identificationofmacrocyclicpeptidefamiliesfromcombinatoriallibrariescontainingnoncanonicalaminoacidsusingcheminformaticsandbioinformaticsinspiredclustering

Identification of Macrocyclic Peptide Families from Combinatorial Libraries Containing Noncanonical Amino Acids Using Cheminformatics and Bioinformatics Inspired Clustering

Ejemplares similares