
KA-Search, a method for rapid and exhaustive sequence identity search of known antibodies

Antibodies with similar amino acid sequences, especially across their complementarity-determining regions, often share properties. Finding that an antibody of interest has a similar sequence to naturally expressed antibodies in healthy or diseased repertoires is a powerful approach for the predictio...


Bibliographic Details
Main Authors: Olsen, Tobias H., Abanades, Brennan, Moal, Iain H., Deane, Charlotte M.
Format: Online Article Text
Language: English
Published: Nature Publishing Group UK 2023
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10354155/
https://www.ncbi.nlm.nih.gov/pubmed/37463925
http://dx.doi.org/10.1038/s41598-023-38108-7
_version_ 1785074866538414080
author Olsen, Tobias H.
Abanades, Brennan
Moal, Iain H.
Deane, Charlotte M.
author_facet Olsen, Tobias H.
Abanades, Brennan
Moal, Iain H.
Deane, Charlotte M.
author_sort Olsen, Tobias H.
collection PubMed
description Antibodies with similar amino acid sequences, especially across their complementarity-determining regions, often share properties. Finding that an antibody of interest has a similar sequence to naturally expressed antibodies in healthy or diseased repertoires is a powerful approach for the prediction of antibody properties, such as immunogenicity or antigen specificity. However, as the number of available antibody sequences is now in the billions and continuing to grow, repertoire mining for similar sequences has become increasingly computationally expensive. Existing approaches are limited by being low-throughput, non-exhaustive, not antibody-specific, or restricted to searching against entire chain sequences. Therefore, there is a need for a specialized tool, optimized for a rapid and exhaustive search of any antibody region against all known antibodies, to better utilize the full breadth of available repertoire sequences. We introduce Known Antibody Search (KA-Search), a tool that allows for the rapid search of billions of antibody variable domains by amino acid sequence identity across either the variable domain, the complementarity-determining regions, or a user-defined antibody region. We show KA-Search in operation on the ~2.4 billion antibody sequences available in the OAS database. KA-Search can be used to find the most similar sequences from OAS within 30 minutes and from a representative subset of 10 million sequences in less than 9 seconds. We give examples of how KA-Search can be used to obtain new insights about an antibody of interest. KA-Search is freely available at https://github.com/oxpig/kasearch.
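The description above refers to ranking known antibodies by amino acid sequence identity over a chosen region. The following is a minimal sketch of that idea only, not KA-Search's implementation or API. It assumes the sequences have already been aligned to a common fixed-length numbering, and every name in it (`REGIONS`, `region_positions`, `identity`, `search`, the toy sequences and the region boundaries) is hypothetical and chosen for illustration.

```python
from typing import Dict, List, Sequence, Tuple

GAP = "-"

# Hypothetical region boundaries (0-based, half-open) over a toy
# 12-position alignment; real antibody numbering schemes differ.
REGIONS: Dict[str, List[Tuple[int, int]]] = {
    "whole": [(0, 12)],
    "cdrs": [(2, 4), (6, 8), (10, 12)],
}

# Toy, made-up aligned sequences (all the same length).
QUERY = "QVQLGYTFARDY"
DATABASE: Dict[str, str] = {
    "ab1": "QVQLGYTFARDY",
    "ab2": "EVQLGYTFARDY",
    "ab3": "QVKLGYSFARDW",
}


def region_positions(region: str) -> List[int]:
    """Expand a named region into the list of aligned positions it covers."""
    return [i for start, end in REGIONS[region] for i in range(start, end)]


def identity(query: str, target: str, positions: Sequence[int]) -> float:
    """Fraction of identical residues over the given aligned positions.

    Positions where both sequences have a gap are skipped; a gap against
    a residue counts as a mismatch.
    """
    matches = 0
    compared = 0
    for i in positions:
        q, t = query[i], target[i]
        if q == GAP and t == GAP:
            continue
        compared += 1
        if q == t:
            matches += 1
    return matches / compared if compared else 0.0


def search(
    query: str,
    database: Dict[str, str],
    region: str = "whole",
    keep: int = 3,
) -> List[Tuple[str, float]]:
    """Exhaustively score every database entry and return the top `keep`."""
    positions = region_positions(region)
    scored = [
        (identity(query, seq, positions), name)
        for name, seq in database.items()
    ]
    # Highest identity first; ties broken by name for a stable order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [(name, score) for score, name in scored[:keep]]


if __name__ == "__main__":
    for region in ("whole", "cdrs"):
        print(region, search(QUERY, DATABASE, region=region))
```

Restricting the comparison to the hypothetical `cdrs` positions can change the ranking relative to the whole sequence, which is the reason a region-aware search is useful. A tool operating on billions of sequences would need a far more compact encoding and vectorized comparison than this plain loop.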
format Online
Article
Text
id pubmed-10354155
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-10354155 2023-07-20 KA-Search, a method for rapid and exhaustive sequence identity search of known antibodies Olsen, Tobias H. Abanades, Brennan Moal, Iain H. Deane, Charlotte M. Sci Rep Article
Nature Publishing Group UK 2023-07-18 /pmc/articles/PMC10354155/ /pubmed/37463925 http://dx.doi.org/10.1038/s41598-023-38108-7 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/ Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Olsen, Tobias H.
Abanades, Brennan
Moal, Iain H.
Deane, Charlotte M.
KA-Search, a method for rapid and exhaustive sequence identity search of known antibodies
title KA-Search, a method for rapid and exhaustive sequence identity search of known antibodies
title_full KA-Search, a method for rapid and exhaustive sequence identity search of known antibodies
title_fullStr KA-Search, a method for rapid and exhaustive sequence identity search of known antibodies
title_full_unstemmed KA-Search, a method for rapid and exhaustive sequence identity search of known antibodies
title_short KA-Search, a method for rapid and exhaustive sequence identity search of known antibodies
title_sort ka-search, a method for rapid and exhaustive sequence identity search of known antibodies
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10354155/
https://www.ncbi.nlm.nih.gov/pubmed/37463925
http://dx.doi.org/10.1038/s41598-023-38108-7
work_keys_str_mv AT olsentobiash kasearchamethodforrapidandexhaustivesequenceidentitysearchofknownantibodies
AT abanadesbrennan kasearchamethodforrapidandexhaustivesequenceidentitysearchofknownantibodies
AT moaliainh kasearchamethodforrapidandexhaustivesequenceidentitysearchofknownantibodies
AT deanecharlottem kasearchamethodforrapidandexhaustivesequenceidentitysearchofknownantibodies