Cargando…
Comprehensive evaluation of peptide de novo sequencing tools for monoclonal antibody assembly
Monoclonal antibodies are biotechnologically produced proteins with various applications in research, therapeutics and diagnostics. Their ability to recognize and bind to specific molecule structures makes them essential research tools and therapeutic agents. Sequence information of antibodies is he...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9851299/ https://www.ncbi.nlm.nih.gov/pubmed/36545804 http://dx.doi.org/10.1093/bib/bbac542 |
_version_ | 1784872366900248576 |
---|---|
author | Beslic, Denis Tscheuschner, Georg Renard, Bernhard Y Weller, Michael G Muth, Thilo |
author_facet | Beslic, Denis Tscheuschner, Georg Renard, Bernhard Y Weller, Michael G Muth, Thilo |
author_sort | Beslic, Denis |
collection | PubMed |
description | Monoclonal antibodies are biotechnologically produced proteins with various applications in research, therapeutics and diagnostics. Their ability to recognize and bind to specific molecule structures makes them essential research tools and therapeutic agents. Sequence information of antibodies is helpful for understanding antibody–antigen interactions and ensuring their affinity and specificity. De novo protein sequencing based on mass spectrometry is a valuable method to obtain the amino acid sequence of peptides and proteins without a priori knowledge. In this study, we evaluated six recently developed de novo peptide sequencing algorithms (Novor, pNovo 3, DeepNovo, SMSNet, PointNovo and Casanovo), which were not specifically designed for antibody data. We validated their ability to identify and assemble antibody sequences on three multi-enzymatic data sets. The deep learning-based tools Casanovo and PointNovo showed an increased peptide recall across different enzymes and data sets compared with spectrum-graph-based approaches. We evaluated different error types of de novo peptide sequencing tools and their performance for different numbers of missing cleavage sites, noisy spectra and peptides of various lengths. We achieved a sequence coverage of 97.69–99.53% on the light chains of three different antibody data sets using the de Bruijn assembler ALPS and the predictions from Casanovo. However, low sequence coverage and accuracy on the heavy chains demonstrate that complete de novo protein sequencing remains a challenging issue in proteomics that requires improved de novo error correction, alternative digestion strategies and hybrid approaches such as homology search to achieve high accuracy on long protein sequences. |
format | Online Article Text |
id | pubmed-9851299 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-98512992023-01-20 Comprehensive evaluation of peptide de novo sequencing tools for monoclonal antibody assembly Beslic, Denis Tscheuschner, Georg Renard, Bernhard Y Weller, Michael G Muth, Thilo Brief Bioinform Problem Solving Protocol Monoclonal antibodies are biotechnologically produced proteins with various applications in research, therapeutics and diagnostics. Their ability to recognize and bind to specific molecule structures makes them essential research tools and therapeutic agents. Sequence information of antibodies is helpful for understanding antibody–antigen interactions and ensuring their affinity and specificity. De novo protein sequencing based on mass spectrometry is a valuable method to obtain the amino acid sequence of peptides and proteins without a priori knowledge. In this study, we evaluated six recently developed de novo peptide sequencing algorithms (Novor, pNovo 3, DeepNovo, SMSNet, PointNovo and Casanovo), which were not specifically designed for antibody data. We validated their ability to identify and assemble antibody sequences on three multi-enzymatic data sets. The deep learning-based tools Casanovo and PointNovo showed an increased peptide recall across different enzymes and data sets compared with spectrum-graph-based approaches. We evaluated different error types of de novo peptide sequencing tools and their performance for different numbers of missing cleavage sites, noisy spectra and peptides of various lengths. We achieved a sequence coverage of 97.69–99.53% on the light chains of three different antibody data sets using the de Bruijn assembler ALPS and the predictions from Casanovo. However, low sequence coverage and accuracy on the heavy chains demonstrate that complete de novo protein sequencing remains a challenging issue in proteomics that requires improved de novo error correction, alternative digestion strategies and hybrid approaches such as homology search to achieve high accuracy on long protein sequences. Oxford University Press 2022-12-21 /pmc/articles/PMC9851299/ /pubmed/36545804 http://dx.doi.org/10.1093/bib/bbac542 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Problem Solving Protocol Beslic, Denis Tscheuschner, Georg Renard, Bernhard Y Weller, Michael G Muth, Thilo Comprehensive evaluation of peptide de novo sequencing tools for monoclonal antibody assembly |
title | Comprehensive evaluation of peptide de novo sequencing tools for monoclonal antibody assembly |
title_full | Comprehensive evaluation of peptide de novo sequencing tools for monoclonal antibody assembly |
title_fullStr | Comprehensive evaluation of peptide de novo sequencing tools for monoclonal antibody assembly |
title_full_unstemmed | Comprehensive evaluation of peptide de novo sequencing tools for monoclonal antibody assembly |
title_short | Comprehensive evaluation of peptide de novo sequencing tools for monoclonal antibody assembly |
title_sort | comprehensive evaluation of peptide de novo sequencing tools for monoclonal antibody assembly |
topic | Problem Solving Protocol |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9851299/ https://www.ncbi.nlm.nih.gov/pubmed/36545804 http://dx.doi.org/10.1093/bib/bbac542 |
work_keys_str_mv | AT beslicdenis comprehensiveevaluationofpeptidedenovosequencingtoolsformonoclonalantibodyassembly AT tscheuschnergeorg comprehensiveevaluationofpeptidedenovosequencingtoolsformonoclonalantibodyassembly AT renardbernhardy comprehensiveevaluationofpeptidedenovosequencingtoolsformonoclonalantibodyassembly AT wellermichaelg comprehensiveevaluationofpeptidedenovosequencingtoolsformonoclonalantibodyassembly AT muththilo comprehensiveevaluationofpeptidedenovosequencingtoolsformonoclonalantibodyassembly |