Cargando…

Beam search decoder for enhancing sequence decoding speed in single-molecule peptide sequencing data

Next-generation single-molecule protein sequencing technologies have the potential to significantly accelerate biomedical research. These technologies offer sensitivity and scalability for proteomic analysis. One auspicious method is fluorosequencing, which involves: cutting naturalized proteins int...

Descripción completa

Detalles Bibliográficos
Autores principales: Kipen, Javier, Jaldén, Joakim
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10656014/
https://www.ncbi.nlm.nih.gov/pubmed/37934778
http://dx.doi.org/10.1371/journal.pcbi.1011345
_version_ 1785136925201399808
author Kipen, Javier
Jaldén, Joakim
author_facet Kipen, Javier
Jaldén, Joakim
author_sort Kipen, Javier
collection PubMed
description Next-generation single-molecule protein sequencing technologies have the potential to significantly accelerate biomedical research. These technologies offer sensitivity and scalability for proteomic analysis. One auspicious method is fluorosequencing, which involves: cutting naturalized proteins into peptides, attaching fluorophores to specific amino acids, and observing variations in light intensity as one amino acid is removed at a time. The original peptide is classified from the sequence of light-intensity reads, and proteins can subsequently be recognized with this information. The amino acid step removal is achieved by attaching the peptides to a wall on the C-terminal and using a process called Edman Degradation to remove an amino acid from the N-Terminal. Even though a framework (Whatprot) has been proposed for the peptide classification task, processing times remain restrictive due to the massively parallel data acquisicion system. In this paper, we propose a new beam search decoder with a novel state formulation that obtains considerably lower processing times at the expense of only a slight accuracy drop compared to Whatprot. Furthermore, we explore how our novel state formulation may lead to even faster decoders in the future.
format Online
Article
Text
id pubmed-10656014
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-106560142023-11-07 Beam search decoder for enhancing sequence decoding speed in single-molecule peptide sequencing data Kipen, Javier Jaldén, Joakim PLoS Comput Biol Research Article Next-generation single-molecule protein sequencing technologies have the potential to significantly accelerate biomedical research. These technologies offer sensitivity and scalability for proteomic analysis. One auspicious method is fluorosequencing, which involves: cutting naturalized proteins into peptides, attaching fluorophores to specific amino acids, and observing variations in light intensity as one amino acid is removed at a time. The original peptide is classified from the sequence of light-intensity reads, and proteins can subsequently be recognized with this information. The amino acid step removal is achieved by attaching the peptides to a wall on the C-terminal and using a process called Edman Degradation to remove an amino acid from the N-Terminal. Even though a framework (Whatprot) has been proposed for the peptide classification task, processing times remain restrictive due to the massively parallel data acquisicion system. In this paper, we propose a new beam search decoder with a novel state formulation that obtains considerably lower processing times at the expense of only a slight accuracy drop compared to Whatprot. Furthermore, we explore how our novel state formulation may lead to even faster decoders in the future. Public Library of Science 2023-11-07 /pmc/articles/PMC10656014/ /pubmed/37934778 http://dx.doi.org/10.1371/journal.pcbi.1011345 Text en © 2023 Kipen, Jaldén https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Kipen, Javier
Jaldén, Joakim
Beam search decoder for enhancing sequence decoding speed in single-molecule peptide sequencing data
title Beam search decoder for enhancing sequence decoding speed in single-molecule peptide sequencing data
title_full Beam search decoder for enhancing sequence decoding speed in single-molecule peptide sequencing data
title_fullStr Beam search decoder for enhancing sequence decoding speed in single-molecule peptide sequencing data
title_full_unstemmed Beam search decoder for enhancing sequence decoding speed in single-molecule peptide sequencing data
title_short Beam search decoder for enhancing sequence decoding speed in single-molecule peptide sequencing data
title_sort beam search decoder for enhancing sequence decoding speed in single-molecule peptide sequencing data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10656014/
https://www.ncbi.nlm.nih.gov/pubmed/37934778
http://dx.doi.org/10.1371/journal.pcbi.1011345
work_keys_str_mv AT kipenjavier beamsearchdecoderforenhancingsequencedecodingspeedinsinglemoleculepeptidesequencingdata
AT jaldenjoakim beamsearchdecoderforenhancingsequencedecodingspeedinsinglemoleculepeptidesequencingdata