Cargando…
Beam search decoder for enhancing sequence decoding speed in single-molecule peptide sequencing data
Next-generation single-molecule protein sequencing technologies have the potential to significantly accelerate biomedical research. These technologies offer sensitivity and scalability for proteomic analysis. One auspicious method is fluorosequencing, which involves: cutting naturalized proteins int...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10656014/ https://www.ncbi.nlm.nih.gov/pubmed/37934778 http://dx.doi.org/10.1371/journal.pcbi.1011345 |
_version_ | 1785136925201399808 |
---|---|
author | Kipen, Javier Jaldén, Joakim |
author_facet | Kipen, Javier Jaldén, Joakim |
author_sort | Kipen, Javier |
collection | PubMed |
description | Next-generation single-molecule protein sequencing technologies have the potential to significantly accelerate biomedical research. These technologies offer sensitivity and scalability for proteomic analysis. One auspicious method is fluorosequencing, which involves: cutting naturalized proteins into peptides, attaching fluorophores to specific amino acids, and observing variations in light intensity as one amino acid is removed at a time. The original peptide is classified from the sequence of light-intensity reads, and proteins can subsequently be recognized with this information. The amino acid step removal is achieved by attaching the peptides to a wall on the C-terminal and using a process called Edman Degradation to remove an amino acid from the N-Terminal. Even though a framework (Whatprot) has been proposed for the peptide classification task, processing times remain restrictive due to the massively parallel data acquisicion system. In this paper, we propose a new beam search decoder with a novel state formulation that obtains considerably lower processing times at the expense of only a slight accuracy drop compared to Whatprot. Furthermore, we explore how our novel state formulation may lead to even faster decoders in the future. |
format | Online Article Text |
id | pubmed-10656014 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-106560142023-11-07 Beam search decoder for enhancing sequence decoding speed in single-molecule peptide sequencing data Kipen, Javier Jaldén, Joakim PLoS Comput Biol Research Article Next-generation single-molecule protein sequencing technologies have the potential to significantly accelerate biomedical research. These technologies offer sensitivity and scalability for proteomic analysis. One auspicious method is fluorosequencing, which involves: cutting naturalized proteins into peptides, attaching fluorophores to specific amino acids, and observing variations in light intensity as one amino acid is removed at a time. The original peptide is classified from the sequence of light-intensity reads, and proteins can subsequently be recognized with this information. The amino acid step removal is achieved by attaching the peptides to a wall on the C-terminal and using a process called Edman Degradation to remove an amino acid from the N-Terminal. Even though a framework (Whatprot) has been proposed for the peptide classification task, processing times remain restrictive due to the massively parallel data acquisicion system. In this paper, we propose a new beam search decoder with a novel state formulation that obtains considerably lower processing times at the expense of only a slight accuracy drop compared to Whatprot. Furthermore, we explore how our novel state formulation may lead to even faster decoders in the future. Public Library of Science 2023-11-07 /pmc/articles/PMC10656014/ /pubmed/37934778 http://dx.doi.org/10.1371/journal.pcbi.1011345 Text en © 2023 Kipen, Jaldén https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Kipen, Javier Jaldén, Joakim Beam search decoder for enhancing sequence decoding speed in single-molecule peptide sequencing data |
title | Beam search decoder for enhancing sequence decoding speed in single-molecule peptide sequencing data |
title_full | Beam search decoder for enhancing sequence decoding speed in single-molecule peptide sequencing data |
title_fullStr | Beam search decoder for enhancing sequence decoding speed in single-molecule peptide sequencing data |
title_full_unstemmed | Beam search decoder for enhancing sequence decoding speed in single-molecule peptide sequencing data |
title_short | Beam search decoder for enhancing sequence decoding speed in single-molecule peptide sequencing data |
title_sort | beam search decoder for enhancing sequence decoding speed in single-molecule peptide sequencing data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10656014/ https://www.ncbi.nlm.nih.gov/pubmed/37934778 http://dx.doi.org/10.1371/journal.pcbi.1011345 |
work_keys_str_mv | AT kipenjavier beamsearchdecoderforenhancingsequencedecodingspeedinsinglemoleculepeptidesequencingdata AT jaldenjoakim beamsearchdecoderforenhancingsequencedecodingspeedinsinglemoleculepeptidesequencingdata |