Cargando…

MutCombinator: identification of mutated peptides allowing combinatorial mutations using nucleotide-based graph search

MOTIVATION: Proteogenomics has proven its utility by integrating genomics and proteomics. Typical approaches use data from next-generation sequencing to infer proteins expressed. A sample-specific protein sequence database is often adopted to identify novel peptides from matched mass spectrometry-ba...

Descripción completa

Detalles Bibliográficos
Autores principales: Choi, Seunghyuk, Paek, Eunok
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355298/
https://www.ncbi.nlm.nih.gov/pubmed/32657416
http://dx.doi.org/10.1093/bioinformatics/btaa504
_version_ 1783558247422099456
author Choi, Seunghyuk
Paek, Eunok
author_facet Choi, Seunghyuk
Paek, Eunok
author_sort Choi, Seunghyuk
collection PubMed
description MOTIVATION: Proteogenomics has proven its utility by integrating genomics and proteomics. Typical approaches use data from next-generation sequencing to infer proteins expressed. A sample-specific protein sequence database is often adopted to identify novel peptides from matched mass spectrometry-based proteomics; nevertheless, there is no software that can practically identify all possible forms of mutated peptides suggested by various genomic information sources. RESULTS: We propose MutCombinator, which enables us to practically identify mutated peptides from tandem mass spectra allowing combinatorial mutations during the database search. It uses an upgraded version of a variant graph, keeping track of frame information. The variant graph is indexed by nine nucleotides for fast access. Using MutCombinator, we could identify more mutated peptides than previous methods, because combinations of point mutations are considered and also because it can be practically applied together with a large mutation database such as COSMIC. Furthermore, MutCombinator supports in-frame search for coding regions and three-frame search for non-coding regions. AVAILABILITY AND IMPLEMENTATION: https://prix.hanyang.ac.kr/download/mutcombinator.jsp. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-7355298
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-73552982020-07-16 MutCombinator: identification of mutated peptides allowing combinatorial mutations using nucleotide-based graph search Choi, Seunghyuk Paek, Eunok Bioinformatics Macromolecular Sequence, Structure, and Function MOTIVATION: Proteogenomics has proven its utility by integrating genomics and proteomics. Typical approaches use data from next-generation sequencing to infer proteins expressed. A sample-specific protein sequence database is often adopted to identify novel peptides from matched mass spectrometry-based proteomics; nevertheless, there is no software that can practically identify all possible forms of mutated peptides suggested by various genomic information sources. RESULTS: We propose MutCombinator, which enables us to practically identify mutated peptides from tandem mass spectra allowing combinatorial mutations during the database search. It uses an upgraded version of a variant graph, keeping track of frame information. The variant graph is indexed by nine nucleotides for fast access. Using MutCombinator, we could identify more mutated peptides than previous methods, because combinations of point mutations are considered and also because it can be practically applied together with a large mutation database such as COSMIC. Furthermore, MutCombinator supports in-frame search for coding regions and three-frame search for non-coding regions. AVAILABILITY AND IMPLEMENTATION: https://prix.hanyang.ac.kr/download/mutcombinator.jsp. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-07 2020-07-13 /pmc/articles/PMC7355298/ /pubmed/32657416 http://dx.doi.org/10.1093/bioinformatics/btaa504 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Macromolecular Sequence, Structure, and Function
Choi, Seunghyuk
Paek, Eunok
MutCombinator: identification of mutated peptides allowing combinatorial mutations using nucleotide-based graph search
title MutCombinator: identification of mutated peptides allowing combinatorial mutations using nucleotide-based graph search
title_full MutCombinator: identification of mutated peptides allowing combinatorial mutations using nucleotide-based graph search
title_fullStr MutCombinator: identification of mutated peptides allowing combinatorial mutations using nucleotide-based graph search
title_full_unstemmed MutCombinator: identification of mutated peptides allowing combinatorial mutations using nucleotide-based graph search
title_short MutCombinator: identification of mutated peptides allowing combinatorial mutations using nucleotide-based graph search
title_sort mutcombinator: identification of mutated peptides allowing combinatorial mutations using nucleotide-based graph search
topic Macromolecular Sequence, Structure, and Function
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355298/
https://www.ncbi.nlm.nih.gov/pubmed/32657416
http://dx.doi.org/10.1093/bioinformatics/btaa504
work_keys_str_mv AT choiseunghyuk mutcombinatoridentificationofmutatedpeptidesallowingcombinatorialmutationsusingnucleotidebasedgraphsearch
AT paekeunok mutcombinatoridentificationofmutatedpeptidesallowingcombinatorialmutationsusingnucleotidebasedgraphsearch