Cargando…
Homology-driven assembly of NOn-redundant protEin sequence sets (NOmESS) for mass spectrometry
Summary: To enable mass spectrometry (MS)-based proteomic studies with poorly characterized organisms, we developed a computational workflow for the homology-driven assembly of a non-redundant reference sequence dataset. In the automated pipeline, translated DNA sequences (e.g. ESTs, RNA deep-sequen...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4848398/ https://www.ncbi.nlm.nih.gov/pubmed/26743511 http://dx.doi.org/10.1093/bioinformatics/btv756 |
_version_ | 1782429334392274944 |
---|---|
author | Temu, Tikira Mann, Matthias Räschle, Markus Cox, Jürgen |
author_facet | Temu, Tikira Mann, Matthias Räschle, Markus Cox, Jürgen |
author_sort | Temu, Tikira |
collection | PubMed |
description | Summary: To enable mass spectrometry (MS)-based proteomic studies with poorly characterized organisms, we developed a computational workflow for the homology-driven assembly of a non-redundant reference sequence dataset. In the automated pipeline, translated DNA sequences (e.g. ESTs, RNA deep-sequencing data) are aligned to those of a closely related and fully sequenced organism. Representative sequences are derived from each cluster and joined, resulting in a non-redundant reference set representing the maximal available amino acid sequence information for each protein. We here applied NOmESS to assemble a reference database for the widely used model organism Xenopus laevis and demonstrate its use in proteomic applications. Availability and implementation: NOmESS is written in C#. The source code as well as the executables can be downloaded from http://www.biochem.mpg.de/cox. Execution of NOmESS requires BLASTp and cd-hit in addition. Contact: cox@biochem.mpg.de Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-4848398 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-48483982016-04-29 Homology-driven assembly of NOn-redundant protEin sequence sets (NOmESS) for mass spectrometry Temu, Tikira Mann, Matthias Räschle, Markus Cox, Jürgen Bioinformatics Applications Notes Summary: To enable mass spectrometry (MS)-based proteomic studies with poorly characterized organisms, we developed a computational workflow for the homology-driven assembly of a non-redundant reference sequence dataset. In the automated pipeline, translated DNA sequences (e.g. ESTs, RNA deep-sequencing data) are aligned to those of a closely related and fully sequenced organism. Representative sequences are derived from each cluster and joined, resulting in a non-redundant reference set representing the maximal available amino acid sequence information for each protein. We here applied NOmESS to assemble a reference database for the widely used model organism Xenopus laevis and demonstrate its use in proteomic applications. Availability and implementation: NOmESS is written in C#. The source code as well as the executables can be downloaded from http://www.biochem.mpg.de/cox. Execution of NOmESS requires BLASTp and cd-hit in addition. Contact: cox@biochem.mpg.de Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2016-05-01 2015-01-06 /pmc/articles/PMC4848398/ /pubmed/26743511 http://dx.doi.org/10.1093/bioinformatics/btv756 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Applications Notes Temu, Tikira Mann, Matthias Räschle, Markus Cox, Jürgen Homology-driven assembly of NOn-redundant protEin sequence sets (NOmESS) for mass spectrometry |
title | Homology-driven assembly of NOn-redundant protEin sequence sets (NOmESS) for mass spectrometry |
title_full | Homology-driven assembly of NOn-redundant protEin sequence sets (NOmESS) for mass spectrometry |
title_fullStr | Homology-driven assembly of NOn-redundant protEin sequence sets (NOmESS) for mass spectrometry |
title_full_unstemmed | Homology-driven assembly of NOn-redundant protEin sequence sets (NOmESS) for mass spectrometry |
title_short | Homology-driven assembly of NOn-redundant protEin sequence sets (NOmESS) for mass spectrometry |
title_sort | homology-driven assembly of non-redundant protein sequence sets (nomess) for mass spectrometry |
topic | Applications Notes |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4848398/ https://www.ncbi.nlm.nih.gov/pubmed/26743511 http://dx.doi.org/10.1093/bioinformatics/btv756 |
work_keys_str_mv | AT temutikira homologydrivenassemblyofnonredundantproteinsequencesetsnomessformassspectrometry AT mannmatthias homologydrivenassemblyofnonredundantproteinsequencesetsnomessformassspectrometry AT raschlemarkus homologydrivenassemblyofnonredundantproteinsequencesetsnomessformassspectrometry AT coxjurgen homologydrivenassemblyofnonredundantproteinsequencesetsnomessformassspectrometry |