Cargando…

NNAlign: A Web-Based Prediction Method Allowing Non-Expert End-User Discovery of Sequence Motifs in Quantitative Peptide Data

Recent advances in high-throughput technologies have made it possible to generate both gene and protein sequence data at an unprecedented rate and scale thereby enabling entirely new “omics”-based approaches towards the analysis of complex biological processes. However, the amount and complexity of...

Descripción completa

Detalles Bibliográficos
Autores principales: Andreatta, Massimo, Schafer-Nielsen, Claus, Lund, Ole, Buus, Søren, Nielsen, Morten
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3206854/
https://www.ncbi.nlm.nih.gov/pubmed/22073191
http://dx.doi.org/10.1371/journal.pone.0026781
_version_ 1782215495980679168
author Andreatta, Massimo
Schafer-Nielsen, Claus
Lund, Ole
Buus, Søren
Nielsen, Morten
author_facet Andreatta, Massimo
Schafer-Nielsen, Claus
Lund, Ole
Buus, Søren
Nielsen, Morten
author_sort Andreatta, Massimo
collection PubMed
description Recent advances in high-throughput technologies have made it possible to generate both gene and protein sequence data at an unprecedented rate and scale thereby enabling entirely new “omics”-based approaches towards the analysis of complex biological processes. However, the amount and complexity of data that even a single experiment can produce seriously challenges researchers with limited bioinformatics expertise, who need to handle, analyze and interpret the data before it can be understood in a biological context. Thus, there is an unmet need for tools allowing non-bioinformatics users to interpret large data sets. We have recently developed a method, NNAlign, which is generally applicable to any biological problem where quantitative peptide data is available. This method efficiently identifies underlying sequence patterns by simultaneously aligning peptide sequences and identifying motifs associated with quantitative readouts. Here, we provide a web-based implementation of NNAlign allowing non-expert end-users to submit their data (optionally adjusting method parameters), and in return receive a trained method (including a visual representation of the identified motif) that subsequently can be used as prediction method and applied to unknown proteins/peptides. We have successfully applied this method to several different data sets including peptide microarray-derived sets containing more than 100,000 data points. NNAlign is available online at http://www.cbs.dtu.dk/services/NNAlign.
format Online
Article
Text
id pubmed-3206854
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-32068542011-11-09 NNAlign: A Web-Based Prediction Method Allowing Non-Expert End-User Discovery of Sequence Motifs in Quantitative Peptide Data Andreatta, Massimo Schafer-Nielsen, Claus Lund, Ole Buus, Søren Nielsen, Morten PLoS One Research Article Recent advances in high-throughput technologies have made it possible to generate both gene and protein sequence data at an unprecedented rate and scale thereby enabling entirely new “omics”-based approaches towards the analysis of complex biological processes. However, the amount and complexity of data that even a single experiment can produce seriously challenges researchers with limited bioinformatics expertise, who need to handle, analyze and interpret the data before it can be understood in a biological context. Thus, there is an unmet need for tools allowing non-bioinformatics users to interpret large data sets. We have recently developed a method, NNAlign, which is generally applicable to any biological problem where quantitative peptide data is available. This method efficiently identifies underlying sequence patterns by simultaneously aligning peptide sequences and identifying motifs associated with quantitative readouts. Here, we provide a web-based implementation of NNAlign allowing non-expert end-users to submit their data (optionally adjusting method parameters), and in return receive a trained method (including a visual representation of the identified motif) that subsequently can be used as prediction method and applied to unknown proteins/peptides. We have successfully applied this method to several different data sets including peptide microarray-derived sets containing more than 100,000 data points. NNAlign is available online at http://www.cbs.dtu.dk/services/NNAlign. Public Library of Science 2011-11-02 /pmc/articles/PMC3206854/ /pubmed/22073191 http://dx.doi.org/10.1371/journal.pone.0026781 Text en Andreatta et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Andreatta, Massimo
Schafer-Nielsen, Claus
Lund, Ole
Buus, Søren
Nielsen, Morten
NNAlign: A Web-Based Prediction Method Allowing Non-Expert End-User Discovery of Sequence Motifs in Quantitative Peptide Data
title NNAlign: A Web-Based Prediction Method Allowing Non-Expert End-User Discovery of Sequence Motifs in Quantitative Peptide Data
title_full NNAlign: A Web-Based Prediction Method Allowing Non-Expert End-User Discovery of Sequence Motifs in Quantitative Peptide Data
title_fullStr NNAlign: A Web-Based Prediction Method Allowing Non-Expert End-User Discovery of Sequence Motifs in Quantitative Peptide Data
title_full_unstemmed NNAlign: A Web-Based Prediction Method Allowing Non-Expert End-User Discovery of Sequence Motifs in Quantitative Peptide Data
title_short NNAlign: A Web-Based Prediction Method Allowing Non-Expert End-User Discovery of Sequence Motifs in Quantitative Peptide Data
title_sort nnalign: a web-based prediction method allowing non-expert end-user discovery of sequence motifs in quantitative peptide data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3206854/
https://www.ncbi.nlm.nih.gov/pubmed/22073191
http://dx.doi.org/10.1371/journal.pone.0026781
work_keys_str_mv AT andreattamassimo nnalignawebbasedpredictionmethodallowingnonexpertenduserdiscoveryofsequencemotifsinquantitativepeptidedata
AT schafernielsenclaus nnalignawebbasedpredictionmethodallowingnonexpertenduserdiscoveryofsequencemotifsinquantitativepeptidedata
AT lundole nnalignawebbasedpredictionmethodallowingnonexpertenduserdiscoveryofsequencemotifsinquantitativepeptidedata
AT buussøren nnalignawebbasedpredictionmethodallowingnonexpertenduserdiscoveryofsequencemotifsinquantitativepeptidedata
AT nielsenmorten nnalignawebbasedpredictionmethodallowingnonexpertenduserdiscoveryofsequencemotifsinquantitativepeptidedata