Cargando…

A Horizontal Alignment Tool for Numerical Trend Discovery in Sequence Data: Application to Protein Hydropathy

An algorithm is presented that returns the optimal pairwise gapped alignment of two sets of signed numerical sequence values. One distinguishing feature of this algorithm is a flexible comparison engine (based on both relative shape and absolute similarity measures) that does not rely on explicit ga...

Descripción completa

Detalles Bibliográficos
Autores principales: Hadzipasic, Omar, Wrabl, James O., Hilser, Vincent J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3794901/
https://www.ncbi.nlm.nih.gov/pubmed/24130469
http://dx.doi.org/10.1371/journal.pcbi.1003247
_version_ 1782287286932602880
author Hadzipasic, Omar
Wrabl, James O.
Hilser, Vincent J.
author_facet Hadzipasic, Omar
Wrabl, James O.
Hilser, Vincent J.
author_sort Hadzipasic, Omar
collection PubMed
description An algorithm is presented that returns the optimal pairwise gapped alignment of two sets of signed numerical sequence values. One distinguishing feature of this algorithm is a flexible comparison engine (based on both relative shape and absolute similarity measures) that does not rely on explicit gap penalties. Additionally, an empirical probability model is developed to estimate the significance of the returned alignment with respect to randomized data. The algorithm's utility for biological hypothesis formulation is demonstrated with test cases including database search and pairwise alignment of protein hydropathy. However, the algorithm and probability model could possibly be extended to accommodate other diverse types of protein or nucleic acid data, including positional thermodynamic stability and mRNA translation efficiency. The algorithm requires only numerical values as input and will readily compare data other than protein hydropathy. The tool is therefore expected to complement, rather than replace, existing sequence and structure based tools and may inform medical discovery, as exemplified by proposed similarity between a chlamydial ORFan protein and bacterial colicin pore-forming domain. The source code, documentation, and a basic web-server application are available.
format Online
Article
Text
id pubmed-3794901
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-37949012013-10-15 A Horizontal Alignment Tool for Numerical Trend Discovery in Sequence Data: Application to Protein Hydropathy Hadzipasic, Omar Wrabl, James O. Hilser, Vincent J. PLoS Comput Biol Research Article An algorithm is presented that returns the optimal pairwise gapped alignment of two sets of signed numerical sequence values. One distinguishing feature of this algorithm is a flexible comparison engine (based on both relative shape and absolute similarity measures) that does not rely on explicit gap penalties. Additionally, an empirical probability model is developed to estimate the significance of the returned alignment with respect to randomized data. The algorithm's utility for biological hypothesis formulation is demonstrated with test cases including database search and pairwise alignment of protein hydropathy. However, the algorithm and probability model could possibly be extended to accommodate other diverse types of protein or nucleic acid data, including positional thermodynamic stability and mRNA translation efficiency. The algorithm requires only numerical values as input and will readily compare data other than protein hydropathy. The tool is therefore expected to complement, rather than replace, existing sequence and structure based tools and may inform medical discovery, as exemplified by proposed similarity between a chlamydial ORFan protein and bacterial colicin pore-forming domain. The source code, documentation, and a basic web-server application are available. Public Library of Science 2013-10-10 /pmc/articles/PMC3794901/ /pubmed/24130469 http://dx.doi.org/10.1371/journal.pcbi.1003247 Text en © 2013 Hadzipasic et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Hadzipasic, Omar
Wrabl, James O.
Hilser, Vincent J.
A Horizontal Alignment Tool for Numerical Trend Discovery in Sequence Data: Application to Protein Hydropathy
title A Horizontal Alignment Tool for Numerical Trend Discovery in Sequence Data: Application to Protein Hydropathy
title_full A Horizontal Alignment Tool for Numerical Trend Discovery in Sequence Data: Application to Protein Hydropathy
title_fullStr A Horizontal Alignment Tool for Numerical Trend Discovery in Sequence Data: Application to Protein Hydropathy
title_full_unstemmed A Horizontal Alignment Tool for Numerical Trend Discovery in Sequence Data: Application to Protein Hydropathy
title_short A Horizontal Alignment Tool for Numerical Trend Discovery in Sequence Data: Application to Protein Hydropathy
title_sort horizontal alignment tool for numerical trend discovery in sequence data: application to protein hydropathy
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3794901/
https://www.ncbi.nlm.nih.gov/pubmed/24130469
http://dx.doi.org/10.1371/journal.pcbi.1003247
work_keys_str_mv AT hadzipasicomar ahorizontalalignmenttoolfornumericaltrenddiscoveryinsequencedataapplicationtoproteinhydropathy
AT wrabljameso ahorizontalalignmenttoolfornumericaltrenddiscoveryinsequencedataapplicationtoproteinhydropathy
AT hilservincentj ahorizontalalignmenttoolfornumericaltrenddiscoveryinsequencedataapplicationtoproteinhydropathy
AT hadzipasicomar horizontalalignmenttoolfornumericaltrenddiscoveryinsequencedataapplicationtoproteinhydropathy
AT wrabljameso horizontalalignmenttoolfornumericaltrenddiscoveryinsequencedataapplicationtoproteinhydropathy
AT hilservincentj horizontalalignmenttoolfornumericaltrenddiscoveryinsequencedataapplicationtoproteinhydropathy