Cargando…

Increasing Sequence Search Sensitivity with Transitive Alignments

Sequence alignment is an important bioinformatics tool for identifying homology, but searching against the full set of available sequences is likely to result in many hits to poorly annotated sequences providing very little information. Consequently, we often want alignments against a specific subse...

Descripción completa

Detalles Bibliográficos
Autores principales: Malde, Ketil, Furmanek, Tomasz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3573025/
https://www.ncbi.nlm.nih.gov/pubmed/23457449
http://dx.doi.org/10.1371/journal.pone.0054422
_version_ 1782259387781349376
author Malde, Ketil
Furmanek, Tomasz
author_facet Malde, Ketil
Furmanek, Tomasz
author_sort Malde, Ketil
collection PubMed
description Sequence alignment is an important bioinformatics tool for identifying homology, but searching against the full set of available sequences is likely to result in many hits to poorly annotated sequences providing very little information. Consequently, we often want alignments against a specific subset of sequences: for instance, we are looking for sequences from a particular species, sequences that have known 3d-structures, sequences that have a reliable (curated) function annotation, and so on. Although such subset databases are readily available, they only represent a small fraction of all sequences. Thus, the likelihood of finding close homologs for query sequences is smaller, and the alignments will in general have lower scores. This makes it difficult to distinguish hits to homologous sequences from random hits to unrelated sequences. Here, we propose a method that addresses this problem by first aligning query sequences against a large database representing the corpus of known sequences, and then constructing indirect (or transitive) alignments by combining the results with alignments from the large database against the desired target database. We compare the results to direct pairwise alignments, and show that our method gives us higher sensitivity alignments against the target database.
format Online
Article
Text
id pubmed-3573025
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-35730252013-03-01 Increasing Sequence Search Sensitivity with Transitive Alignments Malde, Ketil Furmanek, Tomasz PLoS One Research Article Sequence alignment is an important bioinformatics tool for identifying homology, but searching against the full set of available sequences is likely to result in many hits to poorly annotated sequences providing very little information. Consequently, we often want alignments against a specific subset of sequences: for instance, we are looking for sequences from a particular species, sequences that have known 3d-structures, sequences that have a reliable (curated) function annotation, and so on. Although such subset databases are readily available, they only represent a small fraction of all sequences. Thus, the likelihood of finding close homologs for query sequences is smaller, and the alignments will in general have lower scores. This makes it difficult to distinguish hits to homologous sequences from random hits to unrelated sequences. Here, we propose a method that addresses this problem by first aligning query sequences against a large database representing the corpus of known sequences, and then constructing indirect (or transitive) alignments by combining the results with alignments from the large database against the desired target database. We compare the results to direct pairwise alignments, and show that our method gives us higher sensitivity alignments against the target database. Public Library of Science 2013-02-14 /pmc/articles/PMC3573025/ /pubmed/23457449 http://dx.doi.org/10.1371/journal.pone.0054422 Text en © 2013 Malde, Furmanek http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Malde, Ketil
Furmanek, Tomasz
Increasing Sequence Search Sensitivity with Transitive Alignments
title Increasing Sequence Search Sensitivity with Transitive Alignments
title_full Increasing Sequence Search Sensitivity with Transitive Alignments
title_fullStr Increasing Sequence Search Sensitivity with Transitive Alignments
title_full_unstemmed Increasing Sequence Search Sensitivity with Transitive Alignments
title_short Increasing Sequence Search Sensitivity with Transitive Alignments
title_sort increasing sequence search sensitivity with transitive alignments
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3573025/
https://www.ncbi.nlm.nih.gov/pubmed/23457449
http://dx.doi.org/10.1371/journal.pone.0054422
work_keys_str_mv AT maldeketil increasingsequencesearchsensitivitywithtransitivealignments
AT furmanektomasz increasingsequencesearchsensitivitywithtransitivealignments