Cargando…

ProteinWorldDB: querying radical pairwise alignments among protein sets from complete genomes

Motivation: Many analyses in modern biological research are based on comparisons between biological sequences, resulting in functional, evolutionary and structural inferences. When large numbers of sequences are compared, heuristics are often used resulting in a certain lack of accuracy. In order to...

Descripción completa

Detalles Bibliográficos
Autores principales: Otto, Thomas Dan, Catanho, Marcos, Tristão, Cristian, Bezerra, Márcia, Fernandes, Renan Mathias, Elias, Guilherme Steinberger, Scaglia, Alexandre Capeletto, Bovermann, Bill, Berstis, Viktors, Lifschitz, Sergio, de Miranda, Antonio Basílio, Degrave, Wim
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2828119/
https://www.ncbi.nlm.nih.gov/pubmed/20089515
http://dx.doi.org/10.1093/bioinformatics/btq011
_version_ 1782177994286039040
author Otto, Thomas Dan
Catanho, Marcos
Tristão, Cristian
Bezerra, Márcia
Fernandes, Renan Mathias
Elias, Guilherme Steinberger
Scaglia, Alexandre Capeletto
Bovermann, Bill
Berstis, Viktors
Lifschitz, Sergio
de Miranda, Antonio Basílio
Degrave, Wim
author_facet Otto, Thomas Dan
Catanho, Marcos
Tristão, Cristian
Bezerra, Márcia
Fernandes, Renan Mathias
Elias, Guilherme Steinberger
Scaglia, Alexandre Capeletto
Bovermann, Bill
Berstis, Viktors
Lifschitz, Sergio
de Miranda, Antonio Basílio
Degrave, Wim
author_sort Otto, Thomas Dan
collection PubMed
description Motivation: Many analyses in modern biological research are based on comparisons between biological sequences, resulting in functional, evolutionary and structural inferences. When large numbers of sequences are compared, heuristics are often used resulting in a certain lack of accuracy. In order to improve and validate results of such comparisons, we have performed radical all-against-all comparisons of 4 million protein sequences belonging to the RefSeq database, using an implementation of the Smith–Waterman algorithm. This extremely intensive computational approach was made possible with the help of World Community Grid™, through the Genome Comparison Project. The resulting database, ProteinWorldDB, which contains coordinates of pairwise protein alignments and their respective scores, is now made available. Users can download, compare and analyze the results, filtered by genomes, protein functions or clusters. ProteinWorldDB is integrated with annotations derived from Swiss-Prot, Pfam, KEGG, NCBI Taxonomy database and gene ontology. The database is a unique and valuable asset, representing a major effort to create a reliable and consistent dataset of cross-comparisons of the whole protein content encoded in hundreds of completely sequenced genomes using a rigorous dynamic programming approach. Availability: The database can be accessed through http://proteinworlddb.org Contact: otto@fiocruz.br
format Text
id pubmed-2828119
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-28281192010-02-25 ProteinWorldDB: querying radical pairwise alignments among protein sets from complete genomes Otto, Thomas Dan Catanho, Marcos Tristão, Cristian Bezerra, Márcia Fernandes, Renan Mathias Elias, Guilherme Steinberger Scaglia, Alexandre Capeletto Bovermann, Bill Berstis, Viktors Lifschitz, Sergio de Miranda, Antonio Basílio Degrave, Wim Bioinformatics Applications Note Motivation: Many analyses in modern biological research are based on comparisons between biological sequences, resulting in functional, evolutionary and structural inferences. When large numbers of sequences are compared, heuristics are often used resulting in a certain lack of accuracy. In order to improve and validate results of such comparisons, we have performed radical all-against-all comparisons of 4 million protein sequences belonging to the RefSeq database, using an implementation of the Smith–Waterman algorithm. This extremely intensive computational approach was made possible with the help of World Community Grid™, through the Genome Comparison Project. The resulting database, ProteinWorldDB, which contains coordinates of pairwise protein alignments and their respective scores, is now made available. Users can download, compare and analyze the results, filtered by genomes, protein functions or clusters. ProteinWorldDB is integrated with annotations derived from Swiss-Prot, Pfam, KEGG, NCBI Taxonomy database and gene ontology. The database is a unique and valuable asset, representing a major effort to create a reliable and consistent dataset of cross-comparisons of the whole protein content encoded in hundreds of completely sequenced genomes using a rigorous dynamic programming approach. Availability: The database can be accessed through http://proteinworlddb.org Contact: otto@fiocruz.br Oxford University Press 2010-03-01 2010-01-19 /pmc/articles/PMC2828119/ /pubmed/20089515 http://dx.doi.org/10.1093/bioinformatics/btq011 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Note
Otto, Thomas Dan
Catanho, Marcos
Tristão, Cristian
Bezerra, Márcia
Fernandes, Renan Mathias
Elias, Guilherme Steinberger
Scaglia, Alexandre Capeletto
Bovermann, Bill
Berstis, Viktors
Lifschitz, Sergio
de Miranda, Antonio Basílio
Degrave, Wim
ProteinWorldDB: querying radical pairwise alignments among protein sets from complete genomes
title ProteinWorldDB: querying radical pairwise alignments among protein sets from complete genomes
title_full ProteinWorldDB: querying radical pairwise alignments among protein sets from complete genomes
title_fullStr ProteinWorldDB: querying radical pairwise alignments among protein sets from complete genomes
title_full_unstemmed ProteinWorldDB: querying radical pairwise alignments among protein sets from complete genomes
title_short ProteinWorldDB: querying radical pairwise alignments among protein sets from complete genomes
title_sort proteinworlddb: querying radical pairwise alignments among protein sets from complete genomes
topic Applications Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2828119/
https://www.ncbi.nlm.nih.gov/pubmed/20089515
http://dx.doi.org/10.1093/bioinformatics/btq011
work_keys_str_mv AT ottothomasdan proteinworlddbqueryingradicalpairwisealignmentsamongproteinsetsfromcompletegenomes
AT catanhomarcos proteinworlddbqueryingradicalpairwisealignmentsamongproteinsetsfromcompletegenomes
AT tristaocristian proteinworlddbqueryingradicalpairwisealignmentsamongproteinsetsfromcompletegenomes
AT bezerramarcia proteinworlddbqueryingradicalpairwisealignmentsamongproteinsetsfromcompletegenomes
AT fernandesrenanmathias proteinworlddbqueryingradicalpairwisealignmentsamongproteinsetsfromcompletegenomes
AT eliasguilhermesteinberger proteinworlddbqueryingradicalpairwisealignmentsamongproteinsetsfromcompletegenomes
AT scagliaalexandrecapeletto proteinworlddbqueryingradicalpairwisealignmentsamongproteinsetsfromcompletegenomes
AT bovermannbill proteinworlddbqueryingradicalpairwisealignmentsamongproteinsetsfromcompletegenomes
AT berstisviktors proteinworlddbqueryingradicalpairwisealignmentsamongproteinsetsfromcompletegenomes
AT lifschitzsergio proteinworlddbqueryingradicalpairwisealignmentsamongproteinsetsfromcompletegenomes
AT demirandaantoniobasilio proteinworlddbqueryingradicalpairwisealignmentsamongproteinsetsfromcompletegenomes
AT degravewim proteinworlddbqueryingradicalpairwisealignmentsamongproteinsetsfromcompletegenomes