Cargando…

ProtoBug: functional families from the complete proteomes of insects

ProtoBug (http://www.protobug.cs.huji.ac.il) is a database and resource of protein families in Arthropod genomes. ProtoBug platform presents the relatedness of complete proteomes from 17 insects as well as a proteome of the crustacean, Daphnia pulex. The represented proteomes from insects include lo...

Descripción completa

Detalles Bibliográficos
Autores principales: Rappoport, Nadav, Linial, Michal
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4408594/
https://www.ncbi.nlm.nih.gov/pubmed/25911153
http://dx.doi.org/10.1093/database/bau122
_version_ 1782368069389123584
author Rappoport, Nadav
Linial, Michal
author_facet Rappoport, Nadav
Linial, Michal
author_sort Rappoport, Nadav
collection PubMed
description ProtoBug (http://www.protobug.cs.huji.ac.il) is a database and resource of protein families in Arthropod genomes. ProtoBug platform presents the relatedness of complete proteomes from 17 insects as well as a proteome of the crustacean, Daphnia pulex. The represented proteomes from insects include louse, bee, beetle, ants, flies and mosquitoes. Based on an unsupervised clustering method, protein sequences were clustered into a hierarchical tree, called ProtoBug. ProtoBug covers about 300 000 sequences that are partitioned to families. At the default setting, all sequences are partitioned to ∼20 000 families (excluding singletons). From the species perspective, each of the 18 analysed proteomes is composed of 5000–8000 families. In the regime of the advanced operational mode, the ProtoBug provides rich navigation capabilities for touring the hierarchy of the families at any selected resolution. A proteome viewer shows the composition of sequences from any of the 18 analysed proteomes. Using functional annotation from an expert system (Pfam) we assigned domains, families and repeats by 4400 keywords that cover 73% of the sequences. A strict inference protocol is applied for expanding the functional knowledge. Consequently, secured annotations were associated with 81% of the proteins, and with 70% of the families (≥10 proteins each). ProtoBug is a database and webtool with rich visualization and navigation tools. The properties of each family in relation to other families in the ProtoBug tree, and in view of the taxonomy composition are reported. Furthermore, the user can paste its own sequences to find relatedness to any of the ProtoBug families. The database and the navigation tools are the basis for functional discoveries that span 350 million years of evolution of Arthropods. ProtoBug is available with no restriction at: www.protobug.cs.huji.ac.il. Database URL: www.protobug.cs.huji.ac.il.
format Online
Article
Text
id pubmed-4408594
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-44085942015-06-26 ProtoBug: functional families from the complete proteomes of insects Rappoport, Nadav Linial, Michal Database (Oxford) Database Tool ProtoBug (http://www.protobug.cs.huji.ac.il) is a database and resource of protein families in Arthropod genomes. ProtoBug platform presents the relatedness of complete proteomes from 17 insects as well as a proteome of the crustacean, Daphnia pulex. The represented proteomes from insects include louse, bee, beetle, ants, flies and mosquitoes. Based on an unsupervised clustering method, protein sequences were clustered into a hierarchical tree, called ProtoBug. ProtoBug covers about 300 000 sequences that are partitioned to families. At the default setting, all sequences are partitioned to ∼20 000 families (excluding singletons). From the species perspective, each of the 18 analysed proteomes is composed of 5000–8000 families. In the regime of the advanced operational mode, the ProtoBug provides rich navigation capabilities for touring the hierarchy of the families at any selected resolution. A proteome viewer shows the composition of sequences from any of the 18 analysed proteomes. Using functional annotation from an expert system (Pfam) we assigned domains, families and repeats by 4400 keywords that cover 73% of the sequences. A strict inference protocol is applied for expanding the functional knowledge. Consequently, secured annotations were associated with 81% of the proteins, and with 70% of the families (≥10 proteins each). ProtoBug is a database and webtool with rich visualization and navigation tools. The properties of each family in relation to other families in the ProtoBug tree, and in view of the taxonomy composition are reported. Furthermore, the user can paste its own sequences to find relatedness to any of the ProtoBug families. The database and the navigation tools are the basis for functional discoveries that span 350 million years of evolution of Arthropods. ProtoBug is available with no restriction at: www.protobug.cs.huji.ac.il. Database URL: www.protobug.cs.huji.ac.il. Oxford University Press 2015-04-24 /pmc/articles/PMC4408594/ /pubmed/25911153 http://dx.doi.org/10.1093/database/bau122 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database Tool
Rappoport, Nadav
Linial, Michal
ProtoBug: functional families from the complete proteomes of insects
title ProtoBug: functional families from the complete proteomes of insects
title_full ProtoBug: functional families from the complete proteomes of insects
title_fullStr ProtoBug: functional families from the complete proteomes of insects
title_full_unstemmed ProtoBug: functional families from the complete proteomes of insects
title_short ProtoBug: functional families from the complete proteomes of insects
title_sort protobug: functional families from the complete proteomes of insects
topic Database Tool
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4408594/
https://www.ncbi.nlm.nih.gov/pubmed/25911153
http://dx.doi.org/10.1093/database/bau122
work_keys_str_mv AT rappoportnadav protobugfunctionalfamiliesfromthecompleteproteomesofinsects
AT linialmichal protobugfunctionalfamiliesfromthecompleteproteomesofinsects