ProtoNet 4.0: A hierarchical classification of one million protein sequences

ProtoNet is an automatic hierarchical classification of the protein sequence space. In 2004, the ProtoNet (version 4.0) presents the analysis of over one million proteins merged from SwissProt and TrEMBL databases. In addition to rich visualization and analysis tools to navigate the clustering hiera...

Bibliographic Details
Main authors: Kaplan, Noam; Sasson, Ori; Inbar, Uri; Friedlich, Moriah; Fromer, Menachem; Fleischer, Hillel; Portugaly, Elon; Linial, Nathan; Linial, Michal
Format: Text
Language: English
Published: Oxford University Press 2005
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC539961/
https://www.ncbi.nlm.nih.gov/pubmed/15608180
http://dx.doi.org/10.1093/nar/gki007
_version_ 1782122091191992320
author Kaplan, Noam
Sasson, Ori
Inbar, Uri
Friedlich, Moriah
Fromer, Menachem
Fleischer, Hillel
Portugaly, Elon
Linial, Nathan
Linial, Michal
author_facet Kaplan, Noam
Sasson, Ori
Inbar, Uri
Friedlich, Moriah
Fromer, Menachem
Fleischer, Hillel
Portugaly, Elon
Linial, Nathan
Linial, Michal
author_sort Kaplan, Noam
collection PubMed
description ProtoNet is an automatic hierarchical classification of the protein sequence space. In 2004, the ProtoNet (version 4.0) presents the analysis of over one million proteins merged from SwissProt and TrEMBL databases. In addition to rich visualization and analysis tools to navigate the clustering hierarchy, we incorporated several improvements that allow a simplified view of the scaffold of the proteins. An unsupervised, biologically valid method that was developed resulted in a condensation of the ProtoNet hierarchy to only 12% of the clusters. A large portion of these clusters was automatically assigned high confidence biological names according to their correspondence with functional annotations. ProtoNet is available at: http://www.protonet.cs.huji.ac.il.
format Text
id pubmed-539961
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-5399612005-01-04 ProtoNet 4.0: A hierarchical classification of one million protein sequences Kaplan, Noam Sasson, Ori Inbar, Uri Friedlich, Moriah Fromer, Menachem Fleischer, Hillel Portugaly, Elon Linial, Nathan Linial, Michal Nucleic Acids Res Articles ProtoNet is an automatic hierarchical classification of the protein sequence space. In 2004, the ProtoNet (version 4.0) presents the analysis of over one million proteins merged from SwissProt and TrEMBL databases. In addition to rich visualization and analysis tools to navigate the clustering hierarchy, we incorporated several improvements that allow a simplified view of the scaffold of the proteins. An unsupervised, biologically valid method that was developed resulted in a condensation of the ProtoNet hierarchy to only 12% of the clusters. A large portion of these clusters was automatically assigned high confidence biological names according to their correspondence with functional annotations. ProtoNet is available at: http://www.protonet.cs.huji.ac.il. Oxford University Press 2005-01-01 2004-12-17 /pmc/articles/PMC539961/ /pubmed/15608180 http://dx.doi.org/10.1093/nar/gki007 Text en Copyright © 2005 Oxford University Press
spellingShingle Articles
Kaplan, Noam
Sasson, Ori
Inbar, Uri
Friedlich, Moriah
Fromer, Menachem
Fleischer, Hillel
Portugaly, Elon
Linial, Nathan
Linial, Michal
ProtoNet 4.0: A hierarchical classification of one million protein sequences
title ProtoNet 4.0: A hierarchical classification of one million protein sequences
title_full ProtoNet 4.0: A hierarchical classification of one million protein sequences
title_fullStr ProtoNet 4.0: A hierarchical classification of one million protein sequences
title_full_unstemmed ProtoNet 4.0: A hierarchical classification of one million protein sequences
title_short ProtoNet 4.0: A hierarchical classification of one million protein sequences
title_sort protonet 4.0: a hierarchical classification of one million protein sequences
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC539961/
https://www.ncbi.nlm.nih.gov/pubmed/15608180
http://dx.doi.org/10.1093/nar/gki007