Cargando…
ProtoNet 4.0: A hierarchical classification of one million protein sequences
ProtoNet is an automatic hierarchical classification of the protein sequence space. In 2004, the ProtoNet (version 4.0) presents the analysis of over one million proteins merged from SwissProt and TrEMBL databases. In addition to rich visualization and analysis tools to navigate the clustering hiera...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2005
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC539961/ https://www.ncbi.nlm.nih.gov/pubmed/15608180 http://dx.doi.org/10.1093/nar/gki007 |
_version_ | 1782122091191992320 |
---|---|
author | Kaplan, Noam Sasson, Ori Inbar, Uri Friedlich, Moriah Fromer, Menachem Fleischer, Hillel Portugaly, Elon Linial, Nathan Linial, Michal |
author_facet | Kaplan, Noam Sasson, Ori Inbar, Uri Friedlich, Moriah Fromer, Menachem Fleischer, Hillel Portugaly, Elon Linial, Nathan Linial, Michal |
author_sort | Kaplan, Noam |
collection | PubMed |
description | ProtoNet is an automatic hierarchical classification of the protein sequence space. In 2004, the ProtoNet (version 4.0) presents the analysis of over one million proteins merged from SwissProt and TrEMBL databases. In addition to rich visualization and analysis tools to navigate the clustering hierarchy, we incorporated several improvements that allow a simplified view of the scaffold of the proteins. An unsupervised, biologically valid method that was developed resulted in a condensation of the ProtoNet hierarchy to only 12% of the clusters. A large portion of these clusters was automatically assigned high confidence biological names according to their correspondence with functional annotations. ProtoNet is available at: http://www.protonet.cs.huji.ac.il. |
format | Text |
id | pubmed-539961 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2005 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-5399612005-01-04 ProtoNet 4.0: A hierarchical classification of one million protein sequences Kaplan, Noam Sasson, Ori Inbar, Uri Friedlich, Moriah Fromer, Menachem Fleischer, Hillel Portugaly, Elon Linial, Nathan Linial, Michal Nucleic Acids Res Articles ProtoNet is an automatic hierarchical classification of the protein sequence space. In 2004, the ProtoNet (version 4.0) presents the analysis of over one million proteins merged from SwissProt and TrEMBL databases. In addition to rich visualization and analysis tools to navigate the clustering hierarchy, we incorporated several improvements that allow a simplified view of the scaffold of the proteins. An unsupervised, biologically valid method that was developed resulted in a condensation of the ProtoNet hierarchy to only 12% of the clusters. A large portion of these clusters was automatically assigned high confidence biological names according to their correspondence with functional annotations. ProtoNet is available at: http://www.protonet.cs.huji.ac.il. Oxford University Press 2005-01-01 2004-12-17 /pmc/articles/PMC539961/ /pubmed/15608180 http://dx.doi.org/10.1093/nar/gki007 Text en Copyright © 2005 Oxford University Press |
spellingShingle | Articles Kaplan, Noam Sasson, Ori Inbar, Uri Friedlich, Moriah Fromer, Menachem Fleischer, Hillel Portugaly, Elon Linial, Nathan Linial, Michal ProtoNet 4.0: A hierarchical classification of one million protein sequences |
title | ProtoNet 4.0: A hierarchical classification of one million protein sequences |
title_full | ProtoNet 4.0: A hierarchical classification of one million protein sequences |
title_fullStr | ProtoNet 4.0: A hierarchical classification of one million protein sequences |
title_full_unstemmed | ProtoNet 4.0: A hierarchical classification of one million protein sequences |
title_short | ProtoNet 4.0: A hierarchical classification of one million protein sequences |
title_sort | protonet 4.0: a hierarchical classification of one million protein sequences |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC539961/ https://www.ncbi.nlm.nih.gov/pubmed/15608180 http://dx.doi.org/10.1093/nar/gki007 |
work_keys_str_mv | AT kaplannoam protonet40ahierarchicalclassificationofonemillionproteinsequences AT sassonori protonet40ahierarchicalclassificationofonemillionproteinsequences AT inbaruri protonet40ahierarchicalclassificationofonemillionproteinsequences AT friedlichmoriah protonet40ahierarchicalclassificationofonemillionproteinsequences AT fromermenachem protonet40ahierarchicalclassificationofonemillionproteinsequences AT fleischerhillel protonet40ahierarchicalclassificationofonemillionproteinsequences AT portugalyelon protonet40ahierarchicalclassificationofonemillionproteinsequences AT linialnathan protonet40ahierarchicalclassificationofonemillionproteinsequences AT linialmichal protonet40ahierarchicalclassificationofonemillionproteinsequences |