Cargando…
Where differences resemble: sequence-feature analysis in curated databases of intrinsically disordered proteins
Intrinsic disorder (ID) in proteins is involved in crucial interactions in the living cell. As the importance of ID is increasingly recognized, so are detailed analyses aimed at its identification and characterization. An open question remains the existence of ID `flavors’ representing different sub...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6301333/ https://www.ncbi.nlm.nih.gov/pubmed/30576490 http://dx.doi.org/10.1093/database/bay127 |
_version_ | 1783381818491273216 |
---|---|
author | Necci, Marco Piovesan, Damiano Tosatto, Silvio C E |
author_facet | Necci, Marco Piovesan, Damiano Tosatto, Silvio C E |
author_sort | Necci, Marco |
collection | PubMed |
description | Intrinsic disorder (ID) in proteins is involved in crucial interactions in the living cell. As the importance of ID is increasingly recognized, so are detailed analyses aimed at its identification and characterization. An open question remains the existence of ID `flavors’ representing different sub-phenomena. Several databases collect manually curated examples of experimentally validated ID, focusing on apparently different aspects of this phenomenon. The recent update of MobiDB presented the opportunity to carry out an in-depth comparison of the content of these validated ID collections, namely DIBS, DisProt, IDEAL, MFIB, FuzDB, ELM and UniProt. In order to assess what is specific to different ID flavors, we analyzed relevant sequence-based features, such as amino acid composition, length, taxa and gene ontology terms, highlighting differences and similarities among datasets. Despite that, the majority of the considered features are not statistically different across databases, with the exception of ELM. FuzDB also shares half of its entries with DisProt. In general, different ID databases describe similar phenomena. DisProt, which is the largest database, better represents the entire spectrum of different disorder flavors and the corresponding sequence diversity. |
format | Online Article Text |
id | pubmed-6301333 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-63013332018-12-27 Where differences resemble: sequence-feature analysis in curated databases of intrinsically disordered proteins Necci, Marco Piovesan, Damiano Tosatto, Silvio C E Database (Oxford) Original Article Intrinsic disorder (ID) in proteins is involved in crucial interactions in the living cell. As the importance of ID is increasingly recognized, so are detailed analyses aimed at its identification and characterization. An open question remains the existence of ID `flavors’ representing different sub-phenomena. Several databases collect manually curated examples of experimentally validated ID, focusing on apparently different aspects of this phenomenon. The recent update of MobiDB presented the opportunity to carry out an in-depth comparison of the content of these validated ID collections, namely DIBS, DisProt, IDEAL, MFIB, FuzDB, ELM and UniProt. In order to assess what is specific to different ID flavors, we analyzed relevant sequence-based features, such as amino acid composition, length, taxa and gene ontology terms, highlighting differences and similarities among datasets. Despite that, the majority of the considered features are not statistically different across databases, with the exception of ELM. FuzDB also shares half of its entries with DisProt. In general, different ID databases describe similar phenomena. DisProt, which is the largest database, better represents the entire spectrum of different disorder flavors and the corresponding sequence diversity. Oxford University Press 2018-12-14 /pmc/articles/PMC6301333/ /pubmed/30576490 http://dx.doi.org/10.1093/database/bay127 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Necci, Marco Piovesan, Damiano Tosatto, Silvio C E Where differences resemble: sequence-feature analysis in curated databases of intrinsically disordered proteins |
title | Where differences resemble: sequence-feature analysis in curated databases of intrinsically disordered proteins |
title_full | Where differences resemble: sequence-feature analysis in curated databases of intrinsically disordered proteins |
title_fullStr | Where differences resemble: sequence-feature analysis in curated databases of intrinsically disordered proteins |
title_full_unstemmed | Where differences resemble: sequence-feature analysis in curated databases of intrinsically disordered proteins |
title_short | Where differences resemble: sequence-feature analysis in curated databases of intrinsically disordered proteins |
title_sort | where differences resemble: sequence-feature analysis in curated databases of intrinsically disordered proteins |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6301333/ https://www.ncbi.nlm.nih.gov/pubmed/30576490 http://dx.doi.org/10.1093/database/bay127 |
work_keys_str_mv | AT neccimarco wheredifferencesresemblesequencefeatureanalysisincurateddatabasesofintrinsicallydisorderedproteins AT piovesandamiano wheredifferencesresemblesequencefeatureanalysisincurateddatabasesofintrinsicallydisorderedproteins AT tosattosilvioce wheredifferencesresemblesequencefeatureanalysisincurateddatabasesofintrinsicallydisorderedproteins |