Cargando…
A survey of protein interaction data and multigenic inherited disorders
BACKGROUND: Multigenic diseases are often associated with protein complexes or interactions involved in the same pathway. We wanted to estimate to what extent this is true given a consolidated protein interaction data set. The study stresses data integration and data representation issues. RESULTS:...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3598893/ https://www.ncbi.nlm.nih.gov/pubmed/23398688 http://dx.doi.org/10.1186/1471-2105-14-47 |
_version_ | 1782262844276867072 |
---|---|
author | Mora, Antonio Michalickova, Katerina Donaldson, Ian M |
author_facet | Mora, Antonio Michalickova, Katerina Donaldson, Ian M |
author_sort | Mora, Antonio |
collection | PubMed |
description | BACKGROUND: Multigenic diseases are often associated with protein complexes or interactions involved in the same pathway. We wanted to estimate to what extent this is true given a consolidated protein interaction data set. The study stresses data integration and data representation issues. RESULTS: We constructed 497 multigenic disease groups from OMIM and tested for overlaps with interaction and pathway data. A total of 159 disease groups had significant overlaps with protein interaction data consolidated by iRefIndex. A further 68 disease overlaps were found only in the KEGG pathway database. No single database contained all significant overlaps thus stressing the importance of data integration. We also found that disease groups overlapped with all three interaction data types: n-ary, spoke-represented complexes and binary data – thus stressing the importance of considering each of these data types separately. CONCLUSIONS: Almost half of our multigenic disease groups could potentially be explained by protein complexes and pathways. However, the fact that no database or data type was able to cover all disease groups suggests that no single database has systematically covered all disease groups for potential related complex and pathway data. This survey provides a basis for further curation efforts to confirm and search for overlaps between diseases and interaction data. The accompanying R script can be used to reproduce the work and track progress in this area as databases change. Disease group overlaps can be further explored using the iRefscape plugin for Cytoscape. |
format | Online Article Text |
id | pubmed-3598893 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-35988932013-03-17 A survey of protein interaction data and multigenic inherited disorders Mora, Antonio Michalickova, Katerina Donaldson, Ian M BMC Bioinformatics Research Article BACKGROUND: Multigenic diseases are often associated with protein complexes or interactions involved in the same pathway. We wanted to estimate to what extent this is true given a consolidated protein interaction data set. The study stresses data integration and data representation issues. RESULTS: We constructed 497 multigenic disease groups from OMIM and tested for overlaps with interaction and pathway data. A total of 159 disease groups had significant overlaps with protein interaction data consolidated by iRefIndex. A further 68 disease overlaps were found only in the KEGG pathway database. No single database contained all significant overlaps thus stressing the importance of data integration. We also found that disease groups overlapped with all three interaction data types: n-ary, spoke-represented complexes and binary data – thus stressing the importance of considering each of these data types separately. CONCLUSIONS: Almost half of our multigenic disease groups could potentially be explained by protein complexes and pathways. However, the fact that no database or data type was able to cover all disease groups suggests that no single database has systematically covered all disease groups for potential related complex and pathway data. This survey provides a basis for further curation efforts to confirm and search for overlaps between diseases and interaction data. The accompanying R script can be used to reproduce the work and track progress in this area as databases change. Disease group overlaps can be further explored using the iRefscape plugin for Cytoscape. BioMed Central 2013-02-11 /pmc/articles/PMC3598893/ /pubmed/23398688 http://dx.doi.org/10.1186/1471-2105-14-47 Text en Copyright ©2013 Mora et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Mora, Antonio Michalickova, Katerina Donaldson, Ian M A survey of protein interaction data and multigenic inherited disorders |
title | A survey of protein interaction data and multigenic inherited disorders |
title_full | A survey of protein interaction data and multigenic inherited disorders |
title_fullStr | A survey of protein interaction data and multigenic inherited disorders |
title_full_unstemmed | A survey of protein interaction data and multigenic inherited disorders |
title_short | A survey of protein interaction data and multigenic inherited disorders |
title_sort | survey of protein interaction data and multigenic inherited disorders |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3598893/ https://www.ncbi.nlm.nih.gov/pubmed/23398688 http://dx.doi.org/10.1186/1471-2105-14-47 |
work_keys_str_mv | AT moraantonio asurveyofproteininteractiondataandmultigenicinheriteddisorders AT michalickovakaterina asurveyofproteininteractiondataandmultigenicinheriteddisorders AT donaldsonianm asurveyofproteininteractiondataandmultigenicinheriteddisorders AT moraantonio surveyofproteininteractiondataandmultigenicinheriteddisorders AT michalickovakaterina surveyofproteininteractiondataandmultigenicinheriteddisorders AT donaldsonianm surveyofproteininteractiondataandmultigenicinheriteddisorders |