Cargando…

PICKLE 2.0: A human protein-protein interaction meta-database employing data integration via genetic information ontology

It has been acknowledged that source databases recording experimentally supported human protein-protein interactions (PPIs) exhibit limited overlap. Thus, the reconstruction of a comprehensive PPI network requires appropriate integration of multiple heterogeneous primary datasets, presenting the PPI...

Descripción completa

Detalles Bibliográficos
Autores principales: Gioutlakis, Aris, Klapa, Maria I., Moschonas, Nicholas K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5638325/
https://www.ncbi.nlm.nih.gov/pubmed/29023571
http://dx.doi.org/10.1371/journal.pone.0186039
_version_ 1783270725245730816
author Gioutlakis, Aris
Klapa, Maria I.
Moschonas, Nicholas K.
author_facet Gioutlakis, Aris
Klapa, Maria I.
Moschonas, Nicholas K.
author_sort Gioutlakis, Aris
collection PubMed
description It has been acknowledged that source databases recording experimentally supported human protein-protein interactions (PPIs) exhibit limited overlap. Thus, the reconstruction of a comprehensive PPI network requires appropriate integration of multiple heterogeneous primary datasets, presenting the PPIs at various genetic reference levels. Existing PPI meta-databases perform integration via normalization; namely, PPIs are merged after converted to a certain target level. Hence, the node set of the integrated network depends each time on the number and type of the combined datasets. Moreover, the irreversible a priori normalization process hinders the identification of normalization artifacts in the integrated network, which originate from the nonlinearity characterizing the genetic information flow. PICKLE (Protein InteraCtion KnowLedgebasE) 2.0 implements a new architecture for this recently introduced human PPI meta-database. Its main novel feature over the existing meta-databases is its approach to primary PPI dataset integration via genetic information ontology. Building upon the PICKLE principles of using the reviewed human complete proteome (RHCP) of UniProtKB/Swiss-Prot as the reference protein interactor set, and filtering out protein interactions with low probability of being direct based on the available evidence, PICKLE 2.0 first assembles the RHCP genetic information ontology network by connecting the corresponding genes, nucleotide sequences (mRNAs) and proteins (UniProt entries) and then integrates PPI datasets by superimposing them on the ontology network without any a priori transformations. Importantly, this process allows the resulting heterogeneous integrated network to be reversibly normalized to any level of genetic reference without loss of the original information, the latter being used for identification of normalization biases, and enables the appraisal of potential false positive interactions through PPI source database cross-checking. The PICKLE web-based interface (www.pickle.gr) allows for the simultaneous query of multiple entities and provides integrated human PPI networks at either the protein (UniProt) or the gene level, at three PPI filtering modes.
format Online
Article
Text
id pubmed-5638325
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-56383252017-10-20 PICKLE 2.0: A human protein-protein interaction meta-database employing data integration via genetic information ontology Gioutlakis, Aris Klapa, Maria I. Moschonas, Nicholas K. PLoS One Research Article It has been acknowledged that source databases recording experimentally supported human protein-protein interactions (PPIs) exhibit limited overlap. Thus, the reconstruction of a comprehensive PPI network requires appropriate integration of multiple heterogeneous primary datasets, presenting the PPIs at various genetic reference levels. Existing PPI meta-databases perform integration via normalization; namely, PPIs are merged after converted to a certain target level. Hence, the node set of the integrated network depends each time on the number and type of the combined datasets. Moreover, the irreversible a priori normalization process hinders the identification of normalization artifacts in the integrated network, which originate from the nonlinearity characterizing the genetic information flow. PICKLE (Protein InteraCtion KnowLedgebasE) 2.0 implements a new architecture for this recently introduced human PPI meta-database. Its main novel feature over the existing meta-databases is its approach to primary PPI dataset integration via genetic information ontology. Building upon the PICKLE principles of using the reviewed human complete proteome (RHCP) of UniProtKB/Swiss-Prot as the reference protein interactor set, and filtering out protein interactions with low probability of being direct based on the available evidence, PICKLE 2.0 first assembles the RHCP genetic information ontology network by connecting the corresponding genes, nucleotide sequences (mRNAs) and proteins (UniProt entries) and then integrates PPI datasets by superimposing them on the ontology network without any a priori transformations. Importantly, this process allows the resulting heterogeneous integrated network to be reversibly normalized to any level of genetic reference without loss of the original information, the latter being used for identification of normalization biases, and enables the appraisal of potential false positive interactions through PPI source database cross-checking. The PICKLE web-based interface (www.pickle.gr) allows for the simultaneous query of multiple entities and provides integrated human PPI networks at either the protein (UniProt) or the gene level, at three PPI filtering modes. Public Library of Science 2017-10-12 /pmc/articles/PMC5638325/ /pubmed/29023571 http://dx.doi.org/10.1371/journal.pone.0186039 Text en © 2017 Gioutlakis et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Gioutlakis, Aris
Klapa, Maria I.
Moschonas, Nicholas K.
PICKLE 2.0: A human protein-protein interaction meta-database employing data integration via genetic information ontology
title PICKLE 2.0: A human protein-protein interaction meta-database employing data integration via genetic information ontology
title_full PICKLE 2.0: A human protein-protein interaction meta-database employing data integration via genetic information ontology
title_fullStr PICKLE 2.0: A human protein-protein interaction meta-database employing data integration via genetic information ontology
title_full_unstemmed PICKLE 2.0: A human protein-protein interaction meta-database employing data integration via genetic information ontology
title_short PICKLE 2.0: A human protein-protein interaction meta-database employing data integration via genetic information ontology
title_sort pickle 2.0: a human protein-protein interaction meta-database employing data integration via genetic information ontology
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5638325/
https://www.ncbi.nlm.nih.gov/pubmed/29023571
http://dx.doi.org/10.1371/journal.pone.0186039
work_keys_str_mv AT gioutlakisaris pickle20ahumanproteinproteininteractionmetadatabaseemployingdataintegrationviageneticinformationontology
AT klapamariai pickle20ahumanproteinproteininteractionmetadatabaseemployingdataintegrationviageneticinformationontology
AT moschonasnicholask pickle20ahumanproteinproteininteractionmetadatabaseemployingdataintegrationviageneticinformationontology