Cargando…

pathDIP: an annotated resource for known and predicted human gene-pathway associations and pathway enrichment analysis

Molecular pathway data are essential in current computational and systems biology research. While there are many primary and integrated pathway databases, several challenges remain, including low proteome coverage (57%), low overlap across different databases, unavailability of direct information ab...

Descripción completa

Detalles Bibliográficos
Autores principales: Rahmati, Sara, Abovsky, Mark, Pastrello, Chiara, Jurisica, Igor
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5210562/
https://www.ncbi.nlm.nih.gov/pubmed/27899558
http://dx.doi.org/10.1093/nar/gkw1082
_version_ 1782490908412870656
author Rahmati, Sara
Abovsky, Mark
Pastrello, Chiara
Jurisica, Igor
author_facet Rahmati, Sara
Abovsky, Mark
Pastrello, Chiara
Jurisica, Igor
author_sort Rahmati, Sara
collection PubMed
description Molecular pathway data are essential in current computational and systems biology research. While there are many primary and integrated pathway databases, several challenges remain, including low proteome coverage (57%), low overlap across different databases, unavailability of direct information about underlying physical connectivity of pathway members, and high fraction of protein-coding genes without any pathway annotations, i.e. ‘pathway orphans’. In order to address all these challenges, we developed pathDIP, which integrates data from 20 source pathway databases, ‘core pathways’, with physical protein–protein interactions to predict biologically relevant protein–pathway associations, referred to as ‘extended pathways’. Cross-validation determined 71% recovery rate of our predictions. Data integration and predictions increase coverage of pathway annotations for protein-coding genes to 86%, and provide novel annotations for 5732 pathway orphans. PathDIP (http://ophid.utoronto.ca/pathdip) annotates 17 070 protein-coding genes with 4678 pathways, and provides multiple query, analysis and output options.
format Online
Article
Text
id pubmed-5210562
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-52105622017-01-05 pathDIP: an annotated resource for known and predicted human gene-pathway associations and pathway enrichment analysis Rahmati, Sara Abovsky, Mark Pastrello, Chiara Jurisica, Igor Nucleic Acids Res Database Issue Molecular pathway data are essential in current computational and systems biology research. While there are many primary and integrated pathway databases, several challenges remain, including low proteome coverage (57%), low overlap across different databases, unavailability of direct information about underlying physical connectivity of pathway members, and high fraction of protein-coding genes without any pathway annotations, i.e. ‘pathway orphans’. In order to address all these challenges, we developed pathDIP, which integrates data from 20 source pathway databases, ‘core pathways’, with physical protein–protein interactions to predict biologically relevant protein–pathway associations, referred to as ‘extended pathways’. Cross-validation determined 71% recovery rate of our predictions. Data integration and predictions increase coverage of pathway annotations for protein-coding genes to 86%, and provide novel annotations for 5732 pathway orphans. PathDIP (http://ophid.utoronto.ca/pathdip) annotates 17 070 protein-coding genes with 4678 pathways, and provides multiple query, analysis and output options. Oxford University Press 2017-01-04 2016-11-29 /pmc/articles/PMC5210562/ /pubmed/27899558 http://dx.doi.org/10.1093/nar/gkw1082 Text en © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Database Issue
Rahmati, Sara
Abovsky, Mark
Pastrello, Chiara
Jurisica, Igor
pathDIP: an annotated resource for known and predicted human gene-pathway associations and pathway enrichment analysis
title pathDIP: an annotated resource for known and predicted human gene-pathway associations and pathway enrichment analysis
title_full pathDIP: an annotated resource for known and predicted human gene-pathway associations and pathway enrichment analysis
title_fullStr pathDIP: an annotated resource for known and predicted human gene-pathway associations and pathway enrichment analysis
title_full_unstemmed pathDIP: an annotated resource for known and predicted human gene-pathway associations and pathway enrichment analysis
title_short pathDIP: an annotated resource for known and predicted human gene-pathway associations and pathway enrichment analysis
title_sort pathdip: an annotated resource for known and predicted human gene-pathway associations and pathway enrichment analysis
topic Database Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5210562/
https://www.ncbi.nlm.nih.gov/pubmed/27899558
http://dx.doi.org/10.1093/nar/gkw1082
work_keys_str_mv AT rahmatisara pathdipanannotatedresourceforknownandpredictedhumangenepathwayassociationsandpathwayenrichmentanalysis
AT abovskymark pathdipanannotatedresourceforknownandpredictedhumangenepathwayassociationsandpathwayenrichmentanalysis
AT pastrellochiara pathdipanannotatedresourceforknownandpredictedhumangenepathwayassociationsandpathwayenrichmentanalysis
AT jurisicaigor pathdipanannotatedresourceforknownandpredictedhumangenepathwayassociationsandpathwayenrichmentanalysis