Cargando…
pathDIP: an annotated resource for known and predicted human gene-pathway associations and pathway enrichment analysis
Molecular pathway data are essential in current computational and systems biology research. While there are many primary and integrated pathway databases, several challenges remain, including low proteome coverage (57%), low overlap across different databases, unavailability of direct information ab...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5210562/ https://www.ncbi.nlm.nih.gov/pubmed/27899558 http://dx.doi.org/10.1093/nar/gkw1082 |
_version_ | 1782490908412870656 |
---|---|
author | Rahmati, Sara Abovsky, Mark Pastrello, Chiara Jurisica, Igor |
author_facet | Rahmati, Sara Abovsky, Mark Pastrello, Chiara Jurisica, Igor |
author_sort | Rahmati, Sara |
collection | PubMed |
description | Molecular pathway data are essential in current computational and systems biology research. While there are many primary and integrated pathway databases, several challenges remain, including low proteome coverage (57%), low overlap across different databases, unavailability of direct information about underlying physical connectivity of pathway members, and high fraction of protein-coding genes without any pathway annotations, i.e. ‘pathway orphans’. In order to address all these challenges, we developed pathDIP, which integrates data from 20 source pathway databases, ‘core pathways’, with physical protein–protein interactions to predict biologically relevant protein–pathway associations, referred to as ‘extended pathways’. Cross-validation determined 71% recovery rate of our predictions. Data integration and predictions increase coverage of pathway annotations for protein-coding genes to 86%, and provide novel annotations for 5732 pathway orphans. PathDIP (http://ophid.utoronto.ca/pathdip) annotates 17 070 protein-coding genes with 4678 pathways, and provides multiple query, analysis and output options. |
format | Online Article Text |
id | pubmed-5210562 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-52105622017-01-05 pathDIP: an annotated resource for known and predicted human gene-pathway associations and pathway enrichment analysis Rahmati, Sara Abovsky, Mark Pastrello, Chiara Jurisica, Igor Nucleic Acids Res Database Issue Molecular pathway data are essential in current computational and systems biology research. While there are many primary and integrated pathway databases, several challenges remain, including low proteome coverage (57%), low overlap across different databases, unavailability of direct information about underlying physical connectivity of pathway members, and high fraction of protein-coding genes without any pathway annotations, i.e. ‘pathway orphans’. In order to address all these challenges, we developed pathDIP, which integrates data from 20 source pathway databases, ‘core pathways’, with physical protein–protein interactions to predict biologically relevant protein–pathway associations, referred to as ‘extended pathways’. Cross-validation determined 71% recovery rate of our predictions. Data integration and predictions increase coverage of pathway annotations for protein-coding genes to 86%, and provide novel annotations for 5732 pathway orphans. PathDIP (http://ophid.utoronto.ca/pathdip) annotates 17 070 protein-coding genes with 4678 pathways, and provides multiple query, analysis and output options. Oxford University Press 2017-01-04 2016-11-29 /pmc/articles/PMC5210562/ /pubmed/27899558 http://dx.doi.org/10.1093/nar/gkw1082 Text en © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Database Issue Rahmati, Sara Abovsky, Mark Pastrello, Chiara Jurisica, Igor pathDIP: an annotated resource for known and predicted human gene-pathway associations and pathway enrichment analysis |
title | pathDIP: an annotated resource for known and predicted human gene-pathway associations and pathway enrichment analysis |
title_full | pathDIP: an annotated resource for known and predicted human gene-pathway associations and pathway enrichment analysis |
title_fullStr | pathDIP: an annotated resource for known and predicted human gene-pathway associations and pathway enrichment analysis |
title_full_unstemmed | pathDIP: an annotated resource for known and predicted human gene-pathway associations and pathway enrichment analysis |
title_short | pathDIP: an annotated resource for known and predicted human gene-pathway associations and pathway enrichment analysis |
title_sort | pathdip: an annotated resource for known and predicted human gene-pathway associations and pathway enrichment analysis |
topic | Database Issue |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5210562/ https://www.ncbi.nlm.nih.gov/pubmed/27899558 http://dx.doi.org/10.1093/nar/gkw1082 |
work_keys_str_mv | AT rahmatisara pathdipanannotatedresourceforknownandpredictedhumangenepathwayassociationsandpathwayenrichmentanalysis AT abovskymark pathdipanannotatedresourceforknownandpredictedhumangenepathwayassociationsandpathwayenrichmentanalysis AT pastrellochiara pathdipanannotatedresourceforknownandpredictedhumangenepathwayassociationsandpathwayenrichmentanalysis AT jurisicaigor pathdipanannotatedresourceforknownandpredictedhumangenepathwayassociationsandpathwayenrichmentanalysis |