Cargando…
Integration of probabilistic functional networks without an external Gold Standard
BACKGROUND: Probabilistic functional integrated networks (PFINs) are designed to aid our understanding of cellular biology and can be used to generate testable hypotheses about protein function. PFINs are generally created by scoring the quality of interaction datasets against a Gold Standard datase...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9316706/ https://www.ncbi.nlm.nih.gov/pubmed/35879662 http://dx.doi.org/10.1186/s12859-022-04834-4 |
_version_ | 1784754881028947968 |
---|---|
author | James, Katherine Alsobhe, Aoesha Cockell, Simon J Wipat, Anil Pocock, Matthew |
author_facet | James, Katherine Alsobhe, Aoesha Cockell, Simon J Wipat, Anil Pocock, Matthew |
author_sort | James, Katherine |
collection | PubMed |
description | BACKGROUND: Probabilistic functional integrated networks (PFINs) are designed to aid our understanding of cellular biology and can be used to generate testable hypotheses about protein function. PFINs are generally created by scoring the quality of interaction datasets against a Gold Standard dataset, usually chosen from a separate high-quality data source, prior to their integration. Use of an external Gold Standard has several drawbacks, including data redundancy, data loss and the need for identifier mapping, which can complicate the network build and impact on PFIN performance. Additionally, there typically are no Gold Standard data for non-model organisms. RESULTS: We describe the development of an integration technique, ssNet, that scores and integrates both high-throughput and low-throughout data from a single source database in a consistent manner without the need for an external Gold Standard dataset. Using data from Saccharomyces cerevisiae we show that ssNet is easier and faster, overcoming the challenges of data redundancy, Gold Standard bias and ID mapping. In addition ssNet results in less loss of data and produces a more complete network. CONCLUSIONS: The ssNet method allows PFINs to be built successfully from a single database, while producing comparable network performance to networks scored using an external Gold Standard source and with reduced data loss. |
format | Online Article Text |
id | pubmed-9316706 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-93167062022-07-27 Integration of probabilistic functional networks without an external Gold Standard James, Katherine Alsobhe, Aoesha Cockell, Simon J Wipat, Anil Pocock, Matthew BMC Bioinformatics Research BACKGROUND: Probabilistic functional integrated networks (PFINs) are designed to aid our understanding of cellular biology and can be used to generate testable hypotheses about protein function. PFINs are generally created by scoring the quality of interaction datasets against a Gold Standard dataset, usually chosen from a separate high-quality data source, prior to their integration. Use of an external Gold Standard has several drawbacks, including data redundancy, data loss and the need for identifier mapping, which can complicate the network build and impact on PFIN performance. Additionally, there typically are no Gold Standard data for non-model organisms. RESULTS: We describe the development of an integration technique, ssNet, that scores and integrates both high-throughput and low-throughout data from a single source database in a consistent manner without the need for an external Gold Standard dataset. Using data from Saccharomyces cerevisiae we show that ssNet is easier and faster, overcoming the challenges of data redundancy, Gold Standard bias and ID mapping. In addition ssNet results in less loss of data and produces a more complete network. CONCLUSIONS: The ssNet method allows PFINs to be built successfully from a single database, while producing comparable network performance to networks scored using an external Gold Standard source and with reduced data loss. BioMed Central 2022-07-25 /pmc/articles/PMC9316706/ /pubmed/35879662 http://dx.doi.org/10.1186/s12859-022-04834-4 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research James, Katherine Alsobhe, Aoesha Cockell, Simon J Wipat, Anil Pocock, Matthew Integration of probabilistic functional networks without an external Gold Standard |
title | Integration of probabilistic functional networks without an external Gold Standard |
title_full | Integration of probabilistic functional networks without an external Gold Standard |
title_fullStr | Integration of probabilistic functional networks without an external Gold Standard |
title_full_unstemmed | Integration of probabilistic functional networks without an external Gold Standard |
title_short | Integration of probabilistic functional networks without an external Gold Standard |
title_sort | integration of probabilistic functional networks without an external gold standard |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9316706/ https://www.ncbi.nlm.nih.gov/pubmed/35879662 http://dx.doi.org/10.1186/s12859-022-04834-4 |
work_keys_str_mv | AT jameskatherine integrationofprobabilisticfunctionalnetworkswithoutanexternalgoldstandard AT alsobheaoesha integrationofprobabilisticfunctionalnetworkswithoutanexternalgoldstandard AT cockellsimonj integrationofprobabilisticfunctionalnetworkswithoutanexternalgoldstandard AT wipatanil integrationofprobabilisticfunctionalnetworkswithoutanexternalgoldstandard AT pocockmatthew integrationofprobabilisticfunctionalnetworkswithoutanexternalgoldstandard |