Cargando…
A nonparametric significance test for sampled networks
MOTIVATION: Our work is motivated by an interest in constructing a protein–protein interaction network that captures key features associated with Parkinson’s disease. While there is an abundance of subnetwork construction methods available, it is often far from obvious which subnetwork is the most s...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5870844/ https://www.ncbi.nlm.nih.gov/pubmed/29036452 http://dx.doi.org/10.1093/bioinformatics/btx419 |
_version_ | 1783309555972702208 |
---|---|
author | Elliott, Andrew Leicht, Elizabeth Whitmore, Alan Reinert, Gesine Reed-Tsochas, Felix |
author_facet | Elliott, Andrew Leicht, Elizabeth Whitmore, Alan Reinert, Gesine Reed-Tsochas, Felix |
author_sort | Elliott, Andrew |
collection | PubMed |
description | MOTIVATION: Our work is motivated by an interest in constructing a protein–protein interaction network that captures key features associated with Parkinson’s disease. While there is an abundance of subnetwork construction methods available, it is often far from obvious which subnetwork is the most suitable starting point for further investigation. RESULTS: We provide a method to assess whether a subnetwork constructed from a seed list (a list of nodes known to be important in the area of interest) differs significantly from a randomly generated subnetwork. The proposed method uses a Monte Carlo approach. As different seed lists can give rise to the same subnetwork, we control for redundancy by constructing a minimal seed list as the starting point for the significance test. The null model is based on random seed lists of the same length as a minimum seed list that generates the subnetwork; in this random seed list the nodes have (approximately) the same degree distribution as the nodes in the minimum seed list. We use this null model to select subnetworks which deviate significantly from random on an appropriate set of statistics and might capture useful information for a real world protein–protein interaction network. AVAILABILITY AND IMPLEMENTATION: The software used in this paper are available for download at https://sites.google.com/site/elliottande/. The software is written in Python and uses the NetworkX library. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-5870844 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-58708442018-03-29 A nonparametric significance test for sampled networks Elliott, Andrew Leicht, Elizabeth Whitmore, Alan Reinert, Gesine Reed-Tsochas, Felix Bioinformatics Original Papers MOTIVATION: Our work is motivated by an interest in constructing a protein–protein interaction network that captures key features associated with Parkinson’s disease. While there is an abundance of subnetwork construction methods available, it is often far from obvious which subnetwork is the most suitable starting point for further investigation. RESULTS: We provide a method to assess whether a subnetwork constructed from a seed list (a list of nodes known to be important in the area of interest) differs significantly from a randomly generated subnetwork. The proposed method uses a Monte Carlo approach. As different seed lists can give rise to the same subnetwork, we control for redundancy by constructing a minimal seed list as the starting point for the significance test. The null model is based on random seed lists of the same length as a minimum seed list that generates the subnetwork; in this random seed list the nodes have (approximately) the same degree distribution as the nodes in the minimum seed list. We use this null model to select subnetworks which deviate significantly from random on an appropriate set of statistics and might capture useful information for a real world protein–protein interaction network. AVAILABILITY AND IMPLEMENTATION: The software used in this paper are available for download at https://sites.google.com/site/elliottande/. The software is written in Python and uses the NetworkX library. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-01-01 2017-07-07 /pmc/articles/PMC5870844/ /pubmed/29036452 http://dx.doi.org/10.1093/bioinformatics/btx419 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Elliott, Andrew Leicht, Elizabeth Whitmore, Alan Reinert, Gesine Reed-Tsochas, Felix A nonparametric significance test for sampled networks |
title | A nonparametric significance test for sampled networks |
title_full | A nonparametric significance test for sampled networks |
title_fullStr | A nonparametric significance test for sampled networks |
title_full_unstemmed | A nonparametric significance test for sampled networks |
title_short | A nonparametric significance test for sampled networks |
title_sort | nonparametric significance test for sampled networks |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5870844/ https://www.ncbi.nlm.nih.gov/pubmed/29036452 http://dx.doi.org/10.1093/bioinformatics/btx419 |
work_keys_str_mv | AT elliottandrew anonparametricsignificancetestforsamplednetworks AT leichtelizabeth anonparametricsignificancetestforsamplednetworks AT whitmorealan anonparametricsignificancetestforsamplednetworks AT reinertgesine anonparametricsignificancetestforsamplednetworks AT reedtsochasfelix anonparametricsignificancetestforsamplednetworks AT elliottandrew nonparametricsignificancetestforsamplednetworks AT leichtelizabeth nonparametricsignificancetestforsamplednetworks AT whitmorealan nonparametricsignificancetestforsamplednetworks AT reinertgesine nonparametricsignificancetestforsamplednetworks AT reedtsochasfelix nonparametricsignificancetestforsamplednetworks |