Cargando…
USNAP: fast unique dense region detection and its application to lung cancer
MOTIVATION: Many real-world problems can be modeled as annotated graphs. Scalable graph algorithms that extract actionable information from such data are in demand since these graphs are large, varying in topology, and have diverse node/edge annotations. When these graphs change over time they creat...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10425186/ https://www.ncbi.nlm.nih.gov/pubmed/37527019 http://dx.doi.org/10.1093/bioinformatics/btad477 |
_version_ | 1785089783592124416 |
---|---|
author | Wong, Serene W H Pastrello, Chiara Kotlyar, Max Faloutsos, Christos Jurisica, Igor |
author_facet | Wong, Serene W H Pastrello, Chiara Kotlyar, Max Faloutsos, Christos Jurisica, Igor |
author_sort | Wong, Serene W H |
collection | PubMed |
description | MOTIVATION: Many real-world problems can be modeled as annotated graphs. Scalable graph algorithms that extract actionable information from such data are in demand since these graphs are large, varying in topology, and have diverse node/edge annotations. When these graphs change over time they create dynamic graphs, and open the possibility to find patterns across different time points. In this article, we introduce a scalable algorithm that finds unique dense regions across time points in dynamic graphs. Such algorithms have applications in many different areas, including the biological, financial, and social domains. RESULTS: There are three important contributions to this manuscript. First, we designed a scalable algorithm, USNAP, to effectively identify dense subgraphs that are unique to a time stamp given a dynamic graph. Importantly, USNAP provides a lower bound of the density measure in each step of the greedy algorithm. Second, insights and understanding obtained from validating USNAP on real data show its effectiveness. While USNAP is domain independent, we applied it to four non-small cell lung cancer gene expression datasets. Stages in non-small cell lung cancer were modeled as dynamic graphs, and input to USNAP. Pathway enrichment analyses and comprehensive interpretations from literature show that USNAP identified biologically relevant mechanisms for different stages of cancer progression. Third, USNAP is scalable, and has a time complexity of [Formula: see text] , where m is the number of edges, and n is the number of vertices in the dynamic graph; [Formula: see text] is the number of edges, and [Formula: see text] is the number of vertices in the collapsed graph. AVAILABILITY AND IMPLEMENTATION: The code of USNAP is available at https://www.cs.utoronto.ca/~juris/data/USNAP22. |
format | Online Article Text |
id | pubmed-10425186 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-104251862023-08-15 USNAP: fast unique dense region detection and its application to lung cancer Wong, Serene W H Pastrello, Chiara Kotlyar, Max Faloutsos, Christos Jurisica, Igor Bioinformatics Original Paper MOTIVATION: Many real-world problems can be modeled as annotated graphs. Scalable graph algorithms that extract actionable information from such data are in demand since these graphs are large, varying in topology, and have diverse node/edge annotations. When these graphs change over time they create dynamic graphs, and open the possibility to find patterns across different time points. In this article, we introduce a scalable algorithm that finds unique dense regions across time points in dynamic graphs. Such algorithms have applications in many different areas, including the biological, financial, and social domains. RESULTS: There are three important contributions to this manuscript. First, we designed a scalable algorithm, USNAP, to effectively identify dense subgraphs that are unique to a time stamp given a dynamic graph. Importantly, USNAP provides a lower bound of the density measure in each step of the greedy algorithm. Second, insights and understanding obtained from validating USNAP on real data show its effectiveness. While USNAP is domain independent, we applied it to four non-small cell lung cancer gene expression datasets. Stages in non-small cell lung cancer were modeled as dynamic graphs, and input to USNAP. Pathway enrichment analyses and comprehensive interpretations from literature show that USNAP identified biologically relevant mechanisms for different stages of cancer progression. Third, USNAP is scalable, and has a time complexity of [Formula: see text] , where m is the number of edges, and n is the number of vertices in the dynamic graph; [Formula: see text] is the number of edges, and [Formula: see text] is the number of vertices in the collapsed graph. AVAILABILITY AND IMPLEMENTATION: The code of USNAP is available at https://www.cs.utoronto.ca/~juris/data/USNAP22. Oxford University Press 2023-08-01 /pmc/articles/PMC10425186/ /pubmed/37527019 http://dx.doi.org/10.1093/bioinformatics/btad477 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Wong, Serene W H Pastrello, Chiara Kotlyar, Max Faloutsos, Christos Jurisica, Igor USNAP: fast unique dense region detection and its application to lung cancer |
title |
USNAP: fast unique dense region detection and its application to lung cancer |
title_full |
USNAP: fast unique dense region detection and its application to lung cancer |
title_fullStr |
USNAP: fast unique dense region detection and its application to lung cancer |
title_full_unstemmed |
USNAP: fast unique dense region detection and its application to lung cancer |
title_short |
USNAP: fast unique dense region detection and its application to lung cancer |
title_sort | usnap: fast unique dense region detection and its application to lung cancer |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10425186/ https://www.ncbi.nlm.nih.gov/pubmed/37527019 http://dx.doi.org/10.1093/bioinformatics/btad477 |
work_keys_str_mv | AT wongserenewh usnapfastuniquedenseregiondetectionanditsapplicationtolungcancer AT pastrellochiara usnapfastuniquedenseregiondetectionanditsapplicationtolungcancer AT kotlyarmax usnapfastuniquedenseregiondetectionanditsapplicationtolungcancer AT faloutsoschristos usnapfastuniquedenseregiondetectionanditsapplicationtolungcancer AT jurisicaigor usnapfastuniquedenseregiondetectionanditsapplicationtolungcancer |