Cargando…

USNAP: fast unique dense region detection and its application to lung cancer

MOTIVATION: Many real-world problems can be modeled as annotated graphs. Scalable graph algorithms that extract actionable information from such data are in demand since these graphs are large, varying in topology, and have diverse node/edge annotations. When these graphs change over time they creat...

Descripción completa

Detalles Bibliográficos
Autores principales: Wong, Serene W H, Pastrello, Chiara, Kotlyar, Max, Faloutsos, Christos, Jurisica, Igor
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10425186/
https://www.ncbi.nlm.nih.gov/pubmed/37527019
http://dx.doi.org/10.1093/bioinformatics/btad477
_version_ 1785089783592124416
author Wong, Serene W H
Pastrello, Chiara
Kotlyar, Max
Faloutsos, Christos
Jurisica, Igor
author_facet Wong, Serene W H
Pastrello, Chiara
Kotlyar, Max
Faloutsos, Christos
Jurisica, Igor
author_sort Wong, Serene W H
collection PubMed
description MOTIVATION: Many real-world problems can be modeled as annotated graphs. Scalable graph algorithms that extract actionable information from such data are in demand since these graphs are large, varying in topology, and have diverse node/edge annotations. When these graphs change over time they create dynamic graphs, and open the possibility to find patterns across different time points. In this article, we introduce a scalable algorithm that finds unique dense regions across time points in dynamic graphs. Such algorithms have applications in many different areas, including the biological, financial, and social domains. RESULTS: There are three important contributions to this manuscript. First, we designed a scalable algorithm, USNAP, to effectively identify dense subgraphs that are unique to a time stamp given a dynamic graph. Importantly, USNAP provides a lower bound of the density measure in each step of the greedy algorithm. Second, insights and understanding obtained from validating USNAP on real data show its effectiveness. While USNAP is domain independent, we applied it to four non-small cell lung cancer gene expression datasets. Stages in non-small cell lung cancer were modeled as dynamic graphs, and input to USNAP. Pathway enrichment analyses and comprehensive interpretations from literature show that USNAP identified biologically relevant mechanisms for different stages of cancer progression. Third, USNAP is scalable, and has a time complexity of [Formula: see text] , where m is the number of edges, and n is the number of vertices in the dynamic graph; [Formula: see text] is the number of edges, and [Formula: see text] is the number of vertices in the collapsed graph. AVAILABILITY AND IMPLEMENTATION: The code of USNAP is available at https://www.cs.utoronto.ca/~juris/data/USNAP22.
format Online
Article
Text
id pubmed-10425186
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-104251862023-08-15 USNAP: fast unique dense region detection and its application to lung cancer Wong, Serene W H Pastrello, Chiara Kotlyar, Max Faloutsos, Christos Jurisica, Igor Bioinformatics Original Paper MOTIVATION: Many real-world problems can be modeled as annotated graphs. Scalable graph algorithms that extract actionable information from such data are in demand since these graphs are large, varying in topology, and have diverse node/edge annotations. When these graphs change over time they create dynamic graphs, and open the possibility to find patterns across different time points. In this article, we introduce a scalable algorithm that finds unique dense regions across time points in dynamic graphs. Such algorithms have applications in many different areas, including the biological, financial, and social domains. RESULTS: There are three important contributions to this manuscript. First, we designed a scalable algorithm, USNAP, to effectively identify dense subgraphs that are unique to a time stamp given a dynamic graph. Importantly, USNAP provides a lower bound of the density measure in each step of the greedy algorithm. Second, insights and understanding obtained from validating USNAP on real data show its effectiveness. While USNAP is domain independent, we applied it to four non-small cell lung cancer gene expression datasets. Stages in non-small cell lung cancer were modeled as dynamic graphs, and input to USNAP. Pathway enrichment analyses and comprehensive interpretations from literature show that USNAP identified biologically relevant mechanisms for different stages of cancer progression. Third, USNAP is scalable, and has a time complexity of [Formula: see text] , where m is the number of edges, and n is the number of vertices in the dynamic graph; [Formula: see text] is the number of edges, and [Formula: see text] is the number of vertices in the collapsed graph. AVAILABILITY AND IMPLEMENTATION: The code of USNAP is available at https://www.cs.utoronto.ca/~juris/data/USNAP22. Oxford University Press 2023-08-01 /pmc/articles/PMC10425186/ /pubmed/37527019 http://dx.doi.org/10.1093/bioinformatics/btad477 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Wong, Serene W H
Pastrello, Chiara
Kotlyar, Max
Faloutsos, Christos
Jurisica, Igor
USNAP: fast unique dense region detection and its application to lung cancer
title USNAP: fast unique dense region detection and its application to lung cancer
title_full USNAP: fast unique dense region detection and its application to lung cancer
title_fullStr USNAP: fast unique dense region detection and its application to lung cancer
title_full_unstemmed USNAP: fast unique dense region detection and its application to lung cancer
title_short USNAP: fast unique dense region detection and its application to lung cancer
title_sort usnap: fast unique dense region detection and its application to lung cancer
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10425186/
https://www.ncbi.nlm.nih.gov/pubmed/37527019
http://dx.doi.org/10.1093/bioinformatics/btad477
work_keys_str_mv AT wongserenewh usnapfastuniquedenseregiondetectionanditsapplicationtolungcancer
AT pastrellochiara usnapfastuniquedenseregiondetectionanditsapplicationtolungcancer
AT kotlyarmax usnapfastuniquedenseregiondetectionanditsapplicationtolungcancer
AT faloutsoschristos usnapfastuniquedenseregiondetectionanditsapplicationtolungcancer
AT jurisicaigor usnapfastuniquedenseregiondetectionanditsapplicationtolungcancer