Cargando…
Supervised topological data analysis for MALDI mass spectrometry imaging applications
BACKGROUND: Matrix-assisted laser desorption/ionization mass spectrometry imaging (MALDI MSI) displays significant potential for applications in cancer research, especially in tumor typing and subtyping. Lung cancer is the primary cause of tumor-related deaths, where the most lethal entities are ade...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10334630/ https://www.ncbi.nlm.nih.gov/pubmed/37430224 http://dx.doi.org/10.1186/s12859-023-05402-0 |
_version_ | 1785070899346538496 |
---|---|
author | Klaila, Gideon Vutov, Vladimir Stefanou, Anastasios |
author_facet | Klaila, Gideon Vutov, Vladimir Stefanou, Anastasios |
author_sort | Klaila, Gideon |
collection | PubMed |
description | BACKGROUND: Matrix-assisted laser desorption/ionization mass spectrometry imaging (MALDI MSI) displays significant potential for applications in cancer research, especially in tumor typing and subtyping. Lung cancer is the primary cause of tumor-related deaths, where the most lethal entities are adenocarcinoma (ADC) and squamous cell carcinoma (SqCC). Distinguishing between these two common subtypes is crucial for therapy decisions and successful patient management. RESULTS: We propose a new algebraic topological framework, which obtains intrinsic information from MALDI data and transforms it to reflect topological persistence. Our framework offers two main advantages. Firstly, topological persistence aids in distinguishing the signal from noise. Secondly, it compresses the MALDI data, saving storage space and optimizes computational time for subsequent classification tasks. We present an algorithm that efficiently implements our topological framework, relying on a single tuning parameter. Afterwards, logistic regression and random forest classifiers are employed on the extracted persistence features, thereby accomplishing an automated tumor (sub-)typing process. To demonstrate the competitiveness of our proposed framework, we conduct experiments on a real-world MALDI dataset using cross-validation. Furthermore, we showcase the effectiveness of the single denoising parameter by evaluating its performance on synthetic MALDI images with varying levels of noise. CONCLUSION: Our empirical experiments demonstrate that the proposed algebraic topological framework successfully captures and leverages the intrinsic spectral information from MALDI data, leading to competitive results in classifying lung cancer subtypes. Moreover, the framework’s ability to be fine-tuned for denoising highlights its versatility and potential for enhancing data analysis in MALDI applications. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05402-0. |
format | Online Article Text |
id | pubmed-10334630 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-103346302023-07-12 Supervised topological data analysis for MALDI mass spectrometry imaging applications Klaila, Gideon Vutov, Vladimir Stefanou, Anastasios BMC Bioinformatics Research BACKGROUND: Matrix-assisted laser desorption/ionization mass spectrometry imaging (MALDI MSI) displays significant potential for applications in cancer research, especially in tumor typing and subtyping. Lung cancer is the primary cause of tumor-related deaths, where the most lethal entities are adenocarcinoma (ADC) and squamous cell carcinoma (SqCC). Distinguishing between these two common subtypes is crucial for therapy decisions and successful patient management. RESULTS: We propose a new algebraic topological framework, which obtains intrinsic information from MALDI data and transforms it to reflect topological persistence. Our framework offers two main advantages. Firstly, topological persistence aids in distinguishing the signal from noise. Secondly, it compresses the MALDI data, saving storage space and optimizes computational time for subsequent classification tasks. We present an algorithm that efficiently implements our topological framework, relying on a single tuning parameter. Afterwards, logistic regression and random forest classifiers are employed on the extracted persistence features, thereby accomplishing an automated tumor (sub-)typing process. To demonstrate the competitiveness of our proposed framework, we conduct experiments on a real-world MALDI dataset using cross-validation. Furthermore, we showcase the effectiveness of the single denoising parameter by evaluating its performance on synthetic MALDI images with varying levels of noise. CONCLUSION: Our empirical experiments demonstrate that the proposed algebraic topological framework successfully captures and leverages the intrinsic spectral information from MALDI data, leading to competitive results in classifying lung cancer subtypes. Moreover, the framework’s ability to be fine-tuned for denoising highlights its versatility and potential for enhancing data analysis in MALDI applications. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05402-0. BioMed Central 2023-07-10 /pmc/articles/PMC10334630/ /pubmed/37430224 http://dx.doi.org/10.1186/s12859-023-05402-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Klaila, Gideon Vutov, Vladimir Stefanou, Anastasios Supervised topological data analysis for MALDI mass spectrometry imaging applications |
title | Supervised topological data analysis for MALDI mass spectrometry imaging applications |
title_full | Supervised topological data analysis for MALDI mass spectrometry imaging applications |
title_fullStr | Supervised topological data analysis for MALDI mass spectrometry imaging applications |
title_full_unstemmed | Supervised topological data analysis for MALDI mass spectrometry imaging applications |
title_short | Supervised topological data analysis for MALDI mass spectrometry imaging applications |
title_sort | supervised topological data analysis for maldi mass spectrometry imaging applications |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10334630/ https://www.ncbi.nlm.nih.gov/pubmed/37430224 http://dx.doi.org/10.1186/s12859-023-05402-0 |
work_keys_str_mv | AT klailagideon supervisedtopologicaldataanalysisformaldimassspectrometryimagingapplications AT vutovvladimir supervisedtopologicaldataanalysisformaldimassspectrometryimagingapplications AT stefanouanastasios supervisedtopologicaldataanalysisformaldimassspectrometryimagingapplications |