Cargando…

Supervised topological data analysis for MALDI mass spectrometry imaging applications

BACKGROUND: Matrix-assisted laser desorption/ionization mass spectrometry imaging (MALDI MSI) displays significant potential for applications in cancer research, especially in tumor typing and subtyping. Lung cancer is the primary cause of tumor-related deaths, where the most lethal entities are ade...

Descripción completa

Detalles Bibliográficos
Autores principales: Klaila, Gideon, Vutov, Vladimir, Stefanou, Anastasios
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10334630/
https://www.ncbi.nlm.nih.gov/pubmed/37430224
http://dx.doi.org/10.1186/s12859-023-05402-0
_version_ 1785070899346538496
author Klaila, Gideon
Vutov, Vladimir
Stefanou, Anastasios
author_facet Klaila, Gideon
Vutov, Vladimir
Stefanou, Anastasios
author_sort Klaila, Gideon
collection PubMed
description BACKGROUND: Matrix-assisted laser desorption/ionization mass spectrometry imaging (MALDI MSI) displays significant potential for applications in cancer research, especially in tumor typing and subtyping. Lung cancer is the primary cause of tumor-related deaths, where the most lethal entities are adenocarcinoma (ADC) and squamous cell carcinoma (SqCC). Distinguishing between these two common subtypes is crucial for therapy decisions and successful patient management. RESULTS: We propose a new algebraic topological framework, which obtains intrinsic information from MALDI data and transforms it to reflect topological persistence. Our framework offers two main advantages. Firstly, topological persistence aids in distinguishing the signal from noise. Secondly, it compresses the MALDI data, saving storage space and optimizes computational time for subsequent classification tasks. We present an algorithm that efficiently implements our topological framework, relying on a single tuning parameter. Afterwards, logistic regression and random forest classifiers are employed on the extracted persistence features, thereby accomplishing an automated tumor (sub-)typing process. To demonstrate the competitiveness of our proposed framework, we conduct experiments on a real-world MALDI dataset using cross-validation. Furthermore, we showcase the effectiveness of the single denoising parameter by evaluating its performance on synthetic MALDI images with varying levels of noise. CONCLUSION: Our empirical experiments demonstrate that the proposed algebraic topological framework successfully captures and leverages the intrinsic spectral information from MALDI data, leading to competitive results in classifying lung cancer subtypes. Moreover, the framework’s ability to be fine-tuned for denoising highlights its versatility and potential for enhancing data analysis in MALDI applications. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05402-0.
format Online
Article
Text
id pubmed-10334630
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-103346302023-07-12 Supervised topological data analysis for MALDI mass spectrometry imaging applications Klaila, Gideon Vutov, Vladimir Stefanou, Anastasios BMC Bioinformatics Research BACKGROUND: Matrix-assisted laser desorption/ionization mass spectrometry imaging (MALDI MSI) displays significant potential for applications in cancer research, especially in tumor typing and subtyping. Lung cancer is the primary cause of tumor-related deaths, where the most lethal entities are adenocarcinoma (ADC) and squamous cell carcinoma (SqCC). Distinguishing between these two common subtypes is crucial for therapy decisions and successful patient management. RESULTS: We propose a new algebraic topological framework, which obtains intrinsic information from MALDI data and transforms it to reflect topological persistence. Our framework offers two main advantages. Firstly, topological persistence aids in distinguishing the signal from noise. Secondly, it compresses the MALDI data, saving storage space and optimizes computational time for subsequent classification tasks. We present an algorithm that efficiently implements our topological framework, relying on a single tuning parameter. Afterwards, logistic regression and random forest classifiers are employed on the extracted persistence features, thereby accomplishing an automated tumor (sub-)typing process. To demonstrate the competitiveness of our proposed framework, we conduct experiments on a real-world MALDI dataset using cross-validation. Furthermore, we showcase the effectiveness of the single denoising parameter by evaluating its performance on synthetic MALDI images with varying levels of noise. CONCLUSION: Our empirical experiments demonstrate that the proposed algebraic topological framework successfully captures and leverages the intrinsic spectral information from MALDI data, leading to competitive results in classifying lung cancer subtypes. Moreover, the framework’s ability to be fine-tuned for denoising highlights its versatility and potential for enhancing data analysis in MALDI applications. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05402-0. BioMed Central 2023-07-10 /pmc/articles/PMC10334630/ /pubmed/37430224 http://dx.doi.org/10.1186/s12859-023-05402-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Klaila, Gideon
Vutov, Vladimir
Stefanou, Anastasios
Supervised topological data analysis for MALDI mass spectrometry imaging applications
title Supervised topological data analysis for MALDI mass spectrometry imaging applications
title_full Supervised topological data analysis for MALDI mass spectrometry imaging applications
title_fullStr Supervised topological data analysis for MALDI mass spectrometry imaging applications
title_full_unstemmed Supervised topological data analysis for MALDI mass spectrometry imaging applications
title_short Supervised topological data analysis for MALDI mass spectrometry imaging applications
title_sort supervised topological data analysis for maldi mass spectrometry imaging applications
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10334630/
https://www.ncbi.nlm.nih.gov/pubmed/37430224
http://dx.doi.org/10.1186/s12859-023-05402-0
work_keys_str_mv AT klailagideon supervisedtopologicaldataanalysisformaldimassspectrometryimagingapplications
AT vutovvladimir supervisedtopologicaldataanalysisformaldimassspectrometryimagingapplications
AT stefanouanastasios supervisedtopologicaldataanalysisformaldimassspectrometryimagingapplications