Cargando…

Extended Similarity Methods for Efficient Data Mining in Imaging Mass Spectrometry

Imaging mass spectrometry is a label-free imaging modality that allows for the spatial mapping of many compounds directly in tissues. In an imaging mass spectrometry experiment, a raster of the tissue surface produces a mass spectrum at each sampled [Formula: see text] , [Formula: see text] position...

Descripción completa

Detalles Bibliográficos
Autores principales: Ellin, Nicholas R., Miranda-Quintana, Ramón Alain, Prentice, Boone M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10402165/
https://www.ncbi.nlm.nih.gov/pubmed/37546817
http://dx.doi.org/10.1101/2023.07.27.550838
_version_ 1785084812550209536
author Ellin, Nicholas R.
Miranda-Quintana, Ramón Alain
Prentice, Boone M.
author_facet Ellin, Nicholas R.
Miranda-Quintana, Ramón Alain
Prentice, Boone M.
author_sort Ellin, Nicholas R.
collection PubMed
description Imaging mass spectrometry is a label-free imaging modality that allows for the spatial mapping of many compounds directly in tissues. In an imaging mass spectrometry experiment, a raster of the tissue surface produces a mass spectrum at each sampled [Formula: see text] , [Formula: see text] position, resulting in thousands of individual mass spectra, each comprising a pixel in the resulting ion images. However, efficient analysis of imaging mass spectrometry datasets can be challenging due to the hyperspectral characteristics of the data. Each spectrum contains several thousand unique compounds at discrete m/z values that result in unique ion images, which demands robust and efficient algorithms for searching, statistical analysis, and visualization. Some traditional post-processing techniques are fundamentally ill-equipped to dissect these types of data. For example, while principal component analysis (PCA) has long served as a useful tool for mining imaging mass spectrometry datasets to identify correlated analytes and biological regions of interest, the interpretation of the PCA scores and loadings can be non-trivial. The loadings often containing negative peaks in the PCA-derived pseudo-spectra, which are difficult to ascribe to underlying tissue biology. Herein, we have utilized extended similarity indices to streamline the interpretation of imaging mass spectrometry data. This novel workflow uses PCA as a pixel-selection method to parse out the most and least correlated pixels, which are then compared using the extended similarity indices. The extended similarity indices complement PCA by removing all non-physical artifacts and streamlining the interpretation of large volumes of IMS spectra simultaneously. The linear complexity, [Formula: see text] , of these indices suggests that large imaging mass spectrometry datasets can be analyzed in a 1:1 scale of time and space with respect to the size of the input data. The extended similarity indices algorithmic workflow is exemplified here by identifying discrete biological regions of mouse brain tissue.
format Online
Article
Text
id pubmed-10402165
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-104021652023-08-05 Extended Similarity Methods for Efficient Data Mining in Imaging Mass Spectrometry Ellin, Nicholas R. Miranda-Quintana, Ramón Alain Prentice, Boone M. bioRxiv Article Imaging mass spectrometry is a label-free imaging modality that allows for the spatial mapping of many compounds directly in tissues. In an imaging mass spectrometry experiment, a raster of the tissue surface produces a mass spectrum at each sampled [Formula: see text] , [Formula: see text] position, resulting in thousands of individual mass spectra, each comprising a pixel in the resulting ion images. However, efficient analysis of imaging mass spectrometry datasets can be challenging due to the hyperspectral characteristics of the data. Each spectrum contains several thousand unique compounds at discrete m/z values that result in unique ion images, which demands robust and efficient algorithms for searching, statistical analysis, and visualization. Some traditional post-processing techniques are fundamentally ill-equipped to dissect these types of data. For example, while principal component analysis (PCA) has long served as a useful tool for mining imaging mass spectrometry datasets to identify correlated analytes and biological regions of interest, the interpretation of the PCA scores and loadings can be non-trivial. The loadings often containing negative peaks in the PCA-derived pseudo-spectra, which are difficult to ascribe to underlying tissue biology. Herein, we have utilized extended similarity indices to streamline the interpretation of imaging mass spectrometry data. This novel workflow uses PCA as a pixel-selection method to parse out the most and least correlated pixels, which are then compared using the extended similarity indices. The extended similarity indices complement PCA by removing all non-physical artifacts and streamlining the interpretation of large volumes of IMS spectra simultaneously. The linear complexity, [Formula: see text] , of these indices suggests that large imaging mass spectrometry datasets can be analyzed in a 1:1 scale of time and space with respect to the size of the input data. The extended similarity indices algorithmic workflow is exemplified here by identifying discrete biological regions of mouse brain tissue. Cold Spring Harbor Laboratory 2023-07-30 /pmc/articles/PMC10402165/ /pubmed/37546817 http://dx.doi.org/10.1101/2023.07.27.550838 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Ellin, Nicholas R.
Miranda-Quintana, Ramón Alain
Prentice, Boone M.
Extended Similarity Methods for Efficient Data Mining in Imaging Mass Spectrometry
title Extended Similarity Methods for Efficient Data Mining in Imaging Mass Spectrometry
title_full Extended Similarity Methods for Efficient Data Mining in Imaging Mass Spectrometry
title_fullStr Extended Similarity Methods for Efficient Data Mining in Imaging Mass Spectrometry
title_full_unstemmed Extended Similarity Methods for Efficient Data Mining in Imaging Mass Spectrometry
title_short Extended Similarity Methods for Efficient Data Mining in Imaging Mass Spectrometry
title_sort extended similarity methods for efficient data mining in imaging mass spectrometry
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10402165/
https://www.ncbi.nlm.nih.gov/pubmed/37546817
http://dx.doi.org/10.1101/2023.07.27.550838
work_keys_str_mv AT ellinnicholasr extendedsimilaritymethodsforefficientdatamininginimagingmassspectrometry
AT mirandaquintanaramonalain extendedsimilaritymethodsforefficientdatamininginimagingmassspectrometry
AT prenticeboonem extendedsimilaritymethodsforefficientdatamininginimagingmassspectrometry