Cargando…

Extended Similarity Methods for Efficient Data Mining in Imaging Mass Spectrometry

Imaging mass spectrometry is a label-free imaging modality that allows for the spatial mapping of many compounds directly in tissues. In an imaging mass spectrometry experiment, a raster of the tissue surface produces a mass spectrum at each sampled [Formula: see text] , [Formula: see text] position...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ellin, Nicholas R., Miranda-Quintana, Ramón Alain, Prentice, Boone M.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Cold Spring Harbor Laboratory 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10402165/ https://www.ncbi.nlm.nih.gov/pubmed/37546817 http://dx.doi.org/10.1101/2023.07.27.550838

_version_	1785084812550209536
author	Ellin, Nicholas R. Miranda-Quintana, Ramón Alain Prentice, Boone M.
author_facet	Ellin, Nicholas R. Miranda-Quintana, Ramón Alain Prentice, Boone M.
author_sort	Ellin, Nicholas R.
collection	PubMed
description	Imaging mass spectrometry is a label-free imaging modality that allows for the spatial mapping of many compounds directly in tissues. In an imaging mass spectrometry experiment, a raster of the tissue surface produces a mass spectrum at each sampled [Formula: see text] , [Formula: see text] position, resulting in thousands of individual mass spectra, each comprising a pixel in the resulting ion images. However, efficient analysis of imaging mass spectrometry datasets can be challenging due to the hyperspectral characteristics of the data. Each spectrum contains several thousand unique compounds at discrete m/z values that result in unique ion images, which demands robust and efficient algorithms for searching, statistical analysis, and visualization. Some traditional post-processing techniques are fundamentally ill-equipped to dissect these types of data. For example, while principal component analysis (PCA) has long served as a useful tool for mining imaging mass spectrometry datasets to identify correlated analytes and biological regions of interest, the interpretation of the PCA scores and loadings can be non-trivial. The loadings often containing negative peaks in the PCA-derived pseudo-spectra, which are difficult to ascribe to underlying tissue biology. Herein, we have utilized extended similarity indices to streamline the interpretation of imaging mass spectrometry data. This novel workflow uses PCA as a pixel-selection method to parse out the most and least correlated pixels, which are then compared using the extended similarity indices. The extended similarity indices complement PCA by removing all non-physical artifacts and streamlining the interpretation of large volumes of IMS spectra simultaneously. The linear complexity, [Formula: see text] , of these indices suggests that large imaging mass spectrometry datasets can be analyzed in a 1:1 scale of time and space with respect to the size of the input data. The extended similarity indices algorithmic workflow is exemplified here by identifying discrete biological regions of mouse brain tissue.
format	Online Article Text
id	pubmed-10402165
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Cold Spring Harbor Laboratory
record_format	MEDLINE/PubMed
spelling	pubmed-104021652023-08-05 Extended Similarity Methods for Efficient Data Mining in Imaging Mass Spectrometry Ellin, Nicholas R. Miranda-Quintana, Ramón Alain Prentice, Boone M. bioRxiv Article Imaging mass spectrometry is a label-free imaging modality that allows for the spatial mapping of many compounds directly in tissues. In an imaging mass spectrometry experiment, a raster of the tissue surface produces a mass spectrum at each sampled [Formula: see text] , [Formula: see text] position, resulting in thousands of individual mass spectra, each comprising a pixel in the resulting ion images. However, efficient analysis of imaging mass spectrometry datasets can be challenging due to the hyperspectral characteristics of the data. Each spectrum contains several thousand unique compounds at discrete m/z values that result in unique ion images, which demands robust and efficient algorithms for searching, statistical analysis, and visualization. Some traditional post-processing techniques are fundamentally ill-equipped to dissect these types of data. For example, while principal component analysis (PCA) has long served as a useful tool for mining imaging mass spectrometry datasets to identify correlated analytes and biological regions of interest, the interpretation of the PCA scores and loadings can be non-trivial. The loadings often containing negative peaks in the PCA-derived pseudo-spectra, which are difficult to ascribe to underlying tissue biology. Herein, we have utilized extended similarity indices to streamline the interpretation of imaging mass spectrometry data. This novel workflow uses PCA as a pixel-selection method to parse out the most and least correlated pixels, which are then compared using the extended similarity indices. The extended similarity indices complement PCA by removing all non-physical artifacts and streamlining the interpretation of large volumes of IMS spectra simultaneously. The linear complexity, [Formula: see text] , of these indices suggests that large imaging mass spectrometry datasets can be analyzed in a 1:1 scale of time and space with respect to the size of the input data. The extended similarity indices algorithmic workflow is exemplified here by identifying discrete biological regions of mouse brain tissue. Cold Spring Harbor Laboratory 2023-07-30 /pmc/articles/PMC10402165/ /pubmed/37546817 http://dx.doi.org/10.1101/2023.07.27.550838 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle	Article Ellin, Nicholas R. Miranda-Quintana, Ramón Alain Prentice, Boone M. Extended Similarity Methods for Efficient Data Mining in Imaging Mass Spectrometry
title	Extended Similarity Methods for Efficient Data Mining in Imaging Mass Spectrometry
title_full	Extended Similarity Methods for Efficient Data Mining in Imaging Mass Spectrometry
title_fullStr	Extended Similarity Methods for Efficient Data Mining in Imaging Mass Spectrometry
title_full_unstemmed	Extended Similarity Methods for Efficient Data Mining in Imaging Mass Spectrometry
title_short	Extended Similarity Methods for Efficient Data Mining in Imaging Mass Spectrometry
title_sort	extended similarity methods for efficient data mining in imaging mass spectrometry
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10402165/ https://www.ncbi.nlm.nih.gov/pubmed/37546817 http://dx.doi.org/10.1101/2023.07.27.550838
work_keys_str_mv	AT ellinnicholasr extendedsimilaritymethodsforefficientdatamininginimagingmassspectrometry AT mirandaquintanaramonalain extendedsimilaritymethodsforefficientdatamininginimagingmassspectrometry AT prenticeboonem extendedsimilaritymethodsforefficientdatamininginimagingmassspectrometry

Extended Similarity Methods for Efficient Data Mining in Imaging Mass Spectrometry

Ejemplares similares