Cargando…

Fast, axis-agnostic, dynamically summarized storage and retrieval for mass spectrometry data

Mass spectrometry, a popular technique for elucidating the molecular contents of experimental samples, creates data sets comprised of millions of three-dimensional (m/z, retention time, intensity) data points that correspond to the types and quantities of analyzed molecules. Open and commercial MS d...

Descripción completa

Detalles Bibliográficos
Autores principales: Handy, Kyle, Rosen, Jebediah, Gillan, André, Smith, Rob
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5687738/
https://www.ncbi.nlm.nih.gov/pubmed/29141005
http://dx.doi.org/10.1371/journal.pone.0188059
_version_ 1783279022061387776
author Handy, Kyle
Rosen, Jebediah
Gillan, André
Smith, Rob
author_facet Handy, Kyle
Rosen, Jebediah
Gillan, André
Smith, Rob
author_sort Handy, Kyle
collection PubMed
description Mass spectrometry, a popular technique for elucidating the molecular contents of experimental samples, creates data sets comprised of millions of three-dimensional (m/z, retention time, intensity) data points that correspond to the types and quantities of analyzed molecules. Open and commercial MS data formats are arranged by retention time, creating latency when accessing data across multiple m/z. Existing MS storage and retrieval methods have been developed to overcome the limitations of retention time-based data formats, but do not provide certain features such as dynamic summarization and storage and retrieval of point meta-data (such as signal cluster membership), precluding efficient viewing applications and certain data-processing approaches. This manuscript describes MzTree, a spatial database designed to provide real-time storage and retrieval of dynamically summarized standard and augmented MS data with fast performance in both m/z and RT directions. Performance is reported on real data with comparisons against related published retrieval systems.
format Online
Article
Text
id pubmed-5687738
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-56877382017-11-30 Fast, axis-agnostic, dynamically summarized storage and retrieval for mass spectrometry data Handy, Kyle Rosen, Jebediah Gillan, André Smith, Rob PLoS One Research Article Mass spectrometry, a popular technique for elucidating the molecular contents of experimental samples, creates data sets comprised of millions of three-dimensional (m/z, retention time, intensity) data points that correspond to the types and quantities of analyzed molecules. Open and commercial MS data formats are arranged by retention time, creating latency when accessing data across multiple m/z. Existing MS storage and retrieval methods have been developed to overcome the limitations of retention time-based data formats, but do not provide certain features such as dynamic summarization and storage and retrieval of point meta-data (such as signal cluster membership), precluding efficient viewing applications and certain data-processing approaches. This manuscript describes MzTree, a spatial database designed to provide real-time storage and retrieval of dynamically summarized standard and augmented MS data with fast performance in both m/z and RT directions. Performance is reported on real data with comparisons against related published retrieval systems. Public Library of Science 2017-11-15 /pmc/articles/PMC5687738/ /pubmed/29141005 http://dx.doi.org/10.1371/journal.pone.0188059 Text en © 2017 Handy et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Handy, Kyle
Rosen, Jebediah
Gillan, André
Smith, Rob
Fast, axis-agnostic, dynamically summarized storage and retrieval for mass spectrometry data
title Fast, axis-agnostic, dynamically summarized storage and retrieval for mass spectrometry data
title_full Fast, axis-agnostic, dynamically summarized storage and retrieval for mass spectrometry data
title_fullStr Fast, axis-agnostic, dynamically summarized storage and retrieval for mass spectrometry data
title_full_unstemmed Fast, axis-agnostic, dynamically summarized storage and retrieval for mass spectrometry data
title_short Fast, axis-agnostic, dynamically summarized storage and retrieval for mass spectrometry data
title_sort fast, axis-agnostic, dynamically summarized storage and retrieval for mass spectrometry data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5687738/
https://www.ncbi.nlm.nih.gov/pubmed/29141005
http://dx.doi.org/10.1371/journal.pone.0188059
work_keys_str_mv AT handykyle fastaxisagnosticdynamicallysummarizedstorageandretrievalformassspectrometrydata
AT rosenjebediah fastaxisagnosticdynamicallysummarizedstorageandretrievalformassspectrometrydata
AT gillanandre fastaxisagnosticdynamicallysummarizedstorageandretrievalformassspectrometrydata
AT smithrob fastaxisagnosticdynamicallysummarizedstorageandretrievalformassspectrometrydata