Cargando…

A Data Set of 255,000 Randomly Selected and Manually Classified Extracted Ion Chromatograms for Evaluation of Peak Detection Methods

Non-targeted mass spectrometry (MS) has become an important method over recent years in the fields of metabolomics and environmental research. While more and more algorithms and workflows become available to process a large number of non-targeted data sets, there still exist few manually evaluated u...

Descripción completa

Detalles Bibliográficos
Autores principales:	Müller, Erik, Huber, Carolin, Beckers, Liza-Marie, Brack, Werner, Krauss, Martin, Schulze, Tobias
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2020
Materias:	Data Descriptor
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7240950/ https://www.ncbi.nlm.nih.gov/pubmed/32331455 http://dx.doi.org/10.3390/metabo10040162

_version_	1783536992341983232
author	Müller, Erik Huber, Carolin Beckers, Liza-Marie Brack, Werner Krauss, Martin Schulze, Tobias
author_facet	Müller, Erik Huber, Carolin Beckers, Liza-Marie Brack, Werner Krauss, Martin Schulze, Tobias
author_sort	Müller, Erik
collection	PubMed
description	Non-targeted mass spectrometry (MS) has become an important method over recent years in the fields of metabolomics and environmental research. While more and more algorithms and workflows become available to process a large number of non-targeted data sets, there still exist few manually evaluated universal test data sets for refining and evaluating these methods. The first step of non-targeted screening, peak detection and refinement of it is arguably the most important step for non-targeted screening. However, the absence of a model data set makes it harder for researchers to evaluate peak detection methods. In this Data Descriptor, we provide a manually checked data set consisting of 255,000 EICs (5000 peaks randomly sampled from across 51 samples) for the evaluation on peak detection and gap-filling algorithms. The data set was created from a previous real-world study, of which a subset was used to extract and manually classify ion chromatograms by three mass spectrometry experts. The data set consists of the converted mass spectrometry files, intermediate processing files and the central file containing a table with all important information for the classified peaks.
format	Online Article Text
id	pubmed-7240950
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-72409502020-06-11 A Data Set of 255,000 Randomly Selected and Manually Classified Extracted Ion Chromatograms for Evaluation of Peak Detection Methods Müller, Erik Huber, Carolin Beckers, Liza-Marie Brack, Werner Krauss, Martin Schulze, Tobias Metabolites Data Descriptor Non-targeted mass spectrometry (MS) has become an important method over recent years in the fields of metabolomics and environmental research. While more and more algorithms and workflows become available to process a large number of non-targeted data sets, there still exist few manually evaluated universal test data sets for refining and evaluating these methods. The first step of non-targeted screening, peak detection and refinement of it is arguably the most important step for non-targeted screening. However, the absence of a model data set makes it harder for researchers to evaluate peak detection methods. In this Data Descriptor, we provide a manually checked data set consisting of 255,000 EICs (5000 peaks randomly sampled from across 51 samples) for the evaluation on peak detection and gap-filling algorithms. The data set was created from a previous real-world study, of which a subset was used to extract and manually classify ion chromatograms by three mass spectrometry experts. The data set consists of the converted mass spectrometry files, intermediate processing files and the central file containing a table with all important information for the classified peaks. MDPI 2020-04-22 /pmc/articles/PMC7240950/ /pubmed/32331455 http://dx.doi.org/10.3390/metabo10040162 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Data Descriptor Müller, Erik Huber, Carolin Beckers, Liza-Marie Brack, Werner Krauss, Martin Schulze, Tobias A Data Set of 255,000 Randomly Selected and Manually Classified Extracted Ion Chromatograms for Evaluation of Peak Detection Methods
title	A Data Set of 255,000 Randomly Selected and Manually Classified Extracted Ion Chromatograms for Evaluation of Peak Detection Methods
title_full	A Data Set of 255,000 Randomly Selected and Manually Classified Extracted Ion Chromatograms for Evaluation of Peak Detection Methods
title_fullStr	A Data Set of 255,000 Randomly Selected and Manually Classified Extracted Ion Chromatograms for Evaluation of Peak Detection Methods
title_full_unstemmed	A Data Set of 255,000 Randomly Selected and Manually Classified Extracted Ion Chromatograms for Evaluation of Peak Detection Methods
title_short	A Data Set of 255,000 Randomly Selected and Manually Classified Extracted Ion Chromatograms for Evaluation of Peak Detection Methods
title_sort	data set of 255,000 randomly selected and manually classified extracted ion chromatograms for evaluation of peak detection methods
topic	Data Descriptor
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7240950/ https://www.ncbi.nlm.nih.gov/pubmed/32331455 http://dx.doi.org/10.3390/metabo10040162
work_keys_str_mv	AT mullererik adatasetof255000randomlyselectedandmanuallyclassifiedextractedionchromatogramsforevaluationofpeakdetectionmethods AT hubercarolin adatasetof255000randomlyselectedandmanuallyclassifiedextractedionchromatogramsforevaluationofpeakdetectionmethods AT beckerslizamarie adatasetof255000randomlyselectedandmanuallyclassifiedextractedionchromatogramsforevaluationofpeakdetectionmethods AT brackwerner adatasetof255000randomlyselectedandmanuallyclassifiedextractedionchromatogramsforevaluationofpeakdetectionmethods AT kraussmartin adatasetof255000randomlyselectedandmanuallyclassifiedextractedionchromatogramsforevaluationofpeakdetectionmethods AT schulzetobias adatasetof255000randomlyselectedandmanuallyclassifiedextractedionchromatogramsforevaluationofpeakdetectionmethods AT mullererik datasetof255000randomlyselectedandmanuallyclassifiedextractedionchromatogramsforevaluationofpeakdetectionmethods AT hubercarolin datasetof255000randomlyselectedandmanuallyclassifiedextractedionchromatogramsforevaluationofpeakdetectionmethods AT beckerslizamarie datasetof255000randomlyselectedandmanuallyclassifiedextractedionchromatogramsforevaluationofpeakdetectionmethods AT brackwerner datasetof255000randomlyselectedandmanuallyclassifiedextractedionchromatogramsforevaluationofpeakdetectionmethods AT kraussmartin datasetof255000randomlyselectedandmanuallyclassifiedextractedionchromatogramsforevaluationofpeakdetectionmethods AT schulzetobias datasetof255000randomlyselectedandmanuallyclassifiedextractedionchromatogramsforevaluationofpeakdetectionmethods

A Data Set of 255,000 Randomly Selected and Manually Classified Extracted Ion Chromatograms for Evaluation of Peak Detection Methods

Ejemplares similares