Cargando…

LC-MSsim – a simulation software for liquid chromatography mass spectrometry data

BACKGROUND: Mass Spectrometry coupled to Liquid Chromatography (LC-MS) is commonly used to analyze the protein content of biological samples in large scale studies. The data resulting from an LC-MS experiment is huge, highly complex and noisy. Accordingly, it has sparked new developments in Bioinfor...

Descripción completa

Detalles Bibliográficos
Autores principales: Schulz-Trieglaff, Ole, Pfeifer, Nico, Gröpl, Clemens, Kohlbacher, Oliver, Reinert, Knut
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2577660/
https://www.ncbi.nlm.nih.gov/pubmed/18842122
http://dx.doi.org/10.1186/1471-2105-9-423
_version_ 1782160499121586176
author Schulz-Trieglaff, Ole
Pfeifer, Nico
Gröpl, Clemens
Kohlbacher, Oliver
Reinert, Knut
author_facet Schulz-Trieglaff, Ole
Pfeifer, Nico
Gröpl, Clemens
Kohlbacher, Oliver
Reinert, Knut
author_sort Schulz-Trieglaff, Ole
collection PubMed
description BACKGROUND: Mass Spectrometry coupled to Liquid Chromatography (LC-MS) is commonly used to analyze the protein content of biological samples in large scale studies. The data resulting from an LC-MS experiment is huge, highly complex and noisy. Accordingly, it has sparked new developments in Bioinformatics, especially in the fields of algorithm development, statistics and software engineering. In a quantitative label-free mass spectrometry experiment, crucial steps are the detection of peptide features in the mass spectra and the alignment of samples by correcting for shifts in retention time. At the moment, it is difficult to compare the plethora of algorithms for these tasks. So far, curated benchmark data exists only for peptide identification algorithms but no data that represents a ground truth for the evaluation of feature detection, alignment and filtering algorithms. RESULTS: We present LC-MSsim, a simulation software for LC-ESI-MS experiments. It simulates ESI spectra on the MS level. It reads a list of proteins from a FASTA file and digests the protein mixture using a user-defined enzyme. The software creates an LC-MS data set using a predictor for the retention time of the peptides and a model for peak shapes and elution profiles of the mass spectral peaks. Our software also offers the possibility to add contaminants, to change the background noise level and includes a model for the detectability of peptides in mass spectra. After the simulation, LC-MSsim writes the simulated data to mzData, a public XML format. The software also stores the positions (monoisotopic m/z and retention time) and ion counts of the simulated ions in separate files. CONCLUSION: LC-MSsim generates simulated LC-MS data sets and incorporates models for peak shapes and contaminations. Algorithm developers can match the results of feature detection and alignment algorithms against the simulated ion lists and meaningful error rates can be computed. We anticipate that LC-MSsim will be useful to the wider community to perform benchmark studies and comparisons between computational tools.
format Text
id pubmed-2577660
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-25776602008-11-04 LC-MSsim – a simulation software for liquid chromatography mass spectrometry data Schulz-Trieglaff, Ole Pfeifer, Nico Gröpl, Clemens Kohlbacher, Oliver Reinert, Knut BMC Bioinformatics Software BACKGROUND: Mass Spectrometry coupled to Liquid Chromatography (LC-MS) is commonly used to analyze the protein content of biological samples in large scale studies. The data resulting from an LC-MS experiment is huge, highly complex and noisy. Accordingly, it has sparked new developments in Bioinformatics, especially in the fields of algorithm development, statistics and software engineering. In a quantitative label-free mass spectrometry experiment, crucial steps are the detection of peptide features in the mass spectra and the alignment of samples by correcting for shifts in retention time. At the moment, it is difficult to compare the plethora of algorithms for these tasks. So far, curated benchmark data exists only for peptide identification algorithms but no data that represents a ground truth for the evaluation of feature detection, alignment and filtering algorithms. RESULTS: We present LC-MSsim, a simulation software for LC-ESI-MS experiments. It simulates ESI spectra on the MS level. It reads a list of proteins from a FASTA file and digests the protein mixture using a user-defined enzyme. The software creates an LC-MS data set using a predictor for the retention time of the peptides and a model for peak shapes and elution profiles of the mass spectral peaks. Our software also offers the possibility to add contaminants, to change the background noise level and includes a model for the detectability of peptides in mass spectra. After the simulation, LC-MSsim writes the simulated data to mzData, a public XML format. The software also stores the positions (monoisotopic m/z and retention time) and ion counts of the simulated ions in separate files. CONCLUSION: LC-MSsim generates simulated LC-MS data sets and incorporates models for peak shapes and contaminations. Algorithm developers can match the results of feature detection and alignment algorithms against the simulated ion lists and meaningful error rates can be computed. We anticipate that LC-MSsim will be useful to the wider community to perform benchmark studies and comparisons between computational tools. BioMed Central 2008-10-08 /pmc/articles/PMC2577660/ /pubmed/18842122 http://dx.doi.org/10.1186/1471-2105-9-423 Text en Copyright © 2008 Schulz-Trieglaff et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Schulz-Trieglaff, Ole
Pfeifer, Nico
Gröpl, Clemens
Kohlbacher, Oliver
Reinert, Knut
LC-MSsim – a simulation software for liquid chromatography mass spectrometry data
title LC-MSsim – a simulation software for liquid chromatography mass spectrometry data
title_full LC-MSsim – a simulation software for liquid chromatography mass spectrometry data
title_fullStr LC-MSsim – a simulation software for liquid chromatography mass spectrometry data
title_full_unstemmed LC-MSsim – a simulation software for liquid chromatography mass spectrometry data
title_short LC-MSsim – a simulation software for liquid chromatography mass spectrometry data
title_sort lc-mssim – a simulation software for liquid chromatography mass spectrometry data
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2577660/
https://www.ncbi.nlm.nih.gov/pubmed/18842122
http://dx.doi.org/10.1186/1471-2105-9-423
work_keys_str_mv AT schulztrieglaffole lcmssimasimulationsoftwareforliquidchromatographymassspectrometrydata
AT pfeifernico lcmssimasimulationsoftwareforliquidchromatographymassspectrometrydata
AT groplclemens lcmssimasimulationsoftwareforliquidchromatographymassspectrometrydata
AT kohlbacheroliver lcmssimasimulationsoftwareforliquidchromatographymassspectrometrydata
AT reinertknut lcmssimasimulationsoftwareforliquidchromatographymassspectrometrydata