Cargando…

Incorporating peak grouping information for alignment of multiple liquid chromatography-mass spectrometry datasets

Motivation: The combination of liquid chromatography and mass spectrometry (LC/MS) has been widely used for large-scale comparative studies in systems biology, including proteomics, glycomics and metabolomics. In almost all experimental design, it is necessary to compare chromatograms across biologi...

Descripción completa

Detalles Bibliográficos
Autores principales: Wandy, Joe, Daly, Rónán, Breitling, Rainer, Rogers, Simon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4760236/
https://www.ncbi.nlm.nih.gov/pubmed/25649621
http://dx.doi.org/10.1093/bioinformatics/btv072
_version_ 1782416854253305856
author Wandy, Joe
Daly, Rónán
Breitling, Rainer
Rogers, Simon
author_facet Wandy, Joe
Daly, Rónán
Breitling, Rainer
Rogers, Simon
author_sort Wandy, Joe
collection PubMed
description Motivation: The combination of liquid chromatography and mass spectrometry (LC/MS) has been widely used for large-scale comparative studies in systems biology, including proteomics, glycomics and metabolomics. In almost all experimental design, it is necessary to compare chromatograms across biological or technical replicates and across sample groups. Central to this is the peak alignment step, which is one of the most important but challenging preprocessing steps. Existing alignment tools do not take into account the structural dependencies between related peaks that coelute and are derived from the same metabolite or peptide. We propose a direct matching peak alignment method for LC/MS data that incorporates related peaks information (within each LC/MS run) and investigate its effect on alignment performance (across runs). The groupings of related peaks necessary for our method can be obtained from any peak clustering method and are built into a pair-wise peak similarity score function. The similarity score matrix produced is used by an approximation algorithm for the weighted matching problem to produce the actual alignment result. Results: We demonstrate that related peak information can improve alignment performance. The performance is evaluated on a set of benchmark datasets, where our method performs competitively compared to other popular alignment tools. Availability: The proposed alignment method has been implemented as a stand-alone application in Python, available for download at http://github.com/joewandy/peak-grouping-alignment. Contact: Simon.Rogers@glasgow.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4760236
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-47602362016-02-19 Incorporating peak grouping information for alignment of multiple liquid chromatography-mass spectrometry datasets Wandy, Joe Daly, Rónán Breitling, Rainer Rogers, Simon Bioinformatics Original Papers Motivation: The combination of liquid chromatography and mass spectrometry (LC/MS) has been widely used for large-scale comparative studies in systems biology, including proteomics, glycomics and metabolomics. In almost all experimental design, it is necessary to compare chromatograms across biological or technical replicates and across sample groups. Central to this is the peak alignment step, which is one of the most important but challenging preprocessing steps. Existing alignment tools do not take into account the structural dependencies between related peaks that coelute and are derived from the same metabolite or peptide. We propose a direct matching peak alignment method for LC/MS data that incorporates related peaks information (within each LC/MS run) and investigate its effect on alignment performance (across runs). The groupings of related peaks necessary for our method can be obtained from any peak clustering method and are built into a pair-wise peak similarity score function. The similarity score matrix produced is used by an approximation algorithm for the weighted matching problem to produce the actual alignment result. Results: We demonstrate that related peak information can improve alignment performance. The performance is evaluated on a set of benchmark datasets, where our method performs competitively compared to other popular alignment tools. Availability: The proposed alignment method has been implemented as a stand-alone application in Python, available for download at http://github.com/joewandy/peak-grouping-alignment. Contact: Simon.Rogers@glasgow.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2015-06-15 2015-02-02 /pmc/articles/PMC4760236/ /pubmed/25649621 http://dx.doi.org/10.1093/bioinformatics/btv072 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Wandy, Joe
Daly, Rónán
Breitling, Rainer
Rogers, Simon
Incorporating peak grouping information for alignment of multiple liquid chromatography-mass spectrometry datasets
title Incorporating peak grouping information for alignment of multiple liquid chromatography-mass spectrometry datasets
title_full Incorporating peak grouping information for alignment of multiple liquid chromatography-mass spectrometry datasets
title_fullStr Incorporating peak grouping information for alignment of multiple liquid chromatography-mass spectrometry datasets
title_full_unstemmed Incorporating peak grouping information for alignment of multiple liquid chromatography-mass spectrometry datasets
title_short Incorporating peak grouping information for alignment of multiple liquid chromatography-mass spectrometry datasets
title_sort incorporating peak grouping information for alignment of multiple liquid chromatography-mass spectrometry datasets
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4760236/
https://www.ncbi.nlm.nih.gov/pubmed/25649621
http://dx.doi.org/10.1093/bioinformatics/btv072
work_keys_str_mv AT wandyjoe incorporatingpeakgroupinginformationforalignmentofmultipleliquidchromatographymassspectrometrydatasets
AT dalyronan incorporatingpeakgroupinginformationforalignmentofmultipleliquidchromatographymassspectrometrydatasets
AT breitlingrainer incorporatingpeakgroupinginformationforalignmentofmultipleliquidchromatographymassspectrometrydatasets
AT rogerssimon incorporatingpeakgroupinginformationforalignmentofmultipleliquidchromatographymassspectrometrydatasets