Cargando…

Normalizing and Correcting Variable and Complex LC–MS Metabolomic Data with the R Package pseudoDrift

In biological research domains, liquid chromatography–mass spectroscopy (LC-MS) has prevailed as the preferred technique for generating high quality metabolomic data. However, even with advanced instrumentation and established data acquisition protocols, technical errors are still routinely encounte...

Descripción completa

Detalles Bibliográficos
Autores principales: Rodriguez, Jonas, Gomez-Cano, Lina, Grotewold, Erich, de Leon, Natalia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9144304/
https://www.ncbi.nlm.nih.gov/pubmed/35629939
http://dx.doi.org/10.3390/metabo12050435
_version_ 1784716015967404032
author Rodriguez, Jonas
Gomez-Cano, Lina
Grotewold, Erich
de Leon, Natalia
author_facet Rodriguez, Jonas
Gomez-Cano, Lina
Grotewold, Erich
de Leon, Natalia
author_sort Rodriguez, Jonas
collection PubMed
description In biological research domains, liquid chromatography–mass spectroscopy (LC-MS) has prevailed as the preferred technique for generating high quality metabolomic data. However, even with advanced instrumentation and established data acquisition protocols, technical errors are still routinely encountered and can pose a significant challenge to unveiling biologically relevant information. In large-scale studies, signal drift and batch effects are how technical errors are most commonly manifested. We developed pseudoDrift, an R package with capabilities for data simulation and outlier detection, and a new training and testing approach that is implemented to capture and to optionally correct for technical errors in LC–MS metabolomic data. Using data simulation, we demonstrate here that our approach performs equally as well as existing methods and offers increased flexibility to the researcher. As part of our study, we generated a targeted LC–MS dataset that profiled 33 phenolic compounds from seedling stem tissue in 602 genetically diverse non-transgenic maize inbred lines. This dataset provides a unique opportunity to investigate the dynamics of specialized metabolism in plants.
format Online
Article
Text
id pubmed-9144304
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-91443042022-05-29 Normalizing and Correcting Variable and Complex LC–MS Metabolomic Data with the R Package pseudoDrift Rodriguez, Jonas Gomez-Cano, Lina Grotewold, Erich de Leon, Natalia Metabolites Article In biological research domains, liquid chromatography–mass spectroscopy (LC-MS) has prevailed as the preferred technique for generating high quality metabolomic data. However, even with advanced instrumentation and established data acquisition protocols, technical errors are still routinely encountered and can pose a significant challenge to unveiling biologically relevant information. In large-scale studies, signal drift and batch effects are how technical errors are most commonly manifested. We developed pseudoDrift, an R package with capabilities for data simulation and outlier detection, and a new training and testing approach that is implemented to capture and to optionally correct for technical errors in LC–MS metabolomic data. Using data simulation, we demonstrate here that our approach performs equally as well as existing methods and offers increased flexibility to the researcher. As part of our study, we generated a targeted LC–MS dataset that profiled 33 phenolic compounds from seedling stem tissue in 602 genetically diverse non-transgenic maize inbred lines. This dataset provides a unique opportunity to investigate the dynamics of specialized metabolism in plants. MDPI 2022-05-12 /pmc/articles/PMC9144304/ /pubmed/35629939 http://dx.doi.org/10.3390/metabo12050435 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Rodriguez, Jonas
Gomez-Cano, Lina
Grotewold, Erich
de Leon, Natalia
Normalizing and Correcting Variable and Complex LC–MS Metabolomic Data with the R Package pseudoDrift
title Normalizing and Correcting Variable and Complex LC–MS Metabolomic Data with the R Package pseudoDrift
title_full Normalizing and Correcting Variable and Complex LC–MS Metabolomic Data with the R Package pseudoDrift
title_fullStr Normalizing and Correcting Variable and Complex LC–MS Metabolomic Data with the R Package pseudoDrift
title_full_unstemmed Normalizing and Correcting Variable and Complex LC–MS Metabolomic Data with the R Package pseudoDrift
title_short Normalizing and Correcting Variable and Complex LC–MS Metabolomic Data with the R Package pseudoDrift
title_sort normalizing and correcting variable and complex lc–ms metabolomic data with the r package pseudodrift
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9144304/
https://www.ncbi.nlm.nih.gov/pubmed/35629939
http://dx.doi.org/10.3390/metabo12050435
work_keys_str_mv AT rodriguezjonas normalizingandcorrectingvariableandcomplexlcmsmetabolomicdatawiththerpackagepseudodrift
AT gomezcanolina normalizingandcorrectingvariableandcomplexlcmsmetabolomicdatawiththerpackagepseudodrift
AT grotewolderich normalizingandcorrectingvariableandcomplexlcmsmetabolomicdatawiththerpackagepseudodrift
AT deleonnatalia normalizingandcorrectingvariableandcomplexlcmsmetabolomicdatawiththerpackagepseudodrift