Cargando…

Automated fragment formula annotation for electron ionisation, high resolution mass spectrometry: application to atmospheric measurements of halocarbons

BACKGROUND: Non-target screening consists in searching a sample for all present substances, suspected or unknown, with very little prior knowledge about the sample. This approach has been introduced more than a decade ago in the field of water analysis, together with dedicated compound identificatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Guillevic, Myriam, Guillevic, Aurore, Vollmer, Martin K., Schlauri, Paul, Hill, Matthias, Emmenegger, Lukas, Reimann, Stefan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8491408/
https://www.ncbi.nlm.nih.gov/pubmed/34607604
http://dx.doi.org/10.1186/s13321-021-00544-w
_version_ 1784578736636559360
author Guillevic, Myriam
Guillevic, Aurore
Vollmer, Martin K.
Schlauri, Paul
Hill, Matthias
Emmenegger, Lukas
Reimann, Stefan
author_facet Guillevic, Myriam
Guillevic, Aurore
Vollmer, Martin K.
Schlauri, Paul
Hill, Matthias
Emmenegger, Lukas
Reimann, Stefan
author_sort Guillevic, Myriam
collection PubMed
description BACKGROUND: Non-target screening consists in searching a sample for all present substances, suspected or unknown, with very little prior knowledge about the sample. This approach has been introduced more than a decade ago in the field of water analysis, together with dedicated compound identification tools, but is still very scarce for indoor and atmospheric trace gas measurements, despite the clear need for a better understanding of the atmospheric trace gas composition. For a systematic detection of emerging trace gases in the atmosphere, a new and powerful analytical method is gas chromatography (GC) of preconcentrated samples, followed by electron ionisation, high resolution mass spectrometry (EI-HRMS). In this work, we present data analysis tools to enable automated fragment formula annotation for unknown compounds measured by GC-EI-HRMS. RESULTS: Based on co-eluting mass/charge fragments, we developed an innovative data analysis method to reliably reconstruct the chemical formulae of the fragments, using efficient combinatorics and graph theory. The method does not require the presence of the molecular ion, which is absent in [Formula: see text] 40% of EI spectra. Our method has been trained and validated on >50 halocarbons and hydrocarbons, with 3–20 atoms and molar masses of 30–330 g mol[Formula: see text], measured with a mass resolution of approx. 3500. For >90% of the compounds, more than 90% of the annotated fragment formulae are correct. Cases of wrong identification can be attributed to the scarcity of detected fragments per compound or the lack of isotopic constraint (no minor isotopocule detected). CONCLUSIONS: Our method enables to reconstruct most probable chemical formulae independently from spectral databases. Therefore, it demonstrates the suitability of EI-HRMS data for non-target analysis and paves the way for the identification of substances for which no EI mass spectrum is registered in databases. We illustrate the performances of our method for atmospheric trace gases and suggest that it may be well suited for many other types of samples. The L-GPL licenced Python code is released under the name ALPINAC for ALgorithmic Process for Identification of Non-targeted Atmospheric Compounds. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-021-00544-w.
format Online
Article
Text
id pubmed-8491408
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-84914082021-10-05 Automated fragment formula annotation for electron ionisation, high resolution mass spectrometry: application to atmospheric measurements of halocarbons Guillevic, Myriam Guillevic, Aurore Vollmer, Martin K. Schlauri, Paul Hill, Matthias Emmenegger, Lukas Reimann, Stefan J Cheminform Research Article BACKGROUND: Non-target screening consists in searching a sample for all present substances, suspected or unknown, with very little prior knowledge about the sample. This approach has been introduced more than a decade ago in the field of water analysis, together with dedicated compound identification tools, but is still very scarce for indoor and atmospheric trace gas measurements, despite the clear need for a better understanding of the atmospheric trace gas composition. For a systematic detection of emerging trace gases in the atmosphere, a new and powerful analytical method is gas chromatography (GC) of preconcentrated samples, followed by electron ionisation, high resolution mass spectrometry (EI-HRMS). In this work, we present data analysis tools to enable automated fragment formula annotation for unknown compounds measured by GC-EI-HRMS. RESULTS: Based on co-eluting mass/charge fragments, we developed an innovative data analysis method to reliably reconstruct the chemical formulae of the fragments, using efficient combinatorics and graph theory. The method does not require the presence of the molecular ion, which is absent in [Formula: see text] 40% of EI spectra. Our method has been trained and validated on >50 halocarbons and hydrocarbons, with 3–20 atoms and molar masses of 30–330 g mol[Formula: see text], measured with a mass resolution of approx. 3500. For >90% of the compounds, more than 90% of the annotated fragment formulae are correct. Cases of wrong identification can be attributed to the scarcity of detected fragments per compound or the lack of isotopic constraint (no minor isotopocule detected). CONCLUSIONS: Our method enables to reconstruct most probable chemical formulae independently from spectral databases. Therefore, it demonstrates the suitability of EI-HRMS data for non-target analysis and paves the way for the identification of substances for which no EI mass spectrum is registered in databases. We illustrate the performances of our method for atmospheric trace gases and suggest that it may be well suited for many other types of samples. The L-GPL licenced Python code is released under the name ALPINAC for ALgorithmic Process for Identification of Non-targeted Atmospheric Compounds. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-021-00544-w. Springer International Publishing 2021-10-04 /pmc/articles/PMC8491408/ /pubmed/34607604 http://dx.doi.org/10.1186/s13321-021-00544-w Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Guillevic, Myriam
Guillevic, Aurore
Vollmer, Martin K.
Schlauri, Paul
Hill, Matthias
Emmenegger, Lukas
Reimann, Stefan
Automated fragment formula annotation for electron ionisation, high resolution mass spectrometry: application to atmospheric measurements of halocarbons
title Automated fragment formula annotation for electron ionisation, high resolution mass spectrometry: application to atmospheric measurements of halocarbons
title_full Automated fragment formula annotation for electron ionisation, high resolution mass spectrometry: application to atmospheric measurements of halocarbons
title_fullStr Automated fragment formula annotation for electron ionisation, high resolution mass spectrometry: application to atmospheric measurements of halocarbons
title_full_unstemmed Automated fragment formula annotation for electron ionisation, high resolution mass spectrometry: application to atmospheric measurements of halocarbons
title_short Automated fragment formula annotation for electron ionisation, high resolution mass spectrometry: application to atmospheric measurements of halocarbons
title_sort automated fragment formula annotation for electron ionisation, high resolution mass spectrometry: application to atmospheric measurements of halocarbons
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8491408/
https://www.ncbi.nlm.nih.gov/pubmed/34607604
http://dx.doi.org/10.1186/s13321-021-00544-w
work_keys_str_mv AT guillevicmyriam automatedfragmentformulaannotationforelectronionisationhighresolutionmassspectrometryapplicationtoatmosphericmeasurementsofhalocarbons
AT guillevicaurore automatedfragmentformulaannotationforelectronionisationhighresolutionmassspectrometryapplicationtoatmosphericmeasurementsofhalocarbons
AT vollmermartink automatedfragmentformulaannotationforelectronionisationhighresolutionmassspectrometryapplicationtoatmosphericmeasurementsofhalocarbons
AT schlauripaul automatedfragmentformulaannotationforelectronionisationhighresolutionmassspectrometryapplicationtoatmosphericmeasurementsofhalocarbons
AT hillmatthias automatedfragmentformulaannotationforelectronionisationhighresolutionmassspectrometryapplicationtoatmosphericmeasurementsofhalocarbons
AT emmeneggerlukas automatedfragmentformulaannotationforelectronionisationhighresolutionmassspectrometryapplicationtoatmosphericmeasurementsofhalocarbons
AT reimannstefan automatedfragmentformulaannotationforelectronionisationhighresolutionmassspectrometryapplicationtoatmosphericmeasurementsofhalocarbons