Cargando…

Determining soil particle-size distribution from infrared spectra using machine learning predictions: Methodology and modeling

Accuracy of infrared (IR) models to measure soil particle-size distribution (PSD) depends on soil preparation, methodology (sedimentation, laser), settling times and relevant soil features. Compositional soil data may require log ratio (ilr) transformation to avoid numerical biases. Machine learning...

Descripción completa

Detalles Bibliográficos
Autores principales: Parent, Elizabeth Jeanne, Parent, Serge-Étienne, Parent, Léon Etienne
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8291647/
https://www.ncbi.nlm.nih.gov/pubmed/34283823
http://dx.doi.org/10.1371/journal.pone.0233242
_version_ 1783724681950396416
author Parent, Elizabeth Jeanne
Parent, Serge-Étienne
Parent, Léon Etienne
author_facet Parent, Elizabeth Jeanne
Parent, Serge-Étienne
Parent, Léon Etienne
author_sort Parent, Elizabeth Jeanne
collection PubMed
description Accuracy of infrared (IR) models to measure soil particle-size distribution (PSD) depends on soil preparation, methodology (sedimentation, laser), settling times and relevant soil features. Compositional soil data may require log ratio (ilr) transformation to avoid numerical biases. Machine learning can relate numerous independent variables that may impact on NIR spectra to assess particle-size distribution. Our objective was to reach high IRS prediction accuracy across a large range of PSD methods and soil properties. A total of 1298 soil samples from eastern Canada were IR-scanned. Spectra were processed by Stochastic Gradient Boosting (SGB) to predict sand, silt, clay and carbon. Slope and intercept of the log-log relationships between settling time and suspension density function (SDF) (R(2) = 0.84–0.92) performed similarly to NIR spectra using either ilr-transformed (R(2) = 0.81–0.93) or raw percentages (R(2) = 0.76–0.94). Settling times of 0.67-min and 2-h were the most accurate for NIR predictions (R(2) = 0.49–0.79). The NIR prediction of sand sieving method (R(2) = 0.66) was more accurate than sedimentation method(R(2) = 0.53). The NIR 2X gain was less accurate (R(2) = 0.69–0.92) than 4X (R(2) = 0.87–0.95). The MIR (R(2) = 0.45–0.80) performed better than NIR (R(2) = 0.40–0.71) spectra. Adding soil carbon, reconstituted bulk density, pH, red-green-blue color, oxalate and Mehlich3 extracts returned R(2) value of 0.86–0.91 for texture prediction. In addition to slope and intercept of the SDF, 4X gain, method and pre-treatment classes, soil carbon and color appeared to be promising features for routine SGB-processed NIR particle-size analysis. Machine learning methods support cost-effective soil texture NIR analysis.
format Online
Article
Text
id pubmed-8291647
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-82916472021-07-31 Determining soil particle-size distribution from infrared spectra using machine learning predictions: Methodology and modeling Parent, Elizabeth Jeanne Parent, Serge-Étienne Parent, Léon Etienne PLoS One Research Article Accuracy of infrared (IR) models to measure soil particle-size distribution (PSD) depends on soil preparation, methodology (sedimentation, laser), settling times and relevant soil features. Compositional soil data may require log ratio (ilr) transformation to avoid numerical biases. Machine learning can relate numerous independent variables that may impact on NIR spectra to assess particle-size distribution. Our objective was to reach high IRS prediction accuracy across a large range of PSD methods and soil properties. A total of 1298 soil samples from eastern Canada were IR-scanned. Spectra were processed by Stochastic Gradient Boosting (SGB) to predict sand, silt, clay and carbon. Slope and intercept of the log-log relationships between settling time and suspension density function (SDF) (R(2) = 0.84–0.92) performed similarly to NIR spectra using either ilr-transformed (R(2) = 0.81–0.93) or raw percentages (R(2) = 0.76–0.94). Settling times of 0.67-min and 2-h were the most accurate for NIR predictions (R(2) = 0.49–0.79). The NIR prediction of sand sieving method (R(2) = 0.66) was more accurate than sedimentation method(R(2) = 0.53). The NIR 2X gain was less accurate (R(2) = 0.69–0.92) than 4X (R(2) = 0.87–0.95). The MIR (R(2) = 0.45–0.80) performed better than NIR (R(2) = 0.40–0.71) spectra. Adding soil carbon, reconstituted bulk density, pH, red-green-blue color, oxalate and Mehlich3 extracts returned R(2) value of 0.86–0.91 for texture prediction. In addition to slope and intercept of the SDF, 4X gain, method and pre-treatment classes, soil carbon and color appeared to be promising features for routine SGB-processed NIR particle-size analysis. Machine learning methods support cost-effective soil texture NIR analysis. Public Library of Science 2021-07-20 /pmc/articles/PMC8291647/ /pubmed/34283823 http://dx.doi.org/10.1371/journal.pone.0233242 Text en © 2021 Parent et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Parent, Elizabeth Jeanne
Parent, Serge-Étienne
Parent, Léon Etienne
Determining soil particle-size distribution from infrared spectra using machine learning predictions: Methodology and modeling
title Determining soil particle-size distribution from infrared spectra using machine learning predictions: Methodology and modeling
title_full Determining soil particle-size distribution from infrared spectra using machine learning predictions: Methodology and modeling
title_fullStr Determining soil particle-size distribution from infrared spectra using machine learning predictions: Methodology and modeling
title_full_unstemmed Determining soil particle-size distribution from infrared spectra using machine learning predictions: Methodology and modeling
title_short Determining soil particle-size distribution from infrared spectra using machine learning predictions: Methodology and modeling
title_sort determining soil particle-size distribution from infrared spectra using machine learning predictions: methodology and modeling
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8291647/
https://www.ncbi.nlm.nih.gov/pubmed/34283823
http://dx.doi.org/10.1371/journal.pone.0233242
work_keys_str_mv AT parentelizabethjeanne determiningsoilparticlesizedistributionfrominfraredspectrausingmachinelearningpredictionsmethodologyandmodeling
AT parentsergeetienne determiningsoilparticlesizedistributionfrominfraredspectrausingmachinelearningpredictionsmethodologyandmodeling
AT parentleonetienne determiningsoilparticlesizedistributionfrominfraredspectrausingmachinelearningpredictionsmethodologyandmodeling