Cargando…

Smaller is better? Unduly nice accuracy assessments in roof detection using remote sensing data with machine learning and k-fold cross-validation

Deriving the thematic accuracy of models is a fundamental part of image classification analyses. K-fold cross-validation (KCV), as an accuracy assessment technique, can be biased because existing built-in algorithms of software solutions do not handle the high autocorrelation of remotely sensed imag...

Descripción completa

Detalles Bibliográficos
Autores principales: Abriha, Dávid, Srivastava, Prashant K., Szabó, Szilárd
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10006495/
https://www.ncbi.nlm.nih.gov/pubmed/36915546
http://dx.doi.org/10.1016/j.heliyon.2023.e14045
_version_ 1784905308482568192
author Abriha, Dávid
Srivastava, Prashant K.
Szabó, Szilárd
author_facet Abriha, Dávid
Srivastava, Prashant K.
Szabó, Szilárd
author_sort Abriha, Dávid
collection PubMed
description Deriving the thematic accuracy of models is a fundamental part of image classification analyses. K-fold cross-validation (KCV), as an accuracy assessment technique, can be biased because existing built-in algorithms of software solutions do not handle the high autocorrelation of remotely sensed images, leading to overestimation of accuracies. We aimed to quantify the magnitude of the overestimation of KCV-based accuracies and propose a method to overcome this problem with the example of rooftops using a WorldView-2 (WV2) satellite image, and two orthophotos. Random split to training/testing subsets, independent testing and different types of repeated KCV sampling strategies were used to generate input datasets for classification. Results revealed that applying the random splitting of reference data to training/testing subsets and KCV methods had significantly biased the accuracies by up to 17%; overall accuracies (OAs) can incorrectly reach >99%. We found that repeated KCV can provide similar results to independent testing when spatial sampling is applied with a sufficiently large distance threshold (in our case 10 m). Coarser resolution of WV2 ensured more reliable results (up to a 5–9% increase in OA) than orthophotos. Object-based pixel purity of buildings showed that when using a majority filter for at least of 50% of objects the final accuracy approached 100% with each sampling method. The final conclusion is that KCV-based modelling ensures better accuracy than single models (with better pixel purity on the object level), but the accuracy metrics without spatially filtered sampling are not reliable.
format Online
Article
Text
id pubmed-10006495
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-100064952023-03-12 Smaller is better? Unduly nice accuracy assessments in roof detection using remote sensing data with machine learning and k-fold cross-validation Abriha, Dávid Srivastava, Prashant K. Szabó, Szilárd Heliyon Research Article Deriving the thematic accuracy of models is a fundamental part of image classification analyses. K-fold cross-validation (KCV), as an accuracy assessment technique, can be biased because existing built-in algorithms of software solutions do not handle the high autocorrelation of remotely sensed images, leading to overestimation of accuracies. We aimed to quantify the magnitude of the overestimation of KCV-based accuracies and propose a method to overcome this problem with the example of rooftops using a WorldView-2 (WV2) satellite image, and two orthophotos. Random split to training/testing subsets, independent testing and different types of repeated KCV sampling strategies were used to generate input datasets for classification. Results revealed that applying the random splitting of reference data to training/testing subsets and KCV methods had significantly biased the accuracies by up to 17%; overall accuracies (OAs) can incorrectly reach >99%. We found that repeated KCV can provide similar results to independent testing when spatial sampling is applied with a sufficiently large distance threshold (in our case 10 m). Coarser resolution of WV2 ensured more reliable results (up to a 5–9% increase in OA) than orthophotos. Object-based pixel purity of buildings showed that when using a majority filter for at least of 50% of objects the final accuracy approached 100% with each sampling method. The final conclusion is that KCV-based modelling ensures better accuracy than single models (with better pixel purity on the object level), but the accuracy metrics without spatially filtered sampling are not reliable. Elsevier 2023-02-24 /pmc/articles/PMC10006495/ /pubmed/36915546 http://dx.doi.org/10.1016/j.heliyon.2023.e14045 Text en © 2023 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Research Article
Abriha, Dávid
Srivastava, Prashant K.
Szabó, Szilárd
Smaller is better? Unduly nice accuracy assessments in roof detection using remote sensing data with machine learning and k-fold cross-validation
title Smaller is better? Unduly nice accuracy assessments in roof detection using remote sensing data with machine learning and k-fold cross-validation
title_full Smaller is better? Unduly nice accuracy assessments in roof detection using remote sensing data with machine learning and k-fold cross-validation
title_fullStr Smaller is better? Unduly nice accuracy assessments in roof detection using remote sensing data with machine learning and k-fold cross-validation
title_full_unstemmed Smaller is better? Unduly nice accuracy assessments in roof detection using remote sensing data with machine learning and k-fold cross-validation
title_short Smaller is better? Unduly nice accuracy assessments in roof detection using remote sensing data with machine learning and k-fold cross-validation
title_sort smaller is better? unduly nice accuracy assessments in roof detection using remote sensing data with machine learning and k-fold cross-validation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10006495/
https://www.ncbi.nlm.nih.gov/pubmed/36915546
http://dx.doi.org/10.1016/j.heliyon.2023.e14045
work_keys_str_mv AT abrihadavid smallerisbetterundulyniceaccuracyassessmentsinroofdetectionusingremotesensingdatawithmachinelearningandkfoldcrossvalidation
AT srivastavaprashantk smallerisbetterundulyniceaccuracyassessmentsinroofdetectionusingremotesensingdatawithmachinelearningandkfoldcrossvalidation
AT szaboszilard smallerisbetterundulyniceaccuracyassessmentsinroofdetectionusingremotesensingdatawithmachinelearningandkfoldcrossvalidation