Cargando…

A data science challenge for converting airborne remote sensing data into ecological information

Ecology has reached the point where data science competitions, in which multiple groups solve the same problem using the same data by different methods, will be productive for advancing quantitative methods for tasks such as species identification from remote sensing images. We ran a competition to...

Descripción completa

Detalles Bibliográficos
Autores principales: Marconi, Sergio, Graves, Sarah J., Gong, Dihong, Nia, Morteza Shahriari, Le Bras, Marion, Dorr, Bonnie J., Fontana, Peter, Gearhart, Justin, Greenberg, Craig, Harris, Dave J., Kumar, Sugumar Arvind, Nishant, Agarwal, Prarabdh, Joshi, Rege, Sundeep U., Bohlman, Stephanie Ann, White, Ethan P., Wang, Daisy Zhe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6397763/
https://www.ncbi.nlm.nih.gov/pubmed/30842892
http://dx.doi.org/10.7717/peerj.5843
_version_ 1783399458557394944
author Marconi, Sergio
Graves, Sarah J.
Gong, Dihong
Nia, Morteza Shahriari
Le Bras, Marion
Dorr, Bonnie J.
Fontana, Peter
Gearhart, Justin
Greenberg, Craig
Harris, Dave J.
Kumar, Sugumar Arvind
Nishant, Agarwal
Prarabdh, Joshi
Rege, Sundeep U.
Bohlman, Stephanie Ann
White, Ethan P.
Wang, Daisy Zhe
author_facet Marconi, Sergio
Graves, Sarah J.
Gong, Dihong
Nia, Morteza Shahriari
Le Bras, Marion
Dorr, Bonnie J.
Fontana, Peter
Gearhart, Justin
Greenberg, Craig
Harris, Dave J.
Kumar, Sugumar Arvind
Nishant, Agarwal
Prarabdh, Joshi
Rege, Sundeep U.
Bohlman, Stephanie Ann
White, Ethan P.
Wang, Daisy Zhe
author_sort Marconi, Sergio
collection PubMed
description Ecology has reached the point where data science competitions, in which multiple groups solve the same problem using the same data by different methods, will be productive for advancing quantitative methods for tasks such as species identification from remote sensing images. We ran a competition to help improve three tasks that are central to converting images into information on individual trees: (1) crown segmentation, for identifying the location and size of individual trees; (2) alignment, to match ground truthed trees with remote sensing; and (3) species classification of individual trees. Six teams (composed of 16 individual participants) submitted predictions for one or more tasks. The crown segmentation task proved to be the most challenging, with the highest-performing algorithm yielding only 34% overlap between remotely sensed crowns and the ground truthed trees. However, most algorithms performed better on large trees. For the alignment task, an algorithm based on minimizing the difference, in terms of both position and tree size, between ground truthed and remotely sensed crowns yielded a perfect alignment. In hindsight, this task was over simplified by only including targeted trees instead of all possible remotely sensed crowns. Several algorithms performed well for species classification, with the highest-performing algorithm correctly classifying 92% of individuals and performing well on both common and rare species. Comparisons of results across algorithms provided a number of insights for improving the overall accuracy in extracting ecological information from remote sensing. Our experience suggests that this kind of competition can benefit methods development in ecology and biology more broadly.
format Online
Article
Text
id pubmed-6397763
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-63977632019-03-06 A data science challenge for converting airborne remote sensing data into ecological information Marconi, Sergio Graves, Sarah J. Gong, Dihong Nia, Morteza Shahriari Le Bras, Marion Dorr, Bonnie J. Fontana, Peter Gearhart, Justin Greenberg, Craig Harris, Dave J. Kumar, Sugumar Arvind Nishant, Agarwal Prarabdh, Joshi Rege, Sundeep U. Bohlman, Stephanie Ann White, Ethan P. Wang, Daisy Zhe PeerJ Ecology Ecology has reached the point where data science competitions, in which multiple groups solve the same problem using the same data by different methods, will be productive for advancing quantitative methods for tasks such as species identification from remote sensing images. We ran a competition to help improve three tasks that are central to converting images into information on individual trees: (1) crown segmentation, for identifying the location and size of individual trees; (2) alignment, to match ground truthed trees with remote sensing; and (3) species classification of individual trees. Six teams (composed of 16 individual participants) submitted predictions for one or more tasks. The crown segmentation task proved to be the most challenging, with the highest-performing algorithm yielding only 34% overlap between remotely sensed crowns and the ground truthed trees. However, most algorithms performed better on large trees. For the alignment task, an algorithm based on minimizing the difference, in terms of both position and tree size, between ground truthed and remotely sensed crowns yielded a perfect alignment. In hindsight, this task was over simplified by only including targeted trees instead of all possible remotely sensed crowns. Several algorithms performed well for species classification, with the highest-performing algorithm correctly classifying 92% of individuals and performing well on both common and rare species. Comparisons of results across algorithms provided a number of insights for improving the overall accuracy in extracting ecological information from remote sensing. Our experience suggests that this kind of competition can benefit methods development in ecology and biology more broadly. PeerJ Inc. 2019-02-28 /pmc/articles/PMC6397763/ /pubmed/30842892 http://dx.doi.org/10.7717/peerj.5843 Text en © 2019 Marconi et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Ecology
Marconi, Sergio
Graves, Sarah J.
Gong, Dihong
Nia, Morteza Shahriari
Le Bras, Marion
Dorr, Bonnie J.
Fontana, Peter
Gearhart, Justin
Greenberg, Craig
Harris, Dave J.
Kumar, Sugumar Arvind
Nishant, Agarwal
Prarabdh, Joshi
Rege, Sundeep U.
Bohlman, Stephanie Ann
White, Ethan P.
Wang, Daisy Zhe
A data science challenge for converting airborne remote sensing data into ecological information
title A data science challenge for converting airborne remote sensing data into ecological information
title_full A data science challenge for converting airborne remote sensing data into ecological information
title_fullStr A data science challenge for converting airborne remote sensing data into ecological information
title_full_unstemmed A data science challenge for converting airborne remote sensing data into ecological information
title_short A data science challenge for converting airborne remote sensing data into ecological information
title_sort data science challenge for converting airborne remote sensing data into ecological information
topic Ecology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6397763/
https://www.ncbi.nlm.nih.gov/pubmed/30842892
http://dx.doi.org/10.7717/peerj.5843
work_keys_str_mv AT marconisergio adatasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT gravessarahj adatasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT gongdihong adatasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT niamortezashahriari adatasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT lebrasmarion adatasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT dorrbonniej adatasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT fontanapeter adatasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT gearhartjustin adatasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT greenbergcraig adatasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT harrisdavej adatasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT kumarsugumararvind adatasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT nishantagarwal adatasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT prarabdhjoshi adatasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT regesundeepu adatasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT bohlmanstephanieann adatasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT whiteethanp adatasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT wangdaisyzhe adatasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT marconisergio datasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT gravessarahj datasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT gongdihong datasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT niamortezashahriari datasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT lebrasmarion datasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT dorrbonniej datasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT fontanapeter datasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT gearhartjustin datasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT greenbergcraig datasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT harrisdavej datasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT kumarsugumararvind datasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT nishantagarwal datasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT prarabdhjoshi datasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT regesundeepu datasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT bohlmanstephanieann datasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT whiteethanp datasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation
AT wangdaisyzhe datasciencechallengeforconvertingairborneremotesensingdataintoecologicalinformation