Cargando…
Applying machine learning to investigate long‐term insect–plant interactions preserved on digitized herbarium specimens
PREMISE: Despite the economic significance of insect damage to plants (i.e., herbivory), long‐term data documenting changes in herbivory are limited. Millions of pressed plant specimens are now available online and can be used to collect big data on plant–insect interactions during the Anthropocene....
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7328658/ https://www.ncbi.nlm.nih.gov/pubmed/32626611 http://dx.doi.org/10.1002/aps3.11369 |
_version_ | 1783552770882666496 |
---|---|
author | Meineke, Emily K. Tomasi, Carlo Yuan, Song Pryer, Kathleen M. |
author_facet | Meineke, Emily K. Tomasi, Carlo Yuan, Song Pryer, Kathleen M. |
author_sort | Meineke, Emily K. |
collection | PubMed |
description | PREMISE: Despite the economic significance of insect damage to plants (i.e., herbivory), long‐term data documenting changes in herbivory are limited. Millions of pressed plant specimens are now available online and can be used to collect big data on plant–insect interactions during the Anthropocene. METHODS: We initiated development of machine learning methods to automate extraction of herbivory data from herbarium specimens by training an insect damage detector and a damage type classifier on two distantly related plant species (Quercus bicolor and Onoclea sensibilis). We experimented with (1) classifying six types of herbivory and two control categories of undamaged leaf, and (2) detecting two of the damage categories for which several hundred annotations were available. RESULTS: Damage detection results were mixed, with a mean average precision of 45% in the simultaneous detection and classification of two types of damage. However, damage classification on hand‐drawn boxes identified the correct type of herbivory 81.5% of the time in eight categories. The damage classifier was accurate for categories with 100 or more test samples. DISCUSSION: These tools are a promising first step for the automation of herbivory data collection. We describe ongoing efforts to increase the accuracy of these models, allowing researchers to extract similar data and apply them to biological hypotheses. |
format | Online Article Text |
id | pubmed-7328658 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-73286582020-07-02 Applying machine learning to investigate long‐term insect–plant interactions preserved on digitized herbarium specimens Meineke, Emily K. Tomasi, Carlo Yuan, Song Pryer, Kathleen M. Appl Plant Sci Special Issue Articles PREMISE: Despite the economic significance of insect damage to plants (i.e., herbivory), long‐term data documenting changes in herbivory are limited. Millions of pressed plant specimens are now available online and can be used to collect big data on plant–insect interactions during the Anthropocene. METHODS: We initiated development of machine learning methods to automate extraction of herbivory data from herbarium specimens by training an insect damage detector and a damage type classifier on two distantly related plant species (Quercus bicolor and Onoclea sensibilis). We experimented with (1) classifying six types of herbivory and two control categories of undamaged leaf, and (2) detecting two of the damage categories for which several hundred annotations were available. RESULTS: Damage detection results were mixed, with a mean average precision of 45% in the simultaneous detection and classification of two types of damage. However, damage classification on hand‐drawn boxes identified the correct type of herbivory 81.5% of the time in eight categories. The damage classifier was accurate for categories with 100 or more test samples. DISCUSSION: These tools are a promising first step for the automation of herbivory data collection. We describe ongoing efforts to increase the accuracy of these models, allowing researchers to extract similar data and apply them to biological hypotheses. John Wiley and Sons Inc. 2020-07-01 /pmc/articles/PMC7328658/ /pubmed/32626611 http://dx.doi.org/10.1002/aps3.11369 Text en © 2020 Meineke et al. Applications in Plant Sciences is published by Wiley Periodicals, LLC. on behalf of the Botanical Society of America This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes. |
spellingShingle | Special Issue Articles Meineke, Emily K. Tomasi, Carlo Yuan, Song Pryer, Kathleen M. Applying machine learning to investigate long‐term insect–plant interactions preserved on digitized herbarium specimens |
title | Applying machine learning to investigate long‐term insect–plant interactions preserved on digitized herbarium specimens |
title_full | Applying machine learning to investigate long‐term insect–plant interactions preserved on digitized herbarium specimens |
title_fullStr | Applying machine learning to investigate long‐term insect–plant interactions preserved on digitized herbarium specimens |
title_full_unstemmed | Applying machine learning to investigate long‐term insect–plant interactions preserved on digitized herbarium specimens |
title_short | Applying machine learning to investigate long‐term insect–plant interactions preserved on digitized herbarium specimens |
title_sort | applying machine learning to investigate long‐term insect–plant interactions preserved on digitized herbarium specimens |
topic | Special Issue Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7328658/ https://www.ncbi.nlm.nih.gov/pubmed/32626611 http://dx.doi.org/10.1002/aps3.11369 |
work_keys_str_mv | AT meinekeemilyk applyingmachinelearningtoinvestigatelongterminsectplantinteractionspreservedondigitizedherbariumspecimens AT tomasicarlo applyingmachinelearningtoinvestigatelongterminsectplantinteractionspreservedondigitizedherbariumspecimens AT yuansong applyingmachinelearningtoinvestigatelongterminsectplantinteractionspreservedondigitizedherbariumspecimens AT pryerkathleenm applyingmachinelearningtoinvestigatelongterminsectplantinteractionspreservedondigitizedherbariumspecimens |