Cargando…

Optimizing digitalization effort in morphometrics

Quantifying phenotypes is a common practice for addressing questions regarding morphological variation. The time dedicated to data acquisition can vary greatly depending on methods and on the required quantity of information. Optimizing digitization effort can be done either by pooling datasets amon...

Descripción completa

Detalles Bibliográficos
Autores principales: Evin, Allowen, Bonhomme, Vincent, Claude, Julien
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7723759/
https://www.ncbi.nlm.nih.gov/pubmed/33324759
http://dx.doi.org/10.1093/biomethods/bpaa023
_version_ 1783620410221264896
author Evin, Allowen
Bonhomme, Vincent
Claude, Julien
author_facet Evin, Allowen
Bonhomme, Vincent
Claude, Julien
author_sort Evin, Allowen
collection PubMed
description Quantifying phenotypes is a common practice for addressing questions regarding morphological variation. The time dedicated to data acquisition can vary greatly depending on methods and on the required quantity of information. Optimizing digitization effort can be done either by pooling datasets among users, by automatizing data collection, or by reducing the number of measurements. Pooling datasets among users is not without risk since potential errors arising from multiple operators in data acquisition prevent combining morphometric datasets. We present an analytical workflow to estimate within and among operator biases and to assess whether morphometric datasets can be pooled. We show that pooling and sharing data requires careful examination of the errors occurring during data acquisition, that the choice of morphometric approach influences amount of error, and that in some cases pooling data should be avoided. The demonstration is based on a worked example (Sus scrofa teeth) using a combinations of 18 morphometric approaches and datasets for which we identified and quantified several potential sources of errors in the workflow. We show that it is possible to estimate the analytical power of a study using a small subset of data to select the best morphometric protocol and to optimize the number of variables necessary for analysis. In particular, we focus on semi-landmarks, which often produce an inflation of variables in contrast to the number of available observations use in statistical testing. We show how the workflow can be used for optimizing digitization efforts and provide recommendations for best practices in error management.
format Online
Article
Text
id pubmed-7723759
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-77237592020-12-14 Optimizing digitalization effort in morphometrics Evin, Allowen Bonhomme, Vincent Claude, Julien Biol Methods Protoc Methods Manuscript Quantifying phenotypes is a common practice for addressing questions regarding morphological variation. The time dedicated to data acquisition can vary greatly depending on methods and on the required quantity of information. Optimizing digitization effort can be done either by pooling datasets among users, by automatizing data collection, or by reducing the number of measurements. Pooling datasets among users is not without risk since potential errors arising from multiple operators in data acquisition prevent combining morphometric datasets. We present an analytical workflow to estimate within and among operator biases and to assess whether morphometric datasets can be pooled. We show that pooling and sharing data requires careful examination of the errors occurring during data acquisition, that the choice of morphometric approach influences amount of error, and that in some cases pooling data should be avoided. The demonstration is based on a worked example (Sus scrofa teeth) using a combinations of 18 morphometric approaches and datasets for which we identified and quantified several potential sources of errors in the workflow. We show that it is possible to estimate the analytical power of a study using a small subset of data to select the best morphometric protocol and to optimize the number of variables necessary for analysis. In particular, we focus on semi-landmarks, which often produce an inflation of variables in contrast to the number of available observations use in statistical testing. We show how the workflow can be used for optimizing digitization efforts and provide recommendations for best practices in error management. Oxford University Press 2020-11-16 /pmc/articles/PMC7723759/ /pubmed/33324759 http://dx.doi.org/10.1093/biomethods/bpaa023 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Manuscript
Evin, Allowen
Bonhomme, Vincent
Claude, Julien
Optimizing digitalization effort in morphometrics
title Optimizing digitalization effort in morphometrics
title_full Optimizing digitalization effort in morphometrics
title_fullStr Optimizing digitalization effort in morphometrics
title_full_unstemmed Optimizing digitalization effort in morphometrics
title_short Optimizing digitalization effort in morphometrics
title_sort optimizing digitalization effort in morphometrics
topic Methods Manuscript
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7723759/
https://www.ncbi.nlm.nih.gov/pubmed/33324759
http://dx.doi.org/10.1093/biomethods/bpaa023
work_keys_str_mv AT evinallowen optimizingdigitalizationeffortinmorphometrics
AT bonhommevincent optimizingdigitalizationeffortinmorphometrics
AT claudejulien optimizingdigitalizationeffortinmorphometrics