Cargando…

Sharing is caring? Measurement error and the issues arising from combining 3D morphometric datasets

Geometric morphometrics is routinely used in ecology and evolution and morphometric datasets are increasingly shared among researchers, allowing for more comprehensive studies and higher statistical power (as a consequence of increased sample size). However, sharing of morphometric data opens up the...

Descripción completa

Detalles Bibliográficos
Autores principales: Fruciano, Carmelo, Celik, Mélina A., Butler, Kaylene, Dooley, Tom, Weisbecker, Vera, Phillips, Matthew J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5587461/
https://www.ncbi.nlm.nih.gov/pubmed/28904781
http://dx.doi.org/10.1002/ece3.3256
_version_ 1783261985929953280
author Fruciano, Carmelo
Celik, Mélina A.
Butler, Kaylene
Dooley, Tom
Weisbecker, Vera
Phillips, Matthew J.
author_facet Fruciano, Carmelo
Celik, Mélina A.
Butler, Kaylene
Dooley, Tom
Weisbecker, Vera
Phillips, Matthew J.
author_sort Fruciano, Carmelo
collection PubMed
description Geometric morphometrics is routinely used in ecology and evolution and morphometric datasets are increasingly shared among researchers, allowing for more comprehensive studies and higher statistical power (as a consequence of increased sample size). However, sharing of morphometric data opens up the question of how much nonbiologically relevant variation (i.e., measurement error) is introduced in the resulting datasets and how this variation affects analyses. We perform a set of analyses based on an empirical 3D geometric morphometric dataset. In particular, we quantify the amount of error associated with combining data from multiple devices and digitized by multiple operators and test for the presence of bias. We also extend these analyses to a dataset obtained with a recently developed automated method, which does not require human‐digitized landmarks. Further, we analyze how measurement error affects estimates of phylogenetic signal and how its effect compares with the effect of phylogenetic uncertainty. We show that measurement error can be substantial when combining surface models produced by different devices and even more among landmarks digitized by different operators. We also document the presence of small, but significant, amounts of nonrandom error (i.e., bias). Measurement error is heavily reduced by excluding landmarks that are difficult to digitize. The automated method we tested had low levels of error, if used in combination with a procedure for dimensionality reduction. Estimates of phylogenetic signal can be more affected by measurement error than by phylogenetic uncertainty. Our results generally highlight the importance of landmark choice and the usefulness of estimating measurement error. Further, measurement error may limit comparisons of estimates of phylogenetic signal across studies if these have been performed using different devices or by different operators. Finally, we also show how widely held assumptions do not always hold true, particularly that measurement error affects inference more at a shallower phylogenetic scale and that automated methods perform worse than human digitization.
format Online
Article
Text
id pubmed-5587461
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-55874612017-09-13 Sharing is caring? Measurement error and the issues arising from combining 3D morphometric datasets Fruciano, Carmelo Celik, Mélina A. Butler, Kaylene Dooley, Tom Weisbecker, Vera Phillips, Matthew J. Ecol Evol Original Research Geometric morphometrics is routinely used in ecology and evolution and morphometric datasets are increasingly shared among researchers, allowing for more comprehensive studies and higher statistical power (as a consequence of increased sample size). However, sharing of morphometric data opens up the question of how much nonbiologically relevant variation (i.e., measurement error) is introduced in the resulting datasets and how this variation affects analyses. We perform a set of analyses based on an empirical 3D geometric morphometric dataset. In particular, we quantify the amount of error associated with combining data from multiple devices and digitized by multiple operators and test for the presence of bias. We also extend these analyses to a dataset obtained with a recently developed automated method, which does not require human‐digitized landmarks. Further, we analyze how measurement error affects estimates of phylogenetic signal and how its effect compares with the effect of phylogenetic uncertainty. We show that measurement error can be substantial when combining surface models produced by different devices and even more among landmarks digitized by different operators. We also document the presence of small, but significant, amounts of nonrandom error (i.e., bias). Measurement error is heavily reduced by excluding landmarks that are difficult to digitize. The automated method we tested had low levels of error, if used in combination with a procedure for dimensionality reduction. Estimates of phylogenetic signal can be more affected by measurement error than by phylogenetic uncertainty. Our results generally highlight the importance of landmark choice and the usefulness of estimating measurement error. Further, measurement error may limit comparisons of estimates of phylogenetic signal across studies if these have been performed using different devices or by different operators. Finally, we also show how widely held assumptions do not always hold true, particularly that measurement error affects inference more at a shallower phylogenetic scale and that automated methods perform worse than human digitization. John Wiley and Sons Inc. 2017-07-31 /pmc/articles/PMC5587461/ /pubmed/28904781 http://dx.doi.org/10.1002/ece3.3256 Text en © 2017 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. This is an open access article under the terms of the Creative Commons Attribution (http://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Research
Fruciano, Carmelo
Celik, Mélina A.
Butler, Kaylene
Dooley, Tom
Weisbecker, Vera
Phillips, Matthew J.
Sharing is caring? Measurement error and the issues arising from combining 3D morphometric datasets
title Sharing is caring? Measurement error and the issues arising from combining 3D morphometric datasets
title_full Sharing is caring? Measurement error and the issues arising from combining 3D morphometric datasets
title_fullStr Sharing is caring? Measurement error and the issues arising from combining 3D morphometric datasets
title_full_unstemmed Sharing is caring? Measurement error and the issues arising from combining 3D morphometric datasets
title_short Sharing is caring? Measurement error and the issues arising from combining 3D morphometric datasets
title_sort sharing is caring? measurement error and the issues arising from combining 3d morphometric datasets
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5587461/
https://www.ncbi.nlm.nih.gov/pubmed/28904781
http://dx.doi.org/10.1002/ece3.3256
work_keys_str_mv AT frucianocarmelo sharingiscaringmeasurementerrorandtheissuesarisingfromcombining3dmorphometricdatasets
AT celikmelinaa sharingiscaringmeasurementerrorandtheissuesarisingfromcombining3dmorphometricdatasets
AT butlerkaylene sharingiscaringmeasurementerrorandtheissuesarisingfromcombining3dmorphometricdatasets
AT dooleytom sharingiscaringmeasurementerrorandtheissuesarisingfromcombining3dmorphometricdatasets
AT weisbeckervera sharingiscaringmeasurementerrorandtheissuesarisingfromcombining3dmorphometricdatasets
AT phillipsmatthewj sharingiscaringmeasurementerrorandtheissuesarisingfromcombining3dmorphometricdatasets