Cargando…

Rampant False Detection of Adaptive Phenotypic Optimization by ParTI-Based Pareto Front Inference

Organisms face tradeoffs in performing multiple tasks. Identifying the optimal phenotypes maximizing the organismal fitness (or Pareto front) and inferring the relevant tasks allow testing phenotypic adaptations and help delineate evolutionary constraints, tradeoffs, and critical fitness components,...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Mengyi, Zhang, Jianzhi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8042732/
https://www.ncbi.nlm.nih.gov/pubmed/33346805
http://dx.doi.org/10.1093/molbev/msaa330
_version_ 1783678178369208320
author Sun, Mengyi
Zhang, Jianzhi
author_facet Sun, Mengyi
Zhang, Jianzhi
author_sort Sun, Mengyi
collection PubMed
description Organisms face tradeoffs in performing multiple tasks. Identifying the optimal phenotypes maximizing the organismal fitness (or Pareto front) and inferring the relevant tasks allow testing phenotypic adaptations and help delineate evolutionary constraints, tradeoffs, and critical fitness components, so are of broad interest. It has been proposed that Pareto fronts can be identified from high-dimensional phenotypic data, including molecular phenotypes such as gene expression levels, by fitting polytopes (lines, triangles, tetrahedrons, and so on), and a program named ParTI was recently introduced for this purpose. ParTI has identified Pareto fronts and inferred phenotypes best for individual tasks (or archetypes) from numerous data sets such as the beak morphologies of Darwin’s finches and mRNA concentrations in human tumors, implying evolutionary optimizations of the involved traits. Nevertheless, the reliabilities of these findings are unknown. Using real and simulated data that lack evolutionary optimization, we here report extremely high false-positive rates of ParTI. The errors arise from phylogenetic relationships or population structures of the organisms analyzed and the flexibility of data analysis in ParTI that is equivalent to p-hacking. Because these problems are virtually universal, our findings cast doubt on almost all ParTI-based results and suggest that reliably identifying Pareto fronts and archetypes from high-dimensional phenotypic data are currently generally difficult.
format Online
Article
Text
id pubmed-8042732
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-80427322021-04-16 Rampant False Detection of Adaptive Phenotypic Optimization by ParTI-Based Pareto Front Inference Sun, Mengyi Zhang, Jianzhi Mol Biol Evol Methods Organisms face tradeoffs in performing multiple tasks. Identifying the optimal phenotypes maximizing the organismal fitness (or Pareto front) and inferring the relevant tasks allow testing phenotypic adaptations and help delineate evolutionary constraints, tradeoffs, and critical fitness components, so are of broad interest. It has been proposed that Pareto fronts can be identified from high-dimensional phenotypic data, including molecular phenotypes such as gene expression levels, by fitting polytopes (lines, triangles, tetrahedrons, and so on), and a program named ParTI was recently introduced for this purpose. ParTI has identified Pareto fronts and inferred phenotypes best for individual tasks (or archetypes) from numerous data sets such as the beak morphologies of Darwin’s finches and mRNA concentrations in human tumors, implying evolutionary optimizations of the involved traits. Nevertheless, the reliabilities of these findings are unknown. Using real and simulated data that lack evolutionary optimization, we here report extremely high false-positive rates of ParTI. The errors arise from phylogenetic relationships or population structures of the organisms analyzed and the flexibility of data analysis in ParTI that is equivalent to p-hacking. Because these problems are virtually universal, our findings cast doubt on almost all ParTI-based results and suggest that reliably identifying Pareto fronts and archetypes from high-dimensional phenotypic data are currently generally difficult. Oxford University Press 2020-12-21 /pmc/articles/PMC8042732/ /pubmed/33346805 http://dx.doi.org/10.1093/molbev/msaa330 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods
Sun, Mengyi
Zhang, Jianzhi
Rampant False Detection of Adaptive Phenotypic Optimization by ParTI-Based Pareto Front Inference
title Rampant False Detection of Adaptive Phenotypic Optimization by ParTI-Based Pareto Front Inference
title_full Rampant False Detection of Adaptive Phenotypic Optimization by ParTI-Based Pareto Front Inference
title_fullStr Rampant False Detection of Adaptive Phenotypic Optimization by ParTI-Based Pareto Front Inference
title_full_unstemmed Rampant False Detection of Adaptive Phenotypic Optimization by ParTI-Based Pareto Front Inference
title_short Rampant False Detection of Adaptive Phenotypic Optimization by ParTI-Based Pareto Front Inference
title_sort rampant false detection of adaptive phenotypic optimization by parti-based pareto front inference
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8042732/
https://www.ncbi.nlm.nih.gov/pubmed/33346805
http://dx.doi.org/10.1093/molbev/msaa330
work_keys_str_mv AT sunmengyi rampantfalsedetectionofadaptivephenotypicoptimizationbypartibasedparetofrontinference
AT zhangjianzhi rampantfalsedetectionofadaptivephenotypicoptimizationbypartibasedparetofrontinference