Cargando…

Information in morphological characters

The construction of morphological character matrices is central to paleontological systematic study, which extracts paleontological information from fossils. Although the word information has been repeatedly mentioned in a wide array of paleontological systematic studies, its meaning has rarely been...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Congyu, Jiangzuo, Qigao, Tschopp, Emanuel, Wang, Haibing, Norell, Mark
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8427622/
https://www.ncbi.nlm.nih.gov/pubmed/34522333
http://dx.doi.org/10.1002/ece3.7874
_version_ 1783750213275484160
author Yu, Congyu
Jiangzuo, Qigao
Tschopp, Emanuel
Wang, Haibing
Norell, Mark
author_facet Yu, Congyu
Jiangzuo, Qigao
Tschopp, Emanuel
Wang, Haibing
Norell, Mark
author_sort Yu, Congyu
collection PubMed
description The construction of morphological character matrices is central to paleontological systematic study, which extracts paleontological information from fossils. Although the word information has been repeatedly mentioned in a wide array of paleontological systematic studies, its meaning has rarely been clarified nor specifically defined. It is important, however, to establish a standard to measure paleontological information because fossils are hardly complete, rendering the recognition of homologous and homoplastic structures difficult. Here, based on information theory, we show the deep connections between paleontological systematic study and communication system engineering. Information is defined as the decrease of uncertainty and it is the information in morphological characters that allows distinguishing operational taxonomic units (OTUs) and reconstructing evolutionary history. We propose that concepts in communication system engineering such as source coding and channel coding, correspond to the construction of diagnostic features and the entire character matrices in paleontological studies. The two coding strategies should be distinguished following typical communication system engineering, because they serve dual purposes. With character matrices from six different vertebrate groups, we analyzed their information properties including source entropy, mutual information, and channel capacity. Estimation of channel capacity shows character saturation of all matrices in transmitting paleontological information, indicating that, due to the presence of noise, oversampling characters not only increases the burden in character scoring, but also may decrease quality of matrices. We further test the use of information entropy, which measures how informative a variable is, as a character weighting criterion in parsimony‐based systematic studies. The results show high consistency with existing knowledge with both good resolution and interpretability.
format Online
Article
Text
id pubmed-8427622
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-84276222021-09-13 Information in morphological characters Yu, Congyu Jiangzuo, Qigao Tschopp, Emanuel Wang, Haibing Norell, Mark Ecol Evol Original Research The construction of morphological character matrices is central to paleontological systematic study, which extracts paleontological information from fossils. Although the word information has been repeatedly mentioned in a wide array of paleontological systematic studies, its meaning has rarely been clarified nor specifically defined. It is important, however, to establish a standard to measure paleontological information because fossils are hardly complete, rendering the recognition of homologous and homoplastic structures difficult. Here, based on information theory, we show the deep connections between paleontological systematic study and communication system engineering. Information is defined as the decrease of uncertainty and it is the information in morphological characters that allows distinguishing operational taxonomic units (OTUs) and reconstructing evolutionary history. We propose that concepts in communication system engineering such as source coding and channel coding, correspond to the construction of diagnostic features and the entire character matrices in paleontological studies. The two coding strategies should be distinguished following typical communication system engineering, because they serve dual purposes. With character matrices from six different vertebrate groups, we analyzed their information properties including source entropy, mutual information, and channel capacity. Estimation of channel capacity shows character saturation of all matrices in transmitting paleontological information, indicating that, due to the presence of noise, oversampling characters not only increases the burden in character scoring, but also may decrease quality of matrices. We further test the use of information entropy, which measures how informative a variable is, as a character weighting criterion in parsimony‐based systematic studies. The results show high consistency with existing knowledge with both good resolution and interpretability. John Wiley and Sons Inc. 2021-08-04 /pmc/articles/PMC8427622/ /pubmed/34522333 http://dx.doi.org/10.1002/ece3.7874 Text en © 2021 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Research
Yu, Congyu
Jiangzuo, Qigao
Tschopp, Emanuel
Wang, Haibing
Norell, Mark
Information in morphological characters
title Information in morphological characters
title_full Information in morphological characters
title_fullStr Information in morphological characters
title_full_unstemmed Information in morphological characters
title_short Information in morphological characters
title_sort information in morphological characters
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8427622/
https://www.ncbi.nlm.nih.gov/pubmed/34522333
http://dx.doi.org/10.1002/ece3.7874
work_keys_str_mv AT yucongyu informationinmorphologicalcharacters
AT jiangzuoqigao informationinmorphologicalcharacters
AT tschoppemanuel informationinmorphologicalcharacters
AT wanghaibing informationinmorphologicalcharacters
AT norellmark informationinmorphologicalcharacters