Cargando…

Machine learning performance in a microbial molecular autopsy context: A cross-sectional postmortem human population study

BACKGROUND: The postmortem microbiome can provide valuable information to a death investigation and to the human health of the once living. Microbiome sequencing produces, in general, large multi-dimensional datasets that can be difficult to analyze and interpret. Machine learning methods can be use...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yu, Pechal, Jennifer L., Schmidt, Carl J., Jordan, Heather R., Wang, Wesley W., Benbow, M. Eric, Sze, Sing-Hoi, Tarone, Aaron M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6464165/
https://www.ncbi.nlm.nih.gov/pubmed/30986212
http://dx.doi.org/10.1371/journal.pone.0213829
_version_ 1783410832047079424
author Zhang, Yu
Pechal, Jennifer L.
Schmidt, Carl J.
Jordan, Heather R.
Wang, Wesley W.
Benbow, M. Eric
Sze, Sing-Hoi
Tarone, Aaron M.
author_facet Zhang, Yu
Pechal, Jennifer L.
Schmidt, Carl J.
Jordan, Heather R.
Wang, Wesley W.
Benbow, M. Eric
Sze, Sing-Hoi
Tarone, Aaron M.
author_sort Zhang, Yu
collection PubMed
description BACKGROUND: The postmortem microbiome can provide valuable information to a death investigation and to the human health of the once living. Microbiome sequencing produces, in general, large multi-dimensional datasets that can be difficult to analyze and interpret. Machine learning methods can be useful in overcoming this analytical challenge. However, different methods employ distinct strategies to handle complex datasets. It is unclear whether one method is more appropriate than others for modeling postmortem microbiomes and their ability to predict attributes of interest in death investigations, which require understanding of how the microbial communities change after death and may represent those of the once living host. METHODS AND FINDINGS: Postmortem microbiomes were collected by swabbing five anatomical areas during routine death investigation, sequenced and analyzed from 188 death cases. Three machine learning methods (boosted algorithms, random forests, and neural networks) were compared with respect to their abilities to predict case attributes: postmortem interval (PMI), location of death, and manner of death. Accuracy depended on the method used, the numbers of anatomical areas analyzed, and the predicted attribute of death. CONCLUSIONS: All algorithms performed well but with distinct features to their performance. Xgboost often produced the most accurate predictions but may also be more prone to overfitting. Random forest was the most stable across predictions that included more anatomic areas. Analysis of postmortem microbiota from more than three anatomic areas appears to yield limited returns on accuracy, with the eyes and rectum providing the most useful information correlating with circumstances of death in most cases for this dataset.
format Online
Article
Text
id pubmed-6464165
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-64641652019-05-03 Machine learning performance in a microbial molecular autopsy context: A cross-sectional postmortem human population study Zhang, Yu Pechal, Jennifer L. Schmidt, Carl J. Jordan, Heather R. Wang, Wesley W. Benbow, M. Eric Sze, Sing-Hoi Tarone, Aaron M. PLoS One Research Article BACKGROUND: The postmortem microbiome can provide valuable information to a death investigation and to the human health of the once living. Microbiome sequencing produces, in general, large multi-dimensional datasets that can be difficult to analyze and interpret. Machine learning methods can be useful in overcoming this analytical challenge. However, different methods employ distinct strategies to handle complex datasets. It is unclear whether one method is more appropriate than others for modeling postmortem microbiomes and their ability to predict attributes of interest in death investigations, which require understanding of how the microbial communities change after death and may represent those of the once living host. METHODS AND FINDINGS: Postmortem microbiomes were collected by swabbing five anatomical areas during routine death investigation, sequenced and analyzed from 188 death cases. Three machine learning methods (boosted algorithms, random forests, and neural networks) were compared with respect to their abilities to predict case attributes: postmortem interval (PMI), location of death, and manner of death. Accuracy depended on the method used, the numbers of anatomical areas analyzed, and the predicted attribute of death. CONCLUSIONS: All algorithms performed well but with distinct features to their performance. Xgboost often produced the most accurate predictions but may also be more prone to overfitting. Random forest was the most stable across predictions that included more anatomic areas. Analysis of postmortem microbiota from more than three anatomic areas appears to yield limited returns on accuracy, with the eyes and rectum providing the most useful information correlating with circumstances of death in most cases for this dataset. Public Library of Science 2019-04-15 /pmc/articles/PMC6464165/ /pubmed/30986212 http://dx.doi.org/10.1371/journal.pone.0213829 Text en © 2019 Zhang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zhang, Yu
Pechal, Jennifer L.
Schmidt, Carl J.
Jordan, Heather R.
Wang, Wesley W.
Benbow, M. Eric
Sze, Sing-Hoi
Tarone, Aaron M.
Machine learning performance in a microbial molecular autopsy context: A cross-sectional postmortem human population study
title Machine learning performance in a microbial molecular autopsy context: A cross-sectional postmortem human population study
title_full Machine learning performance in a microbial molecular autopsy context: A cross-sectional postmortem human population study
title_fullStr Machine learning performance in a microbial molecular autopsy context: A cross-sectional postmortem human population study
title_full_unstemmed Machine learning performance in a microbial molecular autopsy context: A cross-sectional postmortem human population study
title_short Machine learning performance in a microbial molecular autopsy context: A cross-sectional postmortem human population study
title_sort machine learning performance in a microbial molecular autopsy context: a cross-sectional postmortem human population study
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6464165/
https://www.ncbi.nlm.nih.gov/pubmed/30986212
http://dx.doi.org/10.1371/journal.pone.0213829
work_keys_str_mv AT zhangyu machinelearningperformanceinamicrobialmolecularautopsycontextacrosssectionalpostmortemhumanpopulationstudy
AT pechaljenniferl machinelearningperformanceinamicrobialmolecularautopsycontextacrosssectionalpostmortemhumanpopulationstudy
AT schmidtcarlj machinelearningperformanceinamicrobialmolecularautopsycontextacrosssectionalpostmortemhumanpopulationstudy
AT jordanheatherr machinelearningperformanceinamicrobialmolecularautopsycontextacrosssectionalpostmortemhumanpopulationstudy
AT wangwesleyw machinelearningperformanceinamicrobialmolecularautopsycontextacrosssectionalpostmortemhumanpopulationstudy
AT benbowmeric machinelearningperformanceinamicrobialmolecularautopsycontextacrosssectionalpostmortemhumanpopulationstudy
AT szesinghoi machinelearningperformanceinamicrobialmolecularautopsycontextacrosssectionalpostmortemhumanpopulationstudy
AT taroneaaronm machinelearningperformanceinamicrobialmolecularautopsycontextacrosssectionalpostmortemhumanpopulationstudy