Cargando…

Microbiome Sample Comparison and Search: From Pair-Wise Calculations to Model-Based Matching

A huge quantity of microbiome samples have been accumulated, and more are yet to come from all niches around the globe. With the accumulation of data, there is an urgent need for comparisons and searches of microbiome samples among thousands of millions of samples in a fast and accurate manner. Howe...

Descripción completa

Detalles Bibliográficos
Autores principales: Zha, Yuguo, Chong, Hui, Ning, Kang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8059704/
https://www.ncbi.nlm.nih.gov/pubmed/33897651
http://dx.doi.org/10.3389/fmicb.2021.642439
_version_ 1783681228450299904
author Zha, Yuguo
Chong, Hui
Ning, Kang
author_facet Zha, Yuguo
Chong, Hui
Ning, Kang
author_sort Zha, Yuguo
collection PubMed
description A huge quantity of microbiome samples have been accumulated, and more are yet to come from all niches around the globe. With the accumulation of data, there is an urgent need for comparisons and searches of microbiome samples among thousands of millions of samples in a fast and accurate manner. However, it is a very difficult computational challenge to identify similar samples, as well as identify their likely origins, among such a grand pool of samples from all around the world. Currently, several approaches have already been proposed for such a challenge, based on either distance calculation, unsupervised algorithms, or supervised algorithms. These methods have advantages and disadvantages for the different settings of comparisons and searches, and their results are also drastically different. In this review, we systematically compared distance-based, unsupervised, and supervised methods for microbiome sample comparison and search. Firstly, we assessed their accuracy and efficiency, both in theory and in practice. Then we described the scenarios in which one or multiple methods were applicable for sample searches. Thirdly, we provided several applications for microbiome sample comparisons and searches, and provided suggestions on the choice of methods. Finally, we provided several perspectives for the future development of microbiome sample comparison and search, including deep learning technologies for tracking the sources of microbiome samples.
format Online
Article
Text
id pubmed-8059704
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-80597042021-04-22 Microbiome Sample Comparison and Search: From Pair-Wise Calculations to Model-Based Matching Zha, Yuguo Chong, Hui Ning, Kang Front Microbiol Microbiology A huge quantity of microbiome samples have been accumulated, and more are yet to come from all niches around the globe. With the accumulation of data, there is an urgent need for comparisons and searches of microbiome samples among thousands of millions of samples in a fast and accurate manner. However, it is a very difficult computational challenge to identify similar samples, as well as identify their likely origins, among such a grand pool of samples from all around the world. Currently, several approaches have already been proposed for such a challenge, based on either distance calculation, unsupervised algorithms, or supervised algorithms. These methods have advantages and disadvantages for the different settings of comparisons and searches, and their results are also drastically different. In this review, we systematically compared distance-based, unsupervised, and supervised methods for microbiome sample comparison and search. Firstly, we assessed their accuracy and efficiency, both in theory and in practice. Then we described the scenarios in which one or multiple methods were applicable for sample searches. Thirdly, we provided several applications for microbiome sample comparisons and searches, and provided suggestions on the choice of methods. Finally, we provided several perspectives for the future development of microbiome sample comparison and search, including deep learning technologies for tracking the sources of microbiome samples. Frontiers Media S.A. 2021-04-07 /pmc/articles/PMC8059704/ /pubmed/33897651 http://dx.doi.org/10.3389/fmicb.2021.642439 Text en Copyright © 2021 Zha, Chong and Ning. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Zha, Yuguo
Chong, Hui
Ning, Kang
Microbiome Sample Comparison and Search: From Pair-Wise Calculations to Model-Based Matching
title Microbiome Sample Comparison and Search: From Pair-Wise Calculations to Model-Based Matching
title_full Microbiome Sample Comparison and Search: From Pair-Wise Calculations to Model-Based Matching
title_fullStr Microbiome Sample Comparison and Search: From Pair-Wise Calculations to Model-Based Matching
title_full_unstemmed Microbiome Sample Comparison and Search: From Pair-Wise Calculations to Model-Based Matching
title_short Microbiome Sample Comparison and Search: From Pair-Wise Calculations to Model-Based Matching
title_sort microbiome sample comparison and search: from pair-wise calculations to model-based matching
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8059704/
https://www.ncbi.nlm.nih.gov/pubmed/33897651
http://dx.doi.org/10.3389/fmicb.2021.642439
work_keys_str_mv AT zhayuguo microbiomesamplecomparisonandsearchfrompairwisecalculationstomodelbasedmatching
AT chonghui microbiomesamplecomparisonandsearchfrompairwisecalculationstomodelbasedmatching
AT ningkang microbiomesamplecomparisonandsearchfrompairwisecalculationstomodelbasedmatching