Cargando…
Microbiome Sample Comparison and Search: From Pair-Wise Calculations to Model-Based Matching
A huge quantity of microbiome samples have been accumulated, and more are yet to come from all niches around the globe. With the accumulation of data, there is an urgent need for comparisons and searches of microbiome samples among thousands of millions of samples in a fast and accurate manner. Howe...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8059704/ https://www.ncbi.nlm.nih.gov/pubmed/33897651 http://dx.doi.org/10.3389/fmicb.2021.642439 |
_version_ | 1783681228450299904 |
---|---|
author | Zha, Yuguo Chong, Hui Ning, Kang |
author_facet | Zha, Yuguo Chong, Hui Ning, Kang |
author_sort | Zha, Yuguo |
collection | PubMed |
description | A huge quantity of microbiome samples have been accumulated, and more are yet to come from all niches around the globe. With the accumulation of data, there is an urgent need for comparisons and searches of microbiome samples among thousands of millions of samples in a fast and accurate manner. However, it is a very difficult computational challenge to identify similar samples, as well as identify their likely origins, among such a grand pool of samples from all around the world. Currently, several approaches have already been proposed for such a challenge, based on either distance calculation, unsupervised algorithms, or supervised algorithms. These methods have advantages and disadvantages for the different settings of comparisons and searches, and their results are also drastically different. In this review, we systematically compared distance-based, unsupervised, and supervised methods for microbiome sample comparison and search. Firstly, we assessed their accuracy and efficiency, both in theory and in practice. Then we described the scenarios in which one or multiple methods were applicable for sample searches. Thirdly, we provided several applications for microbiome sample comparisons and searches, and provided suggestions on the choice of methods. Finally, we provided several perspectives for the future development of microbiome sample comparison and search, including deep learning technologies for tracking the sources of microbiome samples. |
format | Online Article Text |
id | pubmed-8059704 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-80597042021-04-22 Microbiome Sample Comparison and Search: From Pair-Wise Calculations to Model-Based Matching Zha, Yuguo Chong, Hui Ning, Kang Front Microbiol Microbiology A huge quantity of microbiome samples have been accumulated, and more are yet to come from all niches around the globe. With the accumulation of data, there is an urgent need for comparisons and searches of microbiome samples among thousands of millions of samples in a fast and accurate manner. However, it is a very difficult computational challenge to identify similar samples, as well as identify their likely origins, among such a grand pool of samples from all around the world. Currently, several approaches have already been proposed for such a challenge, based on either distance calculation, unsupervised algorithms, or supervised algorithms. These methods have advantages and disadvantages for the different settings of comparisons and searches, and their results are also drastically different. In this review, we systematically compared distance-based, unsupervised, and supervised methods for microbiome sample comparison and search. Firstly, we assessed their accuracy and efficiency, both in theory and in practice. Then we described the scenarios in which one or multiple methods were applicable for sample searches. Thirdly, we provided several applications for microbiome sample comparisons and searches, and provided suggestions on the choice of methods. Finally, we provided several perspectives for the future development of microbiome sample comparison and search, including deep learning technologies for tracking the sources of microbiome samples. Frontiers Media S.A. 2021-04-07 /pmc/articles/PMC8059704/ /pubmed/33897651 http://dx.doi.org/10.3389/fmicb.2021.642439 Text en Copyright © 2021 Zha, Chong and Ning. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Microbiology Zha, Yuguo Chong, Hui Ning, Kang Microbiome Sample Comparison and Search: From Pair-Wise Calculations to Model-Based Matching |
title | Microbiome Sample Comparison and Search: From Pair-Wise Calculations to Model-Based Matching |
title_full | Microbiome Sample Comparison and Search: From Pair-Wise Calculations to Model-Based Matching |
title_fullStr | Microbiome Sample Comparison and Search: From Pair-Wise Calculations to Model-Based Matching |
title_full_unstemmed | Microbiome Sample Comparison and Search: From Pair-Wise Calculations to Model-Based Matching |
title_short | Microbiome Sample Comparison and Search: From Pair-Wise Calculations to Model-Based Matching |
title_sort | microbiome sample comparison and search: from pair-wise calculations to model-based matching |
topic | Microbiology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8059704/ https://www.ncbi.nlm.nih.gov/pubmed/33897651 http://dx.doi.org/10.3389/fmicb.2021.642439 |
work_keys_str_mv | AT zhayuguo microbiomesamplecomparisonandsearchfrompairwisecalculationstomodelbasedmatching AT chonghui microbiomesamplecomparisonandsearchfrompairwisecalculationstomodelbasedmatching AT ningkang microbiomesamplecomparisonandsearchfrompairwisecalculationstomodelbasedmatching |