Cargando…

Phylogenetic Placement of Exact Amplicon Sequences Improves Associations with Clinical Information

Recent algorithmic advances in amplicon-based microbiome studies enable the inference of exact amplicon sequence fragments. These new methods enable the investigation of sub-operational taxonomic units (sOTU) by removing erroneous sequences. However, short (e.g., 150-nucleotide [nt]) DNA sequence fr...

Descripción completa

Detalles Bibliográficos
Autores principales: Janssen, Stefan, McDonald, Daniel, Gonzalez, Antonio, Navas-Molina, Jose A., Jiang, Lingjing, Xu, Zhenjiang Zech, Winker, Kevin, Kado, Deborah M., Orwoll, Eric, Manary, Mark, Mirarab, Siavash, Knight, Rob
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5904434/
https://www.ncbi.nlm.nih.gov/pubmed/29719869
http://dx.doi.org/10.1128/mSystems.00021-18
_version_ 1783315101135142912
author Janssen, Stefan
McDonald, Daniel
Gonzalez, Antonio
Navas-Molina, Jose A.
Jiang, Lingjing
Xu, Zhenjiang Zech
Winker, Kevin
Kado, Deborah M.
Orwoll, Eric
Manary, Mark
Mirarab, Siavash
Knight, Rob
author_facet Janssen, Stefan
McDonald, Daniel
Gonzalez, Antonio
Navas-Molina, Jose A.
Jiang, Lingjing
Xu, Zhenjiang Zech
Winker, Kevin
Kado, Deborah M.
Orwoll, Eric
Manary, Mark
Mirarab, Siavash
Knight, Rob
author_sort Janssen, Stefan
collection PubMed
description Recent algorithmic advances in amplicon-based microbiome studies enable the inference of exact amplicon sequence fragments. These new methods enable the investigation of sub-operational taxonomic units (sOTU) by removing erroneous sequences. However, short (e.g., 150-nucleotide [nt]) DNA sequence fragments do not contain sufficient phylogenetic signal to reproduce a reasonable tree, introducing a barrier in the utilization of critical phylogenetically aware metrics such as Faith’s PD or UniFrac. Although fragment insertion methods do exist, those methods have not been tested for sOTUs from high-throughput amplicon studies in insertions against a broad reference phylogeny. We benchmarked the SATé-enabled phylogenetic placement (SEPP) technique explicitly against 16S V4 sequence fragments and showed that it outperforms the conceptually problematic but often-used practice of reconstructing de novo phylogenies. In addition, we provide a BSD-licensed QIIME2 plugin (https://github.com/biocore/q2-fragment-insertion) for SEPP and integration into the microbial study management platform QIITA. IMPORTANCE The move from OTU-based to sOTU-based analysis, while providing additional resolution, also introduces computational challenges. We demonstrate that one popular method of dealing with sOTUs (building a de novo tree from the short sequences) can provide incorrect results in human gut metagenomic studies and show that phylogenetic placement of the new sequences with SEPP resolves this problem while also yielding other benefits over existing methods.
format Online
Article
Text
id pubmed-5904434
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-59044342018-05-01 Phylogenetic Placement of Exact Amplicon Sequences Improves Associations with Clinical Information Janssen, Stefan McDonald, Daniel Gonzalez, Antonio Navas-Molina, Jose A. Jiang, Lingjing Xu, Zhenjiang Zech Winker, Kevin Kado, Deborah M. Orwoll, Eric Manary, Mark Mirarab, Siavash Knight, Rob mSystems Research Article Recent algorithmic advances in amplicon-based microbiome studies enable the inference of exact amplicon sequence fragments. These new methods enable the investigation of sub-operational taxonomic units (sOTU) by removing erroneous sequences. However, short (e.g., 150-nucleotide [nt]) DNA sequence fragments do not contain sufficient phylogenetic signal to reproduce a reasonable tree, introducing a barrier in the utilization of critical phylogenetically aware metrics such as Faith’s PD or UniFrac. Although fragment insertion methods do exist, those methods have not been tested for sOTUs from high-throughput amplicon studies in insertions against a broad reference phylogeny. We benchmarked the SATé-enabled phylogenetic placement (SEPP) technique explicitly against 16S V4 sequence fragments and showed that it outperforms the conceptually problematic but often-used practice of reconstructing de novo phylogenies. In addition, we provide a BSD-licensed QIIME2 plugin (https://github.com/biocore/q2-fragment-insertion) for SEPP and integration into the microbial study management platform QIITA. IMPORTANCE The move from OTU-based to sOTU-based analysis, while providing additional resolution, also introduces computational challenges. We demonstrate that one popular method of dealing with sOTUs (building a de novo tree from the short sequences) can provide incorrect results in human gut metagenomic studies and show that phylogenetic placement of the new sequences with SEPP resolves this problem while also yielding other benefits over existing methods. American Society for Microbiology 2018-04-17 /pmc/articles/PMC5904434/ /pubmed/29719869 http://dx.doi.org/10.1128/mSystems.00021-18 Text en Copyright © 2018 Janssen et al. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research Article
Janssen, Stefan
McDonald, Daniel
Gonzalez, Antonio
Navas-Molina, Jose A.
Jiang, Lingjing
Xu, Zhenjiang Zech
Winker, Kevin
Kado, Deborah M.
Orwoll, Eric
Manary, Mark
Mirarab, Siavash
Knight, Rob
Phylogenetic Placement of Exact Amplicon Sequences Improves Associations with Clinical Information
title Phylogenetic Placement of Exact Amplicon Sequences Improves Associations with Clinical Information
title_full Phylogenetic Placement of Exact Amplicon Sequences Improves Associations with Clinical Information
title_fullStr Phylogenetic Placement of Exact Amplicon Sequences Improves Associations with Clinical Information
title_full_unstemmed Phylogenetic Placement of Exact Amplicon Sequences Improves Associations with Clinical Information
title_short Phylogenetic Placement of Exact Amplicon Sequences Improves Associations with Clinical Information
title_sort phylogenetic placement of exact amplicon sequences improves associations with clinical information
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5904434/
https://www.ncbi.nlm.nih.gov/pubmed/29719869
http://dx.doi.org/10.1128/mSystems.00021-18
work_keys_str_mv AT janssenstefan phylogeneticplacementofexactampliconsequencesimprovesassociationswithclinicalinformation
AT mcdonalddaniel phylogeneticplacementofexactampliconsequencesimprovesassociationswithclinicalinformation
AT gonzalezantonio phylogeneticplacementofexactampliconsequencesimprovesassociationswithclinicalinformation
AT navasmolinajosea phylogeneticplacementofexactampliconsequencesimprovesassociationswithclinicalinformation
AT jianglingjing phylogeneticplacementofexactampliconsequencesimprovesassociationswithclinicalinformation
AT xuzhenjiangzech phylogeneticplacementofexactampliconsequencesimprovesassociationswithclinicalinformation
AT winkerkevin phylogeneticplacementofexactampliconsequencesimprovesassociationswithclinicalinformation
AT kadodeborahm phylogeneticplacementofexactampliconsequencesimprovesassociationswithclinicalinformation
AT orwolleric phylogeneticplacementofexactampliconsequencesimprovesassociationswithclinicalinformation
AT manarymark phylogeneticplacementofexactampliconsequencesimprovesassociationswithclinicalinformation
AT mirarabsiavash phylogeneticplacementofexactampliconsequencesimprovesassociationswithclinicalinformation
AT knightrob phylogeneticplacementofexactampliconsequencesimprovesassociationswithclinicalinformation