Cargando…

Bioinformatic Amplicon Read Processing Strategies Strongly Affect Eukaryotic Diversity and the Taxonomic Composition of Communities

Amplicon read sequencing has revolutionized the field of microbial diversity studies. The technique has been developed for bacterial assemblages and has undergone rigorous testing with mock communities. However, due to the great complexity of eukaryotes and the numbers of different rDNA copies, anal...

Descripción completa

Detalles Bibliográficos
Autores principales: Majaneva, Markus, Hyytiäinen, Kirsi, Varvio, Sirkka Liisa, Nagai, Satoshi, Blomster, Jaanika
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4457843/
https://www.ncbi.nlm.nih.gov/pubmed/26047335
http://dx.doi.org/10.1371/journal.pone.0130035
_version_ 1782375006275108864
author Majaneva, Markus
Hyytiäinen, Kirsi
Varvio, Sirkka Liisa
Nagai, Satoshi
Blomster, Jaanika
author_facet Majaneva, Markus
Hyytiäinen, Kirsi
Varvio, Sirkka Liisa
Nagai, Satoshi
Blomster, Jaanika
author_sort Majaneva, Markus
collection PubMed
description Amplicon read sequencing has revolutionized the field of microbial diversity studies. The technique has been developed for bacterial assemblages and has undergone rigorous testing with mock communities. However, due to the great complexity of eukaryotes and the numbers of different rDNA copies, analyzing eukaryotic diversity is more demanding than analyzing bacterial or mock communities, so studies are needed that test the methods of analyses on taxonomically diverse natural communities. In this study, we used 20 samples collected from the Baltic Sea ice, slush and under-ice water to investigate three program packages (UPARSE, mothur and QIIME) and 18 different bioinformatic strategies implemented in them. Our aim was to assess the impact of the initial steps of bioinformatic strategies on the results when analyzing natural eukaryotic communities. We found significant differences among the strategies in resulting read length, number of OTUs and estimates of diversity as well as clear differences in the taxonomic composition of communities. The differences arose mainly because of the variable number of chimeric reads that passed the pre-processing steps. Singleton removal and denoising substantially lowered the number of errors. Our study showed that the initial steps of the bioinformatic amplicon read processing strategies require careful consideration before applying them to eukaryotic communities.
format Online
Article
Text
id pubmed-4457843
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-44578432015-06-09 Bioinformatic Amplicon Read Processing Strategies Strongly Affect Eukaryotic Diversity and the Taxonomic Composition of Communities Majaneva, Markus Hyytiäinen, Kirsi Varvio, Sirkka Liisa Nagai, Satoshi Blomster, Jaanika PLoS One Research Article Amplicon read sequencing has revolutionized the field of microbial diversity studies. The technique has been developed for bacterial assemblages and has undergone rigorous testing with mock communities. However, due to the great complexity of eukaryotes and the numbers of different rDNA copies, analyzing eukaryotic diversity is more demanding than analyzing bacterial or mock communities, so studies are needed that test the methods of analyses on taxonomically diverse natural communities. In this study, we used 20 samples collected from the Baltic Sea ice, slush and under-ice water to investigate three program packages (UPARSE, mothur and QIIME) and 18 different bioinformatic strategies implemented in them. Our aim was to assess the impact of the initial steps of bioinformatic strategies on the results when analyzing natural eukaryotic communities. We found significant differences among the strategies in resulting read length, number of OTUs and estimates of diversity as well as clear differences in the taxonomic composition of communities. The differences arose mainly because of the variable number of chimeric reads that passed the pre-processing steps. Singleton removal and denoising substantially lowered the number of errors. Our study showed that the initial steps of the bioinformatic amplicon read processing strategies require careful consideration before applying them to eukaryotic communities. Public Library of Science 2015-06-05 /pmc/articles/PMC4457843/ /pubmed/26047335 http://dx.doi.org/10.1371/journal.pone.0130035 Text en © 2015 Majaneva et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Majaneva, Markus
Hyytiäinen, Kirsi
Varvio, Sirkka Liisa
Nagai, Satoshi
Blomster, Jaanika
Bioinformatic Amplicon Read Processing Strategies Strongly Affect Eukaryotic Diversity and the Taxonomic Composition of Communities
title Bioinformatic Amplicon Read Processing Strategies Strongly Affect Eukaryotic Diversity and the Taxonomic Composition of Communities
title_full Bioinformatic Amplicon Read Processing Strategies Strongly Affect Eukaryotic Diversity and the Taxonomic Composition of Communities
title_fullStr Bioinformatic Amplicon Read Processing Strategies Strongly Affect Eukaryotic Diversity and the Taxonomic Composition of Communities
title_full_unstemmed Bioinformatic Amplicon Read Processing Strategies Strongly Affect Eukaryotic Diversity and the Taxonomic Composition of Communities
title_short Bioinformatic Amplicon Read Processing Strategies Strongly Affect Eukaryotic Diversity and the Taxonomic Composition of Communities
title_sort bioinformatic amplicon read processing strategies strongly affect eukaryotic diversity and the taxonomic composition of communities
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4457843/
https://www.ncbi.nlm.nih.gov/pubmed/26047335
http://dx.doi.org/10.1371/journal.pone.0130035
work_keys_str_mv AT majanevamarkus bioinformaticampliconreadprocessingstrategiesstronglyaffecteukaryoticdiversityandthetaxonomiccompositionofcommunities
AT hyytiainenkirsi bioinformaticampliconreadprocessingstrategiesstronglyaffecteukaryoticdiversityandthetaxonomiccompositionofcommunities
AT varviosirkkaliisa bioinformaticampliconreadprocessingstrategiesstronglyaffecteukaryoticdiversityandthetaxonomiccompositionofcommunities
AT nagaisatoshi bioinformaticampliconreadprocessingstrategiesstronglyaffecteukaryoticdiversityandthetaxonomiccompositionofcommunities
AT blomsterjaanika bioinformaticampliconreadprocessingstrategiesstronglyaffecteukaryoticdiversityandthetaxonomiccompositionofcommunities