Cargando…

Optimization of Genotype by Sequencing data for phylogenetic purposes

• Herein we propose a framework for assembling and analyzing Genotype by Sequencing (GBS) data to better understand evolutionary relationships within a group of closely related species using the mastiff bats (Molossus) as our model system. Many species within this genus have low-levels of genetic va...

Descripción completa

Detalles Bibliográficos
Autores principales: Loureiro, L.O., Engstrom, M.D., Lim, B.K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7195544/
https://www.ncbi.nlm.nih.gov/pubmed/32373482
http://dx.doi.org/10.1016/j.mex.2020.100892
_version_ 1783528557861928960
author Loureiro, L.O.
Engstrom, M.D.
Lim, B.K.
author_facet Loureiro, L.O.
Engstrom, M.D.
Lim, B.K.
author_sort Loureiro, L.O.
collection PubMed
description • Herein we propose a framework for assembling and analyzing Genotype by Sequencing (GBS) data to better understand evolutionary relationships within a group of closely related species using the mastiff bats (Molossus) as our model system. Many species within this genus have low-levels of genetic variation within and between morphologically distinct species, and the relationships among them remain unresolved using traditional Sanger sequencing methods. Given that both de novo and reference genome pipelines can be used to assemble next generation sequences, and that several tree inference methodologies have been proposed for single nucleotide polymorphism (SNP) data, we test whether different alignments and phylogenetic approaches produce similar results. We also examined how the process of SNP identification and mapping can affect the consistency of the analyses. Different alignments and phylogenetic inferences produced consistent results, supporting the GBS approach for answering evolutionary questions on a macroevolutionary scale when the genetic distance among phenotypically identifiable clades is low. We highlight the importance of exploring the relationships among groups using different assembly assumptions and also distinct phylogenetic inference methods, particularly when addressing phylogenetic questions in genetic and morphologically conservative taxa. • The method uses the comparison of several filter settings, alignments, and tree inference approaches on Genotype by Sequencing data. • Consistent results were found among several approaches. • The methodology successfully recovered well supported species boundaries and phylogenetic relationships among species of mastiff bats not hypothesized by previous methods.
format Online
Article
Text
id pubmed-7195544
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-71955442020-05-05 Optimization of Genotype by Sequencing data for phylogenetic purposes Loureiro, L.O. Engstrom, M.D. Lim, B.K. MethodsX Biochemistry, Genetics and Molecular Biology • Herein we propose a framework for assembling and analyzing Genotype by Sequencing (GBS) data to better understand evolutionary relationships within a group of closely related species using the mastiff bats (Molossus) as our model system. Many species within this genus have low-levels of genetic variation within and between morphologically distinct species, and the relationships among them remain unresolved using traditional Sanger sequencing methods. Given that both de novo and reference genome pipelines can be used to assemble next generation sequences, and that several tree inference methodologies have been proposed for single nucleotide polymorphism (SNP) data, we test whether different alignments and phylogenetic approaches produce similar results. We also examined how the process of SNP identification and mapping can affect the consistency of the analyses. Different alignments and phylogenetic inferences produced consistent results, supporting the GBS approach for answering evolutionary questions on a macroevolutionary scale when the genetic distance among phenotypically identifiable clades is low. We highlight the importance of exploring the relationships among groups using different assembly assumptions and also distinct phylogenetic inference methods, particularly when addressing phylogenetic questions in genetic and morphologically conservative taxa. • The method uses the comparison of several filter settings, alignments, and tree inference approaches on Genotype by Sequencing data. • Consistent results were found among several approaches. • The methodology successfully recovered well supported species boundaries and phylogenetic relationships among species of mastiff bats not hypothesized by previous methods. Elsevier 2020-04-20 /pmc/articles/PMC7195544/ /pubmed/32373482 http://dx.doi.org/10.1016/j.mex.2020.100892 Text en © 2020 The Author(s). Published by Elsevier B.V. http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Biochemistry, Genetics and Molecular Biology
Loureiro, L.O.
Engstrom, M.D.
Lim, B.K.
Optimization of Genotype by Sequencing data for phylogenetic purposes
title Optimization of Genotype by Sequencing data for phylogenetic purposes
title_full Optimization of Genotype by Sequencing data for phylogenetic purposes
title_fullStr Optimization of Genotype by Sequencing data for phylogenetic purposes
title_full_unstemmed Optimization of Genotype by Sequencing data for phylogenetic purposes
title_short Optimization of Genotype by Sequencing data for phylogenetic purposes
title_sort optimization of genotype by sequencing data for phylogenetic purposes
topic Biochemistry, Genetics and Molecular Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7195544/
https://www.ncbi.nlm.nih.gov/pubmed/32373482
http://dx.doi.org/10.1016/j.mex.2020.100892
work_keys_str_mv AT loureirolo optimizationofgenotypebysequencingdataforphylogeneticpurposes
AT engstrommd optimizationofgenotypebysequencingdataforphylogeneticpurposes
AT limbk optimizationofgenotypebysequencingdataforphylogeneticpurposes