Cargando…
Optimization of Genotype by Sequencing data for phylogenetic purposes
• Herein we propose a framework for assembling and analyzing Genotype by Sequencing (GBS) data to better understand evolutionary relationships within a group of closely related species using the mastiff bats (Molossus) as our model system. Many species within this genus have low-levels of genetic va...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7195544/ https://www.ncbi.nlm.nih.gov/pubmed/32373482 http://dx.doi.org/10.1016/j.mex.2020.100892 |
_version_ | 1783528557861928960 |
---|---|
author | Loureiro, L.O. Engstrom, M.D. Lim, B.K. |
author_facet | Loureiro, L.O. Engstrom, M.D. Lim, B.K. |
author_sort | Loureiro, L.O. |
collection | PubMed |
description | • Herein we propose a framework for assembling and analyzing Genotype by Sequencing (GBS) data to better understand evolutionary relationships within a group of closely related species using the mastiff bats (Molossus) as our model system. Many species within this genus have low-levels of genetic variation within and between morphologically distinct species, and the relationships among them remain unresolved using traditional Sanger sequencing methods. Given that both de novo and reference genome pipelines can be used to assemble next generation sequences, and that several tree inference methodologies have been proposed for single nucleotide polymorphism (SNP) data, we test whether different alignments and phylogenetic approaches produce similar results. We also examined how the process of SNP identification and mapping can affect the consistency of the analyses. Different alignments and phylogenetic inferences produced consistent results, supporting the GBS approach for answering evolutionary questions on a macroevolutionary scale when the genetic distance among phenotypically identifiable clades is low. We highlight the importance of exploring the relationships among groups using different assembly assumptions and also distinct phylogenetic inference methods, particularly when addressing phylogenetic questions in genetic and morphologically conservative taxa. • The method uses the comparison of several filter settings, alignments, and tree inference approaches on Genotype by Sequencing data. • Consistent results were found among several approaches. • The methodology successfully recovered well supported species boundaries and phylogenetic relationships among species of mastiff bats not hypothesized by previous methods. |
format | Online Article Text |
id | pubmed-7195544 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-71955442020-05-05 Optimization of Genotype by Sequencing data for phylogenetic purposes Loureiro, L.O. Engstrom, M.D. Lim, B.K. MethodsX Biochemistry, Genetics and Molecular Biology • Herein we propose a framework for assembling and analyzing Genotype by Sequencing (GBS) data to better understand evolutionary relationships within a group of closely related species using the mastiff bats (Molossus) as our model system. Many species within this genus have low-levels of genetic variation within and between morphologically distinct species, and the relationships among them remain unresolved using traditional Sanger sequencing methods. Given that both de novo and reference genome pipelines can be used to assemble next generation sequences, and that several tree inference methodologies have been proposed for single nucleotide polymorphism (SNP) data, we test whether different alignments and phylogenetic approaches produce similar results. We also examined how the process of SNP identification and mapping can affect the consistency of the analyses. Different alignments and phylogenetic inferences produced consistent results, supporting the GBS approach for answering evolutionary questions on a macroevolutionary scale when the genetic distance among phenotypically identifiable clades is low. We highlight the importance of exploring the relationships among groups using different assembly assumptions and also distinct phylogenetic inference methods, particularly when addressing phylogenetic questions in genetic and morphologically conservative taxa. • The method uses the comparison of several filter settings, alignments, and tree inference approaches on Genotype by Sequencing data. • Consistent results were found among several approaches. • The methodology successfully recovered well supported species boundaries and phylogenetic relationships among species of mastiff bats not hypothesized by previous methods. Elsevier 2020-04-20 /pmc/articles/PMC7195544/ /pubmed/32373482 http://dx.doi.org/10.1016/j.mex.2020.100892 Text en © 2020 The Author(s). Published by Elsevier B.V. http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Biochemistry, Genetics and Molecular Biology Loureiro, L.O. Engstrom, M.D. Lim, B.K. Optimization of Genotype by Sequencing data for phylogenetic purposes |
title | Optimization of Genotype by Sequencing data for phylogenetic purposes |
title_full | Optimization of Genotype by Sequencing data for phylogenetic purposes |
title_fullStr | Optimization of Genotype by Sequencing data for phylogenetic purposes |
title_full_unstemmed | Optimization of Genotype by Sequencing data for phylogenetic purposes |
title_short | Optimization of Genotype by Sequencing data for phylogenetic purposes |
title_sort | optimization of genotype by sequencing data for phylogenetic purposes |
topic | Biochemistry, Genetics and Molecular Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7195544/ https://www.ncbi.nlm.nih.gov/pubmed/32373482 http://dx.doi.org/10.1016/j.mex.2020.100892 |
work_keys_str_mv | AT loureirolo optimizationofgenotypebysequencingdataforphylogeneticpurposes AT engstrommd optimizationofgenotypebysequencingdataforphylogeneticpurposes AT limbk optimizationofgenotypebysequencingdataforphylogeneticpurposes |