Cargando…

Targeting legume loci: A comparison of three methods for target enrichment bait design in Leguminosae phylogenomics

PREMISE OF THE STUDY: The development of pipelines for locus discovery has spurred the use of target enrichment for plant phylogenomics. However, few studies have compared pipelines from locus discovery and bait design, through validation, to tree inference. We compared three methods within Legumino...

Descripción completa

Detalles Bibliográficos
Autores principales: Vatanparast, Mohammad, Powell, Adrian, Doyle, Jeff J., Egan, Ashley N.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5895186/
https://www.ncbi.nlm.nih.gov/pubmed/29732266
http://dx.doi.org/10.1002/aps3.1036
Descripción
Sumario:PREMISE OF THE STUDY: The development of pipelines for locus discovery has spurred the use of target enrichment for plant phylogenomics. However, few studies have compared pipelines from locus discovery and bait design, through validation, to tree inference. We compared three methods within Leguminosae (Fabaceae) and present a workflow for future efforts. METHODS: Using 30 transcriptomes, we compared Hyb‐Seq, MarkerMiner, and the Yang and Smith (Y&S) pipelines for locus discovery, validated 7501 baits targeting 507 loci across 25 genera via Illumina sequencing, and inferred gene and species trees via concatenation‐ and coalescent‐based methods. RESULTS: Hyb‐Seq discovered loci with the longest mean length. MarkerMiner discovered the most conserved loci with the least flagged as paralogous. Y&S offered the most parsimony‐informative sites and putative orthologs. Target recovery averaged 93% across taxa. We optimized our targeted locus set based on a workflow designed to minimize paralog/ortholog conflation and thus present 423 loci for legume phylogenomics. CONCLUSIONS: Methods differed across criteria important for phylogenetic marker development. We recommend Hyb‐Seq as a method that may be useful for most phylogenomic projects. Our targeted locus set is a resource for future, community‐driven efforts to reconstruct the legume tree of life.