Cargando…

Targeted Capture of Complete Coding Regions across Divergent Species

Despite continued advances in sequencing technologies, there is a need for methods that can efficiently sequence large numbers of genes from diverse species. One approach to accomplish this is targeted capture (hybrid enrichment). While these methods are well established for genome resequencing proj...

Descripción completa

Detalles Bibliográficos
Autores principales: Schott, Ryan K., Panesar, Bhawandeep, Card, Daren C., Preston, Matthew, Castoe, Todd A., Chang, Belinda S.W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5381602/
https://www.ncbi.nlm.nih.gov/pubmed/28137744
http://dx.doi.org/10.1093/gbe/evx005
_version_ 1782519963927445504
author Schott, Ryan K.
Panesar, Bhawandeep
Card, Daren C.
Preston, Matthew
Castoe, Todd A.
Chang, Belinda S.W.
author_facet Schott, Ryan K.
Panesar, Bhawandeep
Card, Daren C.
Preston, Matthew
Castoe, Todd A.
Chang, Belinda S.W.
author_sort Schott, Ryan K.
collection PubMed
description Despite continued advances in sequencing technologies, there is a need for methods that can efficiently sequence large numbers of genes from diverse species. One approach to accomplish this is targeted capture (hybrid enrichment). While these methods are well established for genome resequencing projects, cross-species capture strategies are still being developed and generally focus on the capture of conserved regions, rather than complete coding regions from specific genes of interest. The resulting data is thus useful for phylogenetic studies, but the wealth of comparative data that could be used for evolutionary and functional studies is lost. Here, we design and implement a targeted capture method that enables recovery of complete coding regions across broad taxonomic scales. Capture probes were designed from multiple reference species and extensively tiled in order to facilitate cross-species capture. Using novel bioinformatics pipelines we were able to recover nearly all of the targeted genes with high completeness from species that were up to 200 myr divergent. Increased probe diversity and tiling for a subset of genes had a large positive effect on both recovery and completeness. The resulting data produced an accurate species tree, but importantly this same data can also be applied to studies of molecular evolution and function that will allow researchers to ask larger questions in broader phylogenetic contexts. Our method demonstrates the utility of cross-species approaches for the capture of full length coding sequences, and will substantially improve the ability for researchers to conduct large-scale comparative studies of molecular evolution and function.
format Online
Article
Text
id pubmed-5381602
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-53816022017-04-10 Targeted Capture of Complete Coding Regions across Divergent Species Schott, Ryan K. Panesar, Bhawandeep Card, Daren C. Preston, Matthew Castoe, Todd A. Chang, Belinda S.W. Genome Biol Evol Research Article Despite continued advances in sequencing technologies, there is a need for methods that can efficiently sequence large numbers of genes from diverse species. One approach to accomplish this is targeted capture (hybrid enrichment). While these methods are well established for genome resequencing projects, cross-species capture strategies are still being developed and generally focus on the capture of conserved regions, rather than complete coding regions from specific genes of interest. The resulting data is thus useful for phylogenetic studies, but the wealth of comparative data that could be used for evolutionary and functional studies is lost. Here, we design and implement a targeted capture method that enables recovery of complete coding regions across broad taxonomic scales. Capture probes were designed from multiple reference species and extensively tiled in order to facilitate cross-species capture. Using novel bioinformatics pipelines we were able to recover nearly all of the targeted genes with high completeness from species that were up to 200 myr divergent. Increased probe diversity and tiling for a subset of genes had a large positive effect on both recovery and completeness. The resulting data produced an accurate species tree, but importantly this same data can also be applied to studies of molecular evolution and function that will allow researchers to ask larger questions in broader phylogenetic contexts. Our method demonstrates the utility of cross-species approaches for the capture of full length coding sequences, and will substantially improve the ability for researchers to conduct large-scale comparative studies of molecular evolution and function. Oxford University Press 2017-01-30 /pmc/articles/PMC5381602/ /pubmed/28137744 http://dx.doi.org/10.1093/gbe/evx005 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Research Article
Schott, Ryan K.
Panesar, Bhawandeep
Card, Daren C.
Preston, Matthew
Castoe, Todd A.
Chang, Belinda S.W.
Targeted Capture of Complete Coding Regions across Divergent Species
title Targeted Capture of Complete Coding Regions across Divergent Species
title_full Targeted Capture of Complete Coding Regions across Divergent Species
title_fullStr Targeted Capture of Complete Coding Regions across Divergent Species
title_full_unstemmed Targeted Capture of Complete Coding Regions across Divergent Species
title_short Targeted Capture of Complete Coding Regions across Divergent Species
title_sort targeted capture of complete coding regions across divergent species
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5381602/
https://www.ncbi.nlm.nih.gov/pubmed/28137744
http://dx.doi.org/10.1093/gbe/evx005
work_keys_str_mv AT schottryank targetedcaptureofcompletecodingregionsacrossdivergentspecies
AT panesarbhawandeep targetedcaptureofcompletecodingregionsacrossdivergentspecies
AT carddarenc targetedcaptureofcompletecodingregionsacrossdivergentspecies
AT prestonmatthew targetedcaptureofcompletecodingregionsacrossdivergentspecies
AT castoetodda targetedcaptureofcompletecodingregionsacrossdivergentspecies
AT changbelindasw targetedcaptureofcompletecodingregionsacrossdivergentspecies