Cargando…

Gene prediction and annotation in Penstemon (Plantaginaceae): A workflow for marker development from extremely low-coverage genome sequencing(1)

• Premise of the study: Penstemon (Plantaginaceae) is a large and diverse genus endemic to North America. However, determining the phylogenetic relationships among its 280 species has been difficult due to its recent evolutionary radiation. The development of a large, multilocus data set can help to...

Descripción completa

Detalles Bibliográficos
Autores principales: Blischak, Paul D., Wenzel, Aaron J., Wolfe, Andrea D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Botanical Society of America 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4259454/
https://www.ncbi.nlm.nih.gov/pubmed/25506519
http://dx.doi.org/10.3732/apps.1400044
_version_ 1782348019852640256
author Blischak, Paul D.
Wenzel, Aaron J.
Wolfe, Andrea D.
author_facet Blischak, Paul D.
Wenzel, Aaron J.
Wolfe, Andrea D.
author_sort Blischak, Paul D.
collection PubMed
description • Premise of the study: Penstemon (Plantaginaceae) is a large and diverse genus endemic to North America. However, determining the phylogenetic relationships among its 280 species has been difficult due to its recent evolutionary radiation. The development of a large, multilocus data set can help to resolve this challenge. • Methods: Using both previously sequenced genomic libraries and our own low-coverage whole-genome shotgun sequencing libraries, we used the MAKER2 Annotation Pipeline to identify gene regions for the development of sequencing loci from six extremely low-coverage Penstemon genomes (∼0.005×–0.007×). We also compared this approach to BLAST searches, and conducted analyses to characterize sequence divergence across the species sequenced. • Results: Annotations and gene predictions were successfully added to more than 10,000 contigs for potential use in downstream primer design. Primers were then designed for chloroplast, mitochondrial, and nuclear loci from these annotated sequences. MAKER2 identified longer gene regions in all six Penstemon genomes when compared with BLASTN and BLASTX searches. The average level of sequence divergence among the six species was 7.14%. • Discussion: Combining bioinformatics tools into a workflow that produces annotations can be useful for creating potential phylogenetic markers from thousands of sequences even when genome coverage is extremely low and reference data are only available from distant relatives. Furthermore, the output from MAKER2 contains information about important gene features, such as exon boundaries, and can be easily integrated with visualization tools to facilitate the process of marker development.
format Online
Article
Text
id pubmed-4259454
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Botanical Society of America
record_format MEDLINE/PubMed
spelling pubmed-42594542014-12-12 Gene prediction and annotation in Penstemon (Plantaginaceae): A workflow for marker development from extremely low-coverage genome sequencing(1) Blischak, Paul D. Wenzel, Aaron J. Wolfe, Andrea D. Appl Plant Sci Application Article • Premise of the study: Penstemon (Plantaginaceae) is a large and diverse genus endemic to North America. However, determining the phylogenetic relationships among its 280 species has been difficult due to its recent evolutionary radiation. The development of a large, multilocus data set can help to resolve this challenge. • Methods: Using both previously sequenced genomic libraries and our own low-coverage whole-genome shotgun sequencing libraries, we used the MAKER2 Annotation Pipeline to identify gene regions for the development of sequencing loci from six extremely low-coverage Penstemon genomes (∼0.005×–0.007×). We also compared this approach to BLAST searches, and conducted analyses to characterize sequence divergence across the species sequenced. • Results: Annotations and gene predictions were successfully added to more than 10,000 contigs for potential use in downstream primer design. Primers were then designed for chloroplast, mitochondrial, and nuclear loci from these annotated sequences. MAKER2 identified longer gene regions in all six Penstemon genomes when compared with BLASTN and BLASTX searches. The average level of sequence divergence among the six species was 7.14%. • Discussion: Combining bioinformatics tools into a workflow that produces annotations can be useful for creating potential phylogenetic markers from thousands of sequences even when genome coverage is extremely low and reference data are only available from distant relatives. Furthermore, the output from MAKER2 contains information about important gene features, such as exon boundaries, and can be easily integrated with visualization tools to facilitate the process of marker development. Botanical Society of America 2014-12-04 /pmc/articles/PMC4259454/ /pubmed/25506519 http://dx.doi.org/10.3732/apps.1400044 Text en © 2014 Blischak et al. Published by the Botanical Society of America http://creativecommons.org/licenses/by-nc/4.0/ This work is licensed under a Creative Commons Attribution License (CC-BY-NC-SA).
spellingShingle Application Article
Blischak, Paul D.
Wenzel, Aaron J.
Wolfe, Andrea D.
Gene prediction and annotation in Penstemon (Plantaginaceae): A workflow for marker development from extremely low-coverage genome sequencing(1)
title Gene prediction and annotation in Penstemon (Plantaginaceae): A workflow for marker development from extremely low-coverage genome sequencing(1)
title_full Gene prediction and annotation in Penstemon (Plantaginaceae): A workflow for marker development from extremely low-coverage genome sequencing(1)
title_fullStr Gene prediction and annotation in Penstemon (Plantaginaceae): A workflow for marker development from extremely low-coverage genome sequencing(1)
title_full_unstemmed Gene prediction and annotation in Penstemon (Plantaginaceae): A workflow for marker development from extremely low-coverage genome sequencing(1)
title_short Gene prediction and annotation in Penstemon (Plantaginaceae): A workflow for marker development from extremely low-coverage genome sequencing(1)
title_sort gene prediction and annotation in penstemon (plantaginaceae): a workflow for marker development from extremely low-coverage genome sequencing(1)
topic Application Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4259454/
https://www.ncbi.nlm.nih.gov/pubmed/25506519
http://dx.doi.org/10.3732/apps.1400044
work_keys_str_mv AT blischakpauld genepredictionandannotationinpenstemonplantaginaceaeaworkflowformarkerdevelopmentfromextremelylowcoveragegenomesequencing1
AT wenzelaaronj genepredictionandannotationinpenstemonplantaginaceaeaworkflowformarkerdevelopmentfromextremelylowcoveragegenomesequencing1
AT wolfeandread genepredictionandannotationinpenstemonplantaginaceaeaworkflowformarkerdevelopmentfromextremelylowcoveragegenomesequencing1