Cargando…

Approaches to variant discovery for conifer transcriptome sequencing

There is a wide diversity of bioinformatic tools available for the assembly of next generation sequence and subsequence variant calling to identify genetic markers at scale. Integration of genomics tools such as genomic selection, association studies, pedigree analysis and analysis of genetic divers...

Descripción completa

Detalles Bibliográficos
Autores principales: Telfer, Emily, Graham, Natalie, Macdonald, Lucy, Sturrock, Shane, Wilcox, Phillip, Stanbra, Lisa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6218030/
https://www.ncbi.nlm.nih.gov/pubmed/30395612
http://dx.doi.org/10.1371/journal.pone.0205835
_version_ 1783368384586448896
author Telfer, Emily
Graham, Natalie
Macdonald, Lucy
Sturrock, Shane
Wilcox, Phillip
Stanbra, Lisa
author_facet Telfer, Emily
Graham, Natalie
Macdonald, Lucy
Sturrock, Shane
Wilcox, Phillip
Stanbra, Lisa
author_sort Telfer, Emily
collection PubMed
description There is a wide diversity of bioinformatic tools available for the assembly of next generation sequence and subsequence variant calling to identify genetic markers at scale. Integration of genomics tools such as genomic selection, association studies, pedigree analysis and analysis of genetic diversity, into operational breeding is a goal for New Zealand’s most widely planted exotic tree species, Pinus radiata. In the absence of full reference genomes for large megagenomes such as in conifers, RNA sequencing in a range of genotypes and tissue types, offers a rich source of genetic markers for downstream application. We compared nine different assembler and variant calling software combinations in a single transcriptomic library and found that Single Nucleotide Polymorphism (SNPs) discovery could vary by as much as an order of magnitude (8,061 SNPs up to 86,815 SNPs). The assembler with the best realignment of the packages trialled, Trinity, in combination with several variant callers was then applied to a much larger multi-genotype, multi-tissue transcriptome and identified 683,135 in silico SNPs across a predicted 449,951 exons when mapped to the Pinus taeda ver 1.01e reference.
format Online
Article
Text
id pubmed-6218030
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-62180302018-11-19 Approaches to variant discovery for conifer transcriptome sequencing Telfer, Emily Graham, Natalie Macdonald, Lucy Sturrock, Shane Wilcox, Phillip Stanbra, Lisa PLoS One Research Article There is a wide diversity of bioinformatic tools available for the assembly of next generation sequence and subsequence variant calling to identify genetic markers at scale. Integration of genomics tools such as genomic selection, association studies, pedigree analysis and analysis of genetic diversity, into operational breeding is a goal for New Zealand’s most widely planted exotic tree species, Pinus radiata. In the absence of full reference genomes for large megagenomes such as in conifers, RNA sequencing in a range of genotypes and tissue types, offers a rich source of genetic markers for downstream application. We compared nine different assembler and variant calling software combinations in a single transcriptomic library and found that Single Nucleotide Polymorphism (SNPs) discovery could vary by as much as an order of magnitude (8,061 SNPs up to 86,815 SNPs). The assembler with the best realignment of the packages trialled, Trinity, in combination with several variant callers was then applied to a much larger multi-genotype, multi-tissue transcriptome and identified 683,135 in silico SNPs across a predicted 449,951 exons when mapped to the Pinus taeda ver 1.01e reference. Public Library of Science 2018-11-05 /pmc/articles/PMC6218030/ /pubmed/30395612 http://dx.doi.org/10.1371/journal.pone.0205835 Text en © 2018 Telfer et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Telfer, Emily
Graham, Natalie
Macdonald, Lucy
Sturrock, Shane
Wilcox, Phillip
Stanbra, Lisa
Approaches to variant discovery for conifer transcriptome sequencing
title Approaches to variant discovery for conifer transcriptome sequencing
title_full Approaches to variant discovery for conifer transcriptome sequencing
title_fullStr Approaches to variant discovery for conifer transcriptome sequencing
title_full_unstemmed Approaches to variant discovery for conifer transcriptome sequencing
title_short Approaches to variant discovery for conifer transcriptome sequencing
title_sort approaches to variant discovery for conifer transcriptome sequencing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6218030/
https://www.ncbi.nlm.nih.gov/pubmed/30395612
http://dx.doi.org/10.1371/journal.pone.0205835
work_keys_str_mv AT telferemily approachestovariantdiscoveryforconifertranscriptomesequencing
AT grahamnatalie approachestovariantdiscoveryforconifertranscriptomesequencing
AT macdonaldlucy approachestovariantdiscoveryforconifertranscriptomesequencing
AT sturrockshane approachestovariantdiscoveryforconifertranscriptomesequencing
AT wilcoxphillip approachestovariantdiscoveryforconifertranscriptomesequencing
AT stanbralisa approachestovariantdiscoveryforconifertranscriptomesequencing