SNP Discovery in European Anchovy (Engraulis encrasicolus, L) by High-Throughput Transcriptome and Genome Sequencing

Increased throughput in sequencing technologies has facilitated the acquisition of detailed genomic information in non-model species. The focus of this research was to discover and validate SNPs derived from the European anchovy (Engraulis encrasicolus) transcriptome, a species with no available ref...

Bibliographic Details
Main authors: Montes, Iratxe; Conklin, Darrell; Albaina, Aitor; Creer, Simon; Carvalho, Gary R.; Santos, María; Estonba, Andone
Format: Online Article Text
Language: English
Published: Public Library of Science 2013
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3731364/
https://www.ncbi.nlm.nih.gov/pubmed/23936375
http://dx.doi.org/10.1371/journal.pone.0070051
_version_ 1782279157327069184
author Montes, Iratxe
Conklin, Darrell
Albaina, Aitor
Creer, Simon
Carvalho, Gary R.
Santos, María
Estonba, Andone
author_facet Montes, Iratxe
Conklin, Darrell
Albaina, Aitor
Creer, Simon
Carvalho, Gary R.
Santos, María
Estonba, Andone
author_sort Montes, Iratxe
collection PubMed
description Increased throughput in sequencing technologies has facilitated the acquisition of detailed genomic information in non-model species. The focus of this research was to discover and validate SNPs derived from the European anchovy (Engraulis encrasicolus) transcriptome, a species with no available reference genome, using next generation sequencing technologies. A cDNA library was constructed from four tissues of ten fish individuals corresponding to three populations of E. encrasicolus, and Roche 454 GS FLX Titanium sequencing yielded 19,367 contigs. Additionally, the European anchovy genome was sequenced for the same ten individuals using an Illumina HiSeq2000. Using a computational pipeline for combining transcriptome and genome information, a total of 18,994 SNPs met the necessary minor allele frequency and depth filters. A series of further stringent filters were applied to identify those SNPs likely to succeed in genotyping assays, and for filtering of those in potential duplicated genome regions. A novel method for detecting potential intron-exon boundaries in areas of putative SNPs has also been applied in silico to improve genotyping success. In all, 2,317 filtered putative transcriptome SNPs suitable for genotyping primer design were identified. From those, a subset of 530 were selected, with the genotyping results showing the highest conversion and validation rates (91.3% and 83.2%, respectively) reported to date for a non-model species. This study represents a promising strategy to discover genotypable SNPs in the exome of non-model organisms. The genomic resource generated for E. encrasicolus, both in terms of sequences and novel markers, will be informative for research into this species with applications including traceability studies, population genetic analyses and aquaculture.
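The description mentions that candidate SNPs had to meet minor allele frequency and depth filters, but this record does not give the thresholds or the software used. The following is only a minimal sketch of that kind of filter: the `CandidateSnp` type, the read-count-based frequency estimate, and the default cutoffs (`min_depth=8`, `min_maf=0.1`) are all hypothetical and are not taken from the article.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class CandidateSnp:
    """Hypothetical record for one biallelic candidate SNP."""
    contig: str
    position: int
    ref_count: int  # reads supporting the reference allele
    alt_count: int  # reads supporting the alternative allele

    @property
    def depth(self) -> int:
        # Total read depth at the site.
        return self.ref_count + self.alt_count

    @property
    def minor_allele_frequency(self) -> float:
        # Assumption: frequency is approximated from read counts alone.
        if self.depth == 0:
            return 0.0
        return min(self.ref_count, self.alt_count) / self.depth


def filter_snps(snps: List[CandidateSnp],
                min_depth: int = 8,      # hypothetical cutoff
                min_maf: float = 0.1     # hypothetical cutoff
                ) -> List[CandidateSnp]:
    """Keep SNPs that meet both the depth and the frequency cutoff."""
    return [
        s for s in snps
        if s.depth >= min_depth and s.minor_allele_frequency >= min_maf
    ]


candidates = [
    CandidateSnp("contig_1", 10, ref_count=20, alt_count=5),   # passes both
    CandidateSnp("contig_1", 50, ref_count=3, alt_count=2),    # too shallow
    CandidateSnp("contig_2", 7, ref_count=99, alt_count=1),    # allele too rare
]
kept = filter_snps(candidates)
print([(s.contig, s.position) for s in kept])
```

In a real pipeline the counts would usually come from a variant-calling output rather than hand-built records, and the cutoffs would be tuned to the sequencing depth of the study.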
format Online
Article
Text
id pubmed-3731364
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-3731364 2013-08-09 SNP Discovery in European Anchovy (Engraulis encrasicolus, L) by High-Throughput Transcriptome and Genome Sequencing Montes, Iratxe Conklin, Darrell Albaina, Aitor Creer, Simon Carvalho, Gary R. Santos, María Estonba, Andone PLoS One Research Article Increased throughput in sequencing technologies has facilitated the acquisition of detailed genomic information in non-model species. The focus of this research was to discover and validate SNPs derived from the European anchovy (Engraulis encrasicolus) transcriptome, a species with no available reference genome, using next generation sequencing technologies. A cDNA library was constructed from four tissues of ten fish individuals corresponding to three populations of E. encrasicolus, and Roche 454 GS FLX Titanium sequencing yielded 19,367 contigs. Additionally, the European anchovy genome was sequenced for the same ten individuals using an Illumina HiSeq2000. Using a computational pipeline for combining transcriptome and genome information, a total of 18,994 SNPs met the necessary minor allele frequency and depth filters. A series of further stringent filters were applied to identify those SNPs likely to succeed in genotyping assays, and for filtering of those in potential duplicated genome regions. A novel method for detecting potential intron-exon boundaries in areas of putative SNPs has also been applied in silico to improve genotyping success. In all, 2,317 filtered putative transcriptome SNPs suitable for genotyping primer design were identified. From those, a subset of 530 were selected, with the genotyping results showing the highest conversion and validation rates (91.3% and 83.2%, respectively) reported to date for a non-model species. This study represents a promising strategy to discover genotypable SNPs in the exome of non-model organisms. The genomic resource generated for E. encrasicolus, both in terms of sequences and novel markers, will be informative for research into this species with applications including traceability studies, population genetic analyses and aquaculture. Public Library of Science 2013-08-01 /pmc/articles/PMC3731364/ /pubmed/23936375 http://dx.doi.org/10.1371/journal.pone.0070051 Text en © 2013 Montes et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Montes, Iratxe
Conklin, Darrell
Albaina, Aitor
Creer, Simon
Carvalho, Gary R.
Santos, María
Estonba, Andone
SNP Discovery in European Anchovy (Engraulis encrasicolus, L) by High-Throughput Transcriptome and Genome Sequencing
title SNP Discovery in European Anchovy (Engraulis encrasicolus, L) by High-Throughput Transcriptome and Genome Sequencing
title_full SNP Discovery in European Anchovy (Engraulis encrasicolus, L) by High-Throughput Transcriptome and Genome Sequencing
title_fullStr SNP Discovery in European Anchovy (Engraulis encrasicolus, L) by High-Throughput Transcriptome and Genome Sequencing
title_full_unstemmed SNP Discovery in European Anchovy (Engraulis encrasicolus, L) by High-Throughput Transcriptome and Genome Sequencing
title_short SNP Discovery in European Anchovy (Engraulis encrasicolus, L) by High-Throughput Transcriptome and Genome Sequencing
title_sort snp discovery in european anchovy (engraulis encrasicolus, l) by high-throughput transcriptome and genome sequencing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3731364/
https://www.ncbi.nlm.nih.gov/pubmed/23936375
http://dx.doi.org/10.1371/journal.pone.0070051
work_keys_str_mv AT montesiratxe snpdiscoveryineuropeananchovyengraulisencrasicoluslbyhighthroughputtranscriptomeandgenomesequencing
AT conklindarrell snpdiscoveryineuropeananchovyengraulisencrasicoluslbyhighthroughputtranscriptomeandgenomesequencing
AT albainaaitor snpdiscoveryineuropeananchovyengraulisencrasicoluslbyhighthroughputtranscriptomeandgenomesequencing
AT creersimon snpdiscoveryineuropeananchovyengraulisencrasicoluslbyhighthroughputtranscriptomeandgenomesequencing
AT carvalhogaryr snpdiscoveryineuropeananchovyengraulisencrasicoluslbyhighthroughputtranscriptomeandgenomesequencing
AT santosmaria snpdiscoveryineuropeananchovyengraulisencrasicoluslbyhighthroughputtranscriptomeandgenomesequencing
AT estonbaandone snpdiscoveryineuropeananchovyengraulisencrasicoluslbyhighthroughputtranscriptomeandgenomesequencing