Cargando…

Reference-Free Population Genomics from Next-Generation Transcriptome Data and the Vertebrate–Invertebrate Gap

In animals, the population genomic literature is dominated by two taxa, namely mammals and drosophilids, in which fully sequenced, well-annotated genomes have been available for years. Data from other metazoan phyla are scarce, probably because the vast majority of living species still lack a closel...

Descripción completa

Detalles Bibliográficos
Autores principales: Gayral, Philippe, Melo-Ferreira, José, Glémin, Sylvain, Bierne, Nicolas, Carneiro, Miguel, Nabholz, Benoit, Lourenco, Joao M., Alves, Paulo C., Ballenghien, Marion, Faivre, Nicolas, Belkhir, Khalid, Cahais, Vincent, Loire, Etienne, Bernard, Aurélien, Galtier, Nicolas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3623758/
https://www.ncbi.nlm.nih.gov/pubmed/23593039
http://dx.doi.org/10.1371/journal.pgen.1003457
_version_ 1782265962015227904
author Gayral, Philippe
Melo-Ferreira, José
Glémin, Sylvain
Bierne, Nicolas
Carneiro, Miguel
Nabholz, Benoit
Lourenco, Joao M.
Alves, Paulo C.
Ballenghien, Marion
Faivre, Nicolas
Belkhir, Khalid
Cahais, Vincent
Loire, Etienne
Bernard, Aurélien
Galtier, Nicolas
author_facet Gayral, Philippe
Melo-Ferreira, José
Glémin, Sylvain
Bierne, Nicolas
Carneiro, Miguel
Nabholz, Benoit
Lourenco, Joao M.
Alves, Paulo C.
Ballenghien, Marion
Faivre, Nicolas
Belkhir, Khalid
Cahais, Vincent
Loire, Etienne
Bernard, Aurélien
Galtier, Nicolas
author_sort Gayral, Philippe
collection PubMed
description In animals, the population genomic literature is dominated by two taxa, namely mammals and drosophilids, in which fully sequenced, well-annotated genomes have been available for years. Data from other metazoan phyla are scarce, probably because the vast majority of living species still lack a closely related reference genome. Here we achieve de novo, reference-free population genomic analysis from wild samples in five non-model animal species, based on next-generation sequencing transcriptome data. We introduce a pipe-line for cDNA assembly, read mapping, SNP/genotype calling, and data cleaning, with specific focus on the issue of hidden paralogy detection. In two species for which a reference genome is available, similar results were obtained whether the reference was used or not, demonstrating the robustness of our de novo inferences. The population genomic profile of a hare, a turtle, an oyster, a tunicate, and a termite were found to be intermediate between those of human and Drosophila, indicating that the discordant genomic diversity patterns that have been reported between these two species do not reflect a generalized vertebrate versus invertebrate gap. The genomic average diversity was generally higher in invertebrates than in vertebrates (with the notable exception of termite), in agreement with the notion that population size tends to be larger in the former than in the latter. The non-synonymous to synonymous ratio, however, did not differ significantly between vertebrates and invertebrates, even though it was negatively correlated with genetic diversity within each of the two groups. This study opens promising perspective regarding genome-wide population analyses of non-model organisms and the influence of population size on non-synonymous versus synonymous diversity.
format Online
Article
Text
id pubmed-3623758
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-36237582013-04-16 Reference-Free Population Genomics from Next-Generation Transcriptome Data and the Vertebrate–Invertebrate Gap Gayral, Philippe Melo-Ferreira, José Glémin, Sylvain Bierne, Nicolas Carneiro, Miguel Nabholz, Benoit Lourenco, Joao M. Alves, Paulo C. Ballenghien, Marion Faivre, Nicolas Belkhir, Khalid Cahais, Vincent Loire, Etienne Bernard, Aurélien Galtier, Nicolas PLoS Genet Research Article In animals, the population genomic literature is dominated by two taxa, namely mammals and drosophilids, in which fully sequenced, well-annotated genomes have been available for years. Data from other metazoan phyla are scarce, probably because the vast majority of living species still lack a closely related reference genome. Here we achieve de novo, reference-free population genomic analysis from wild samples in five non-model animal species, based on next-generation sequencing transcriptome data. We introduce a pipe-line for cDNA assembly, read mapping, SNP/genotype calling, and data cleaning, with specific focus on the issue of hidden paralogy detection. In two species for which a reference genome is available, similar results were obtained whether the reference was used or not, demonstrating the robustness of our de novo inferences. The population genomic profile of a hare, a turtle, an oyster, a tunicate, and a termite were found to be intermediate between those of human and Drosophila, indicating that the discordant genomic diversity patterns that have been reported between these two species do not reflect a generalized vertebrate versus invertebrate gap. The genomic average diversity was generally higher in invertebrates than in vertebrates (with the notable exception of termite), in agreement with the notion that population size tends to be larger in the former than in the latter. The non-synonymous to synonymous ratio, however, did not differ significantly between vertebrates and invertebrates, even though it was negatively correlated with genetic diversity within each of the two groups. This study opens promising perspective regarding genome-wide population analyses of non-model organisms and the influence of population size on non-synonymous versus synonymous diversity. Public Library of Science 2013-04-11 /pmc/articles/PMC3623758/ /pubmed/23593039 http://dx.doi.org/10.1371/journal.pgen.1003457 Text en © 2013 Gayral et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Gayral, Philippe
Melo-Ferreira, José
Glémin, Sylvain
Bierne, Nicolas
Carneiro, Miguel
Nabholz, Benoit
Lourenco, Joao M.
Alves, Paulo C.
Ballenghien, Marion
Faivre, Nicolas
Belkhir, Khalid
Cahais, Vincent
Loire, Etienne
Bernard, Aurélien
Galtier, Nicolas
Reference-Free Population Genomics from Next-Generation Transcriptome Data and the Vertebrate–Invertebrate Gap
title Reference-Free Population Genomics from Next-Generation Transcriptome Data and the Vertebrate–Invertebrate Gap
title_full Reference-Free Population Genomics from Next-Generation Transcriptome Data and the Vertebrate–Invertebrate Gap
title_fullStr Reference-Free Population Genomics from Next-Generation Transcriptome Data and the Vertebrate–Invertebrate Gap
title_full_unstemmed Reference-Free Population Genomics from Next-Generation Transcriptome Data and the Vertebrate–Invertebrate Gap
title_short Reference-Free Population Genomics from Next-Generation Transcriptome Data and the Vertebrate–Invertebrate Gap
title_sort reference-free population genomics from next-generation transcriptome data and the vertebrate–invertebrate gap
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3623758/
https://www.ncbi.nlm.nih.gov/pubmed/23593039
http://dx.doi.org/10.1371/journal.pgen.1003457
work_keys_str_mv AT gayralphilippe referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT meloferreirajose referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT gleminsylvain referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT biernenicolas referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT carneiromiguel referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT nabholzbenoit referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT lourencojoaom referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT alvespauloc referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT ballenghienmarion referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT faivrenicolas referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT belkhirkhalid referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT cahaisvincent referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT loireetienne referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT bernardaurelien referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT galtiernicolas referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap