Cargando…

Successful Recovery of Nuclear Protein-Coding Genes from Small Insects in Museums Using Illumina Sequencing

In this paper we explore high-throughput Illumina sequencing of nuclear protein-coding, ribosomal, and mitochondrial genes in small, dried insects stored in natural history collections. We sequenced one tenebrionid beetle and 12 carabid beetles ranging in size from 3.7 to 9.7 mm in length that have...

Descripción completa

Detalles Bibliográficos
Autores principales: Kanda, Kojun, Pflug, James M., Sproul, John S., Dasenko, Mark A., Maddison, David R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4696846/
https://www.ncbi.nlm.nih.gov/pubmed/26716693
http://dx.doi.org/10.1371/journal.pone.0143929
_version_ 1782407845284675584
author Kanda, Kojun
Pflug, James M.
Sproul, John S.
Dasenko, Mark A.
Maddison, David R.
author_facet Kanda, Kojun
Pflug, James M.
Sproul, John S.
Dasenko, Mark A.
Maddison, David R.
author_sort Kanda, Kojun
collection PubMed
description In this paper we explore high-throughput Illumina sequencing of nuclear protein-coding, ribosomal, and mitochondrial genes in small, dried insects stored in natural history collections. We sequenced one tenebrionid beetle and 12 carabid beetles ranging in size from 3.7 to 9.7 mm in length that have been stored in various museums for 4 to 84 years. Although we chose a number of old, small specimens for which we expected low sequence recovery, we successfully recovered at least some low-copy nuclear protein-coding genes from all specimens. For example, in one 56-year-old beetle, 4.4 mm in length, our de novo assembly recovered about 63% of approximately 41,900 nucleotides in a target suite of 67 nuclear protein-coding gene fragments, and 70% using a reference-based assembly. Even in the least successfully sequenced carabid specimen, reference-based assembly yielded fragments that were at least 50% of the target length for 34 of 67 nuclear protein-coding gene fragments. Exploration of alternative references for reference-based assembly revealed few signs of bias created by the reference. For all specimens we recovered almost complete copies of ribosomal and mitochondrial genes. We verified the general accuracy of the sequences through comparisons with sequences obtained from PCR and Sanger sequencing, including of conspecific, fresh specimens, and through phylogenetic analysis that tested the placement of sequences in predicted regions. A few possible inaccuracies in the sequences were detected, but these rarely affected the phylogenetic placement of the samples. Although our sample sizes are low, an exploratory regression study suggests that the dominant factor in predicting success at recovering nuclear protein-coding genes is a high number of Illumina reads, with success at PCR of COI and killing by immersion in ethanol being secondary factors; in analyses of only high-read samples, the primary significant explanatory variable was body length, with small beetles being more successfully sequenced.
format Online
Article
Text
id pubmed-4696846
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-46968462016-01-13 Successful Recovery of Nuclear Protein-Coding Genes from Small Insects in Museums Using Illumina Sequencing Kanda, Kojun Pflug, James M. Sproul, John S. Dasenko, Mark A. Maddison, David R. PLoS One Research Article In this paper we explore high-throughput Illumina sequencing of nuclear protein-coding, ribosomal, and mitochondrial genes in small, dried insects stored in natural history collections. We sequenced one tenebrionid beetle and 12 carabid beetles ranging in size from 3.7 to 9.7 mm in length that have been stored in various museums for 4 to 84 years. Although we chose a number of old, small specimens for which we expected low sequence recovery, we successfully recovered at least some low-copy nuclear protein-coding genes from all specimens. For example, in one 56-year-old beetle, 4.4 mm in length, our de novo assembly recovered about 63% of approximately 41,900 nucleotides in a target suite of 67 nuclear protein-coding gene fragments, and 70% using a reference-based assembly. Even in the least successfully sequenced carabid specimen, reference-based assembly yielded fragments that were at least 50% of the target length for 34 of 67 nuclear protein-coding gene fragments. Exploration of alternative references for reference-based assembly revealed few signs of bias created by the reference. For all specimens we recovered almost complete copies of ribosomal and mitochondrial genes. We verified the general accuracy of the sequences through comparisons with sequences obtained from PCR and Sanger sequencing, including of conspecific, fresh specimens, and through phylogenetic analysis that tested the placement of sequences in predicted regions. A few possible inaccuracies in the sequences were detected, but these rarely affected the phylogenetic placement of the samples. Although our sample sizes are low, an exploratory regression study suggests that the dominant factor in predicting success at recovering nuclear protein-coding genes is a high number of Illumina reads, with success at PCR of COI and killing by immersion in ethanol being secondary factors; in analyses of only high-read samples, the primary significant explanatory variable was body length, with small beetles being more successfully sequenced. Public Library of Science 2015-12-30 /pmc/articles/PMC4696846/ /pubmed/26716693 http://dx.doi.org/10.1371/journal.pone.0143929 Text en © 2015 Kanda et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Kanda, Kojun
Pflug, James M.
Sproul, John S.
Dasenko, Mark A.
Maddison, David R.
Successful Recovery of Nuclear Protein-Coding Genes from Small Insects in Museums Using Illumina Sequencing
title Successful Recovery of Nuclear Protein-Coding Genes from Small Insects in Museums Using Illumina Sequencing
title_full Successful Recovery of Nuclear Protein-Coding Genes from Small Insects in Museums Using Illumina Sequencing
title_fullStr Successful Recovery of Nuclear Protein-Coding Genes from Small Insects in Museums Using Illumina Sequencing
title_full_unstemmed Successful Recovery of Nuclear Protein-Coding Genes from Small Insects in Museums Using Illumina Sequencing
title_short Successful Recovery of Nuclear Protein-Coding Genes from Small Insects in Museums Using Illumina Sequencing
title_sort successful recovery of nuclear protein-coding genes from small insects in museums using illumina sequencing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4696846/
https://www.ncbi.nlm.nih.gov/pubmed/26716693
http://dx.doi.org/10.1371/journal.pone.0143929
work_keys_str_mv AT kandakojun successfulrecoveryofnuclearproteincodinggenesfromsmallinsectsinmuseumsusingilluminasequencing
AT pflugjamesm successfulrecoveryofnuclearproteincodinggenesfromsmallinsectsinmuseumsusingilluminasequencing
AT sprouljohns successfulrecoveryofnuclearproteincodinggenesfromsmallinsectsinmuseumsusingilluminasequencing
AT dasenkomarka successfulrecoveryofnuclearproteincodinggenesfromsmallinsectsinmuseumsusingilluminasequencing
AT maddisondavidr successfulrecoveryofnuclearproteincodinggenesfromsmallinsectsinmuseumsusingilluminasequencing