Cargando…

Reference genome-independent assessment of mutation density using restriction enzyme-phased sequencing

BACKGROUND: The availability of low cost sequencing has spurred its application to discovery and typing of variation, including variation induced by mutagenesis. Mutation discovery is challenging as it requires a substantial amount of sequencing and analysis to detect very rare changes and distingui...

Descripción completa

Detalles Bibliográficos
Autores principales: Monson-Miller, Jennifer, Sanchez-Mendez, Diana C, Fass, Joseph, Henry, Isabelle M, Tai, Thomas H, Comai, Luca
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3305632/
https://www.ncbi.nlm.nih.gov/pubmed/22333298
http://dx.doi.org/10.1186/1471-2164-13-72
_version_ 1782227112880504832
author Monson-Miller, Jennifer
Sanchez-Mendez, Diana C
Fass, Joseph
Henry, Isabelle M
Tai, Thomas H
Comai, Luca
author_facet Monson-Miller, Jennifer
Sanchez-Mendez, Diana C
Fass, Joseph
Henry, Isabelle M
Tai, Thomas H
Comai, Luca
author_sort Monson-Miller, Jennifer
collection PubMed
description BACKGROUND: The availability of low cost sequencing has spurred its application to discovery and typing of variation, including variation induced by mutagenesis. Mutation discovery is challenging as it requires a substantial amount of sequencing and analysis to detect very rare changes and distinguish them from noise. Also challenging are the cases when the organism of interest has not been sequenced or is highly divergent from the reference. RESULTS: We describe the development of a simple method for reduced representation sequencing. Input DNA was digested with a single restriction enzyme and ligated to Y adapters modified to contain a sequence barcode and to provide a compatible overhang for ligation. We demonstrated the efficiency of this method at SNP discovery using rice and arabidopsis. To test its suitability for the discovery of very rare SNP, one control and three mutagenized rice individuals (1, 5 and 10 mM sodium azide) were used to prepare genomic libraries for Illumina sequencers by ligating barcoded adapters to NlaIII restriction sites. For genome-dependent discovery 15-30 million of 80 base reads per individual were aligned to the reference sequence achieving individual sequencing coverage from 7 to 15×. We identified high-confidence base changes by comparing sequences across individuals and identified instances consistent with mutations, i.e. changes that were found in a single treated individual and were solely GC to AT transitions. For genome-independent discovery 70-mers were extracted from the sequence of the control individual and single-copy sequence was identified by comparing the 70-mers across samples to evaluate copy number and variation. This de novo "genome" was used to align the reads and identify mutations as above. Covering approximately 1/5 of the 380 Mb genome of rice we detected mutation densities ranging from 0.6 to 4 per Mb of diploid DNA depending on the mutagenic treatment. CONCLUSIONS: The combination of a simple and cost-effective library construction method, with Illumina sequencing, and the use of a bioinformatic pipeline allows practical SNP discovery regardless of whether a genomic reference is available.
format Online
Article
Text
id pubmed-3305632
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33056322012-03-16 Reference genome-independent assessment of mutation density using restriction enzyme-phased sequencing Monson-Miller, Jennifer Sanchez-Mendez, Diana C Fass, Joseph Henry, Isabelle M Tai, Thomas H Comai, Luca BMC Genomics Methodology Article BACKGROUND: The availability of low cost sequencing has spurred its application to discovery and typing of variation, including variation induced by mutagenesis. Mutation discovery is challenging as it requires a substantial amount of sequencing and analysis to detect very rare changes and distinguish them from noise. Also challenging are the cases when the organism of interest has not been sequenced or is highly divergent from the reference. RESULTS: We describe the development of a simple method for reduced representation sequencing. Input DNA was digested with a single restriction enzyme and ligated to Y adapters modified to contain a sequence barcode and to provide a compatible overhang for ligation. We demonstrated the efficiency of this method at SNP discovery using rice and arabidopsis. To test its suitability for the discovery of very rare SNP, one control and three mutagenized rice individuals (1, 5 and 10 mM sodium azide) were used to prepare genomic libraries for Illumina sequencers by ligating barcoded adapters to NlaIII restriction sites. For genome-dependent discovery 15-30 million of 80 base reads per individual were aligned to the reference sequence achieving individual sequencing coverage from 7 to 15×. We identified high-confidence base changes by comparing sequences across individuals and identified instances consistent with mutations, i.e. changes that were found in a single treated individual and were solely GC to AT transitions. For genome-independent discovery 70-mers were extracted from the sequence of the control individual and single-copy sequence was identified by comparing the 70-mers across samples to evaluate copy number and variation. This de novo "genome" was used to align the reads and identify mutations as above. Covering approximately 1/5 of the 380 Mb genome of rice we detected mutation densities ranging from 0.6 to 4 per Mb of diploid DNA depending on the mutagenic treatment. CONCLUSIONS: The combination of a simple and cost-effective library construction method, with Illumina sequencing, and the use of a bioinformatic pipeline allows practical SNP discovery regardless of whether a genomic reference is available. BioMed Central 2012-02-14 /pmc/articles/PMC3305632/ /pubmed/22333298 http://dx.doi.org/10.1186/1471-2164-13-72 Text en Copyright ©2012 Monson-Miller et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Monson-Miller, Jennifer
Sanchez-Mendez, Diana C
Fass, Joseph
Henry, Isabelle M
Tai, Thomas H
Comai, Luca
Reference genome-independent assessment of mutation density using restriction enzyme-phased sequencing
title Reference genome-independent assessment of mutation density using restriction enzyme-phased sequencing
title_full Reference genome-independent assessment of mutation density using restriction enzyme-phased sequencing
title_fullStr Reference genome-independent assessment of mutation density using restriction enzyme-phased sequencing
title_full_unstemmed Reference genome-independent assessment of mutation density using restriction enzyme-phased sequencing
title_short Reference genome-independent assessment of mutation density using restriction enzyme-phased sequencing
title_sort reference genome-independent assessment of mutation density using restriction enzyme-phased sequencing
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3305632/
https://www.ncbi.nlm.nih.gov/pubmed/22333298
http://dx.doi.org/10.1186/1471-2164-13-72
work_keys_str_mv AT monsonmillerjennifer referencegenomeindependentassessmentofmutationdensityusingrestrictionenzymephasedsequencing
AT sanchezmendezdianac referencegenomeindependentassessmentofmutationdensityusingrestrictionenzymephasedsequencing
AT fassjoseph referencegenomeindependentassessmentofmutationdensityusingrestrictionenzymephasedsequencing
AT henryisabellem referencegenomeindependentassessmentofmutationdensityusingrestrictionenzymephasedsequencing
AT taithomash referencegenomeindependentassessmentofmutationdensityusingrestrictionenzymephasedsequencing
AT comailuca referencegenomeindependentassessmentofmutationdensityusingrestrictionenzymephasedsequencing