
Positional Information Resolves Structural Variations and Uncovers an Evolutionarily Divergent Genetic Locus in Accessions of Arabidopsis thaliana

Bibliographic Details
Main authors: Lai, Alvina G.; Denton-Giles, Matthew; Mueller-Roeber, Bernd; Schippers, Jos H. M.; Dijkwel, Paul P.
Format: Online Article Text
Language: English
Published: Oxford University Press 2011
Subjects: Research Articles
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3157834/
https://www.ncbi.nlm.nih.gov/pubmed/21622917
http://dx.doi.org/10.1093/gbe/evr038
author Lai, Alvina G.
Denton-Giles, Matthew
Mueller-Roeber, Bernd
Schippers, Jos H. M.
Dijkwel, Paul P.
collection PubMed
description Genome sequencing of closely related individuals has yielded valuable insights that link genome evolution to phenotypic variations. However, advancement in sequencing technology has also led to an escalation in the number of poor quality–drafted genomes assembled based on reference genomes that can have highly divergent or haplotypic regions. The self-fertilizing nature of Arabidopsis thaliana poses an advantage to sequencing projects because its genome is mostly homozygous. To determine the accuracy of an Arabidopsis drafted genome in less conserved regions, we performed a resequencing experiment on a ∼371-kb genomic interval in the Landsberg erecta (Ler-0) accession. We identified novel structural variations (SVs) between Ler-0 and the reference accession Col-0 using a long-range polymerase chain reaction approach to generate an Illumina data set that has positional information, that is, a data set with reads that map to a known location. Positional information is important for accurate genome assembly and the resolution of SVs particularly in highly duplicated or repetitive regions. Sixty-one regions with misassembly signatures were identified from the Ler-0 draft, suggesting the presence of novel SVs that are not represented in the draft sequence. Sixty of those were resolved by iterative mapping using our data set. Fifteen large indels (>100 bp) identified from this study were found to be located either within protein-coding regions or upstream regulatory regions, suggesting the formation of novel alleles or altered regulation of existing genes in Ler-0. We propose future genome-sequencing experiments to follow a clone-based approach that incorporates positional information to ultimately reveal haplotype-specific differences between accessions.
format Online
Article
Text
id pubmed-3157834
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-3157834
2011-08-18
Positional Information Resolves Structural Variations and Uncovers an Evolutionarily Divergent Genetic Locus in Accessions of Arabidopsis thaliana
Lai, Alvina G.
Denton-Giles, Matthew
Mueller-Roeber, Bernd
Schippers, Jos H. M.
Dijkwel, Paul P.
Genome Biol Evol
Research Articles
Oxford University Press
2011-05-27
/pmc/articles/PMC3157834/
/pubmed/21622917
http://dx.doi.org/10.1093/gbe/evr038
Text
en
© The Author(s) 2011. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Positional Information Resolves Structural Variations and Uncovers an Evolutionarily Divergent Genetic Locus in Accessions of Arabidopsis thaliana
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3157834/
https://www.ncbi.nlm.nih.gov/pubmed/21622917
http://dx.doi.org/10.1093/gbe/evr038