Cargando…

Single-molecule sequencing resolves the detailed structure of complex satellite DNA loci in Drosophila melanogaster

Highly repetitive satellite DNA (satDNA) repeats are found in most eukaryotic genomes. SatDNAs are rapidly evolving and have roles in genome stability and chromosome segregation. Their repetitive nature poses a challenge for genome assembly and makes progress on the detailed study of satDNA structur...

Descripción completa

Detalles Bibliográficos
Autores principales: Khost, Daniel E., Eickbush, Danna G., Larracuente, Amanda M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5411766/
https://www.ncbi.nlm.nih.gov/pubmed/28373483
http://dx.doi.org/10.1101/gr.213512.116
_version_ 1783232861539663872
author Khost, Daniel E.
Eickbush, Danna G.
Larracuente, Amanda M.
author_facet Khost, Daniel E.
Eickbush, Danna G.
Larracuente, Amanda M.
author_sort Khost, Daniel E.
collection PubMed
description Highly repetitive satellite DNA (satDNA) repeats are found in most eukaryotic genomes. SatDNAs are rapidly evolving and have roles in genome stability and chromosome segregation. Their repetitive nature poses a challenge for genome assembly and makes progress on the detailed study of satDNA structure difficult. Here, we use single-molecule sequencing long reads from Pacific Biosciences (PacBio) to determine the detailed structure of all major autosomal complex satDNA loci in Drosophila melanogaster, with a particular focus on the 260-bp and Responder satellites. We determine the optimal de novo assembly methods and parameter combinations required to produce a high-quality assembly of these previously unassembled satDNA loci and validate this assembly using molecular and computational approaches. We determined that the computationally intensive PBcR-BLASR assembly pipeline yielded better assemblies than the faster and more efficient pipelines based on the MHAP hashing algorithm, and it is essential to validate assemblies of repetitive loci. The assemblies reveal that satDNA repeats are organized into large arrays interrupted by transposable elements. The repeats in the center of the array tend to be homogenized in sequence, suggesting that gene conversion and unequal crossovers lead to repeat homogenization through concerted evolution, although the degree of unequal crossing over may differ among complex satellite loci. We find evidence for higher-order structure within satDNA arrays that suggest recent structural rearrangements. These assemblies provide a platform for the evolutionary and functional genomics of satDNAs in pericentric heterochromatin.
format Online
Article
Text
id pubmed-5411766
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-54117662017-05-16 Single-molecule sequencing resolves the detailed structure of complex satellite DNA loci in Drosophila melanogaster Khost, Daniel E. Eickbush, Danna G. Larracuente, Amanda M. Genome Res Research Highly repetitive satellite DNA (satDNA) repeats are found in most eukaryotic genomes. SatDNAs are rapidly evolving and have roles in genome stability and chromosome segregation. Their repetitive nature poses a challenge for genome assembly and makes progress on the detailed study of satDNA structure difficult. Here, we use single-molecule sequencing long reads from Pacific Biosciences (PacBio) to determine the detailed structure of all major autosomal complex satDNA loci in Drosophila melanogaster, with a particular focus on the 260-bp and Responder satellites. We determine the optimal de novo assembly methods and parameter combinations required to produce a high-quality assembly of these previously unassembled satDNA loci and validate this assembly using molecular and computational approaches. We determined that the computationally intensive PBcR-BLASR assembly pipeline yielded better assemblies than the faster and more efficient pipelines based on the MHAP hashing algorithm, and it is essential to validate assemblies of repetitive loci. The assemblies reveal that satDNA repeats are organized into large arrays interrupted by transposable elements. The repeats in the center of the array tend to be homogenized in sequence, suggesting that gene conversion and unequal crossovers lead to repeat homogenization through concerted evolution, although the degree of unequal crossing over may differ among complex satellite loci. We find evidence for higher-order structure within satDNA arrays that suggest recent structural rearrangements. These assemblies provide a platform for the evolutionary and functional genomics of satDNAs in pericentric heterochromatin. Cold Spring Harbor Laboratory Press 2017-05 /pmc/articles/PMC5411766/ /pubmed/28373483 http://dx.doi.org/10.1101/gr.213512.116 Text en © 2017 Khost et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article, published in Genome Research, is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Research
Khost, Daniel E.
Eickbush, Danna G.
Larracuente, Amanda M.
Single-molecule sequencing resolves the detailed structure of complex satellite DNA loci in Drosophila melanogaster
title Single-molecule sequencing resolves the detailed structure of complex satellite DNA loci in Drosophila melanogaster
title_full Single-molecule sequencing resolves the detailed structure of complex satellite DNA loci in Drosophila melanogaster
title_fullStr Single-molecule sequencing resolves the detailed structure of complex satellite DNA loci in Drosophila melanogaster
title_full_unstemmed Single-molecule sequencing resolves the detailed structure of complex satellite DNA loci in Drosophila melanogaster
title_short Single-molecule sequencing resolves the detailed structure of complex satellite DNA loci in Drosophila melanogaster
title_sort single-molecule sequencing resolves the detailed structure of complex satellite dna loci in drosophila melanogaster
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5411766/
https://www.ncbi.nlm.nih.gov/pubmed/28373483
http://dx.doi.org/10.1101/gr.213512.116
work_keys_str_mv AT khostdaniele singlemoleculesequencingresolvesthedetailedstructureofcomplexsatellitednalociindrosophilamelanogaster
AT eickbushdannag singlemoleculesequencingresolvesthedetailedstructureofcomplexsatellitednalociindrosophilamelanogaster
AT larracuenteamandam singlemoleculesequencingresolvesthedetailedstructureofcomplexsatellitednalociindrosophilamelanogaster