Cargando…

High Satellite Repeat Turnover in Great Apes Studied with Short- and Long-Read Technologies

Satellite repeats are a structural component of centromeres and telomeres, and in some instances, their divergence is known to drive speciation. Due to their highly repetitive nature, satellite sequences have been understudied and underrepresented in genome assemblies. To investigate their turnover...

Descripción completa

Detalles Bibliográficos
Autores principales: Cechova, Monika, Harris, Robert S, Tomaszkiewicz, Marta, Arbeithuber, Barbara, Chiaromonte, Francesca, Makova, Kateryna D
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6805231/
https://www.ncbi.nlm.nih.gov/pubmed/31273383
http://dx.doi.org/10.1093/molbev/msz156
_version_ 1783461334002696192
author Cechova, Monika
Harris, Robert S
Tomaszkiewicz, Marta
Arbeithuber, Barbara
Chiaromonte, Francesca
Makova, Kateryna D
author_facet Cechova, Monika
Harris, Robert S
Tomaszkiewicz, Marta
Arbeithuber, Barbara
Chiaromonte, Francesca
Makova, Kateryna D
author_sort Cechova, Monika
collection PubMed
description Satellite repeats are a structural component of centromeres and telomeres, and in some instances, their divergence is known to drive speciation. Due to their highly repetitive nature, satellite sequences have been understudied and underrepresented in genome assemblies. To investigate their turnover in great apes, we studied satellite repeats of unit sizes up to 50 bp in human, chimpanzee, bonobo, gorilla, and Sumatran and Bornean orangutans, using unassembled short and long sequencing reads. The density of satellite repeats, as identified from accurate short reads (Illumina), varied greatly among great ape genomes. These were dominated by a handful of abundant repeated motifs, frequently shared among species, which formed two groups: 1) the (AATGG)(n) repeat (critical for heat shock response) and its derivatives; and 2) subtelomeric 32-mers involved in telomeric metabolism. Using the densities of abundant repeats, individuals could be classified into species. However, clustering did not reproduce the accepted species phylogeny, suggesting rapid repeat evolution. Several abundant repeats were enriched in males versus females; using Y chromosome assemblies or Fluorescent In Situ Hybridization, we validated their location on the Y. Finally, applying a novel computational tool, we identified many satellite repeats completely embedded within long Oxford Nanopore and Pacific Biosciences reads. Such repeats were up to 59 kb in length and consisted of perfect repeats interspersed with other similar sequences. Our results based on sequencing reads generated with three different technologies provide the first detailed characterization of great ape satellite repeats, and open new avenues for exploring their functions.
format Online
Article
Text
id pubmed-6805231
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-68052312019-10-25 High Satellite Repeat Turnover in Great Apes Studied with Short- and Long-Read Technologies Cechova, Monika Harris, Robert S Tomaszkiewicz, Marta Arbeithuber, Barbara Chiaromonte, Francesca Makova, Kateryna D Mol Biol Evol Discoveries Satellite repeats are a structural component of centromeres and telomeres, and in some instances, their divergence is known to drive speciation. Due to their highly repetitive nature, satellite sequences have been understudied and underrepresented in genome assemblies. To investigate their turnover in great apes, we studied satellite repeats of unit sizes up to 50 bp in human, chimpanzee, bonobo, gorilla, and Sumatran and Bornean orangutans, using unassembled short and long sequencing reads. The density of satellite repeats, as identified from accurate short reads (Illumina), varied greatly among great ape genomes. These were dominated by a handful of abundant repeated motifs, frequently shared among species, which formed two groups: 1) the (AATGG)(n) repeat (critical for heat shock response) and its derivatives; and 2) subtelomeric 32-mers involved in telomeric metabolism. Using the densities of abundant repeats, individuals could be classified into species. However, clustering did not reproduce the accepted species phylogeny, suggesting rapid repeat evolution. Several abundant repeats were enriched in males versus females; using Y chromosome assemblies or Fluorescent In Situ Hybridization, we validated their location on the Y. Finally, applying a novel computational tool, we identified many satellite repeats completely embedded within long Oxford Nanopore and Pacific Biosciences reads. Such repeats were up to 59 kb in length and consisted of perfect repeats interspersed with other similar sequences. Our results based on sequencing reads generated with three different technologies provide the first detailed characterization of great ape satellite repeats, and open new avenues for exploring their functions. Oxford University Press 2019-11 2019-07-02 /pmc/articles/PMC6805231/ /pubmed/31273383 http://dx.doi.org/10.1093/molbev/msz156 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Discoveries
Cechova, Monika
Harris, Robert S
Tomaszkiewicz, Marta
Arbeithuber, Barbara
Chiaromonte, Francesca
Makova, Kateryna D
High Satellite Repeat Turnover in Great Apes Studied with Short- and Long-Read Technologies
title High Satellite Repeat Turnover in Great Apes Studied with Short- and Long-Read Technologies
title_full High Satellite Repeat Turnover in Great Apes Studied with Short- and Long-Read Technologies
title_fullStr High Satellite Repeat Turnover in Great Apes Studied with Short- and Long-Read Technologies
title_full_unstemmed High Satellite Repeat Turnover in Great Apes Studied with Short- and Long-Read Technologies
title_short High Satellite Repeat Turnover in Great Apes Studied with Short- and Long-Read Technologies
title_sort high satellite repeat turnover in great apes studied with short- and long-read technologies
topic Discoveries
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6805231/
https://www.ncbi.nlm.nih.gov/pubmed/31273383
http://dx.doi.org/10.1093/molbev/msz156
work_keys_str_mv AT cechovamonika highsatelliterepeatturnoveringreatapesstudiedwithshortandlongreadtechnologies
AT harrisroberts highsatelliterepeatturnoveringreatapesstudiedwithshortandlongreadtechnologies
AT tomaszkiewiczmarta highsatelliterepeatturnoveringreatapesstudiedwithshortandlongreadtechnologies
AT arbeithuberbarbara highsatelliterepeatturnoveringreatapesstudiedwithshortandlongreadtechnologies
AT chiaromontefrancesca highsatelliterepeatturnoveringreatapesstudiedwithshortandlongreadtechnologies
AT makovakaterynad highsatelliterepeatturnoveringreatapesstudiedwithshortandlongreadtechnologies