Cargando…

Organization and Evolution of Primate Centromeric DNA from Whole-Genome Shotgun Sequence Data

The major DNA constituent of primate centromeres is alpha satellite DNA. As much as 2%–5% of sequence generated as part of primate genome sequencing projects consists of this material, which is fragmented or not assembled as part of published genome sequences due to its highly repetitive nature. Her...

Descripción completa

Detalles Bibliográficos
Autores principales: Alkan, Can, Ventura, Mario, Archidiacono, Nicoletta, Rocchi, Mariano, Sahinalp, S. Cenk, Eichler, Evan E
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1994983/
https://www.ncbi.nlm.nih.gov/pubmed/17907796
http://dx.doi.org/10.1371/journal.pcbi.0030181
_version_ 1782135506154291200
author Alkan, Can
Ventura, Mario
Archidiacono, Nicoletta
Rocchi, Mariano
Sahinalp, S. Cenk
Eichler, Evan E
author_facet Alkan, Can
Ventura, Mario
Archidiacono, Nicoletta
Rocchi, Mariano
Sahinalp, S. Cenk
Eichler, Evan E
author_sort Alkan, Can
collection PubMed
description The major DNA constituent of primate centromeres is alpha satellite DNA. As much as 2%–5% of sequence generated as part of primate genome sequencing projects consists of this material, which is fragmented or not assembled as part of published genome sequences due to its highly repetitive nature. Here, we develop computational methods to rapidly recover and categorize alpha-satellite sequences from previously uncharacterized whole-genome shotgun sequence data. We present an algorithm to computationally predict potential higher-order array structure based on paired-end sequence data and then experimentally validate its organization and distribution by experimental analyses. Using whole-genome shotgun data from the human, chimpanzee, and macaque genomes, we examine the phylogenetic relationship of these sequences and provide further support for a model for their evolution and mutation over the last 25 million years. Our results confirm fundamental differences in the dispersal and evolution of centromeric satellites in the Old World monkey and ape lineages of evolution.
format Text
id pubmed-1994983
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-19949832007-09-28 Organization and Evolution of Primate Centromeric DNA from Whole-Genome Shotgun Sequence Data Alkan, Can Ventura, Mario Archidiacono, Nicoletta Rocchi, Mariano Sahinalp, S. Cenk Eichler, Evan E PLoS Comput Biol Research Article The major DNA constituent of primate centromeres is alpha satellite DNA. As much as 2%–5% of sequence generated as part of primate genome sequencing projects consists of this material, which is fragmented or not assembled as part of published genome sequences due to its highly repetitive nature. Here, we develop computational methods to rapidly recover and categorize alpha-satellite sequences from previously uncharacterized whole-genome shotgun sequence data. We present an algorithm to computationally predict potential higher-order array structure based on paired-end sequence data and then experimentally validate its organization and distribution by experimental analyses. Using whole-genome shotgun data from the human, chimpanzee, and macaque genomes, we examine the phylogenetic relationship of these sequences and provide further support for a model for their evolution and mutation over the last 25 million years. Our results confirm fundamental differences in the dispersal and evolution of centromeric satellites in the Old World monkey and ape lineages of evolution. Public Library of Science 2007-09 2007-09-28 /pmc/articles/PMC1994983/ /pubmed/17907796 http://dx.doi.org/10.1371/journal.pcbi.0030181 Text en © 2007 Alkan et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Alkan, Can
Ventura, Mario
Archidiacono, Nicoletta
Rocchi, Mariano
Sahinalp, S. Cenk
Eichler, Evan E
Organization and Evolution of Primate Centromeric DNA from Whole-Genome Shotgun Sequence Data
title Organization and Evolution of Primate Centromeric DNA from Whole-Genome Shotgun Sequence Data
title_full Organization and Evolution of Primate Centromeric DNA from Whole-Genome Shotgun Sequence Data
title_fullStr Organization and Evolution of Primate Centromeric DNA from Whole-Genome Shotgun Sequence Data
title_full_unstemmed Organization and Evolution of Primate Centromeric DNA from Whole-Genome Shotgun Sequence Data
title_short Organization and Evolution of Primate Centromeric DNA from Whole-Genome Shotgun Sequence Data
title_sort organization and evolution of primate centromeric dna from whole-genome shotgun sequence data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1994983/
https://www.ncbi.nlm.nih.gov/pubmed/17907796
http://dx.doi.org/10.1371/journal.pcbi.0030181
work_keys_str_mv AT alkancan organizationandevolutionofprimatecentromericdnafromwholegenomeshotgunsequencedata
AT venturamario organizationandevolutionofprimatecentromericdnafromwholegenomeshotgunsequencedata
AT archidiacononicoletta organizationandevolutionofprimatecentromericdnafromwholegenomeshotgunsequencedata
AT rocchimariano organizationandevolutionofprimatecentromericdnafromwholegenomeshotgunsequencedata
AT sahinalpscenk organizationandevolutionofprimatecentromericdnafromwholegenomeshotgunsequencedata
AT eichlerevane organizationandevolutionofprimatecentromericdnafromwholegenomeshotgunsequencedata