Cargando…

Haplotype diversity and sequence heterogeneity of human telomeres

Telomeres are regions of repetitive nucleotide sequences capping the ends of eukaryotic chromosomes that protect against deterioration, and whose lengths can be correlated with age and adverse health risk factors. Yet, given their length and repetitive nature, telomeric regions are not easily recons...

Descripción completa

Detalles Bibliográficos
Autores principales: Grigorev, Kirill, Foox, Jonathan, Bezdan, Daniela, Butler, Daniel, Luxton, Jared J., Reed, Jake, McKenna, Miles J., Taylor, Lynn, George, Kerry A., Meydan, Cem, Bailey, Susan M., Mason, Christopher E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8256856/
https://www.ncbi.nlm.nih.gov/pubmed/34162698
http://dx.doi.org/10.1101/gr.274639.120
_version_ 1783718181563531264
author Grigorev, Kirill
Foox, Jonathan
Bezdan, Daniela
Butler, Daniel
Luxton, Jared J.
Reed, Jake
McKenna, Miles J.
Taylor, Lynn
George, Kerry A.
Meydan, Cem
Bailey, Susan M.
Mason, Christopher E.
author_facet Grigorev, Kirill
Foox, Jonathan
Bezdan, Daniela
Butler, Daniel
Luxton, Jared J.
Reed, Jake
McKenna, Miles J.
Taylor, Lynn
George, Kerry A.
Meydan, Cem
Bailey, Susan M.
Mason, Christopher E.
author_sort Grigorev, Kirill
collection PubMed
description Telomeres are regions of repetitive nucleotide sequences capping the ends of eukaryotic chromosomes that protect against deterioration, and whose lengths can be correlated with age and adverse health risk factors. Yet, given their length and repetitive nature, telomeric regions are not easily reconstructed from short-read sequencing, thus making telomere sequencing, mapping, and variant resolution challenging problems. Recently, long-read sequencing, with read lengths measuring in hundreds of kilobase pairs, has made it possible to routinely read into telomeric regions and inspect their sequence structure. Here, we describe a framework for extracting telomeric reads from whole-genome single-molecule sequencing experiments, including de novo identification of telomere repeat motifs and repeat types, and also describe their sequence variation. We find that long, complex telomeric stretches and repeats can be accurately captured with long-read sequencing, observe extensive sequence heterogeneity of human telomeres, discover and localize noncanonical telomere sequence motifs (both previously reported, as well as novel), and validate them in short-read sequence data. These data reveal extensive intra- and inter-population diversity of repeats in telomeric haplotypes, reveal higher paternal inheritance of telomeric variants, and represent the first motif composition maps of multi-kilobase-pair human telomeric haplotypes across three distinct ancestries (Ashkenazi, Chinese, and Utah), which can aid in future studies of genetic variation, aging, and genome biology.
format Online
Article
Text
id pubmed-8256856
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-82568562021-07-23 Haplotype diversity and sequence heterogeneity of human telomeres Grigorev, Kirill Foox, Jonathan Bezdan, Daniela Butler, Daniel Luxton, Jared J. Reed, Jake McKenna, Miles J. Taylor, Lynn George, Kerry A. Meydan, Cem Bailey, Susan M. Mason, Christopher E. Genome Res Method Telomeres are regions of repetitive nucleotide sequences capping the ends of eukaryotic chromosomes that protect against deterioration, and whose lengths can be correlated with age and adverse health risk factors. Yet, given their length and repetitive nature, telomeric regions are not easily reconstructed from short-read sequencing, thus making telomere sequencing, mapping, and variant resolution challenging problems. Recently, long-read sequencing, with read lengths measuring in hundreds of kilobase pairs, has made it possible to routinely read into telomeric regions and inspect their sequence structure. Here, we describe a framework for extracting telomeric reads from whole-genome single-molecule sequencing experiments, including de novo identification of telomere repeat motifs and repeat types, and also describe their sequence variation. We find that long, complex telomeric stretches and repeats can be accurately captured with long-read sequencing, observe extensive sequence heterogeneity of human telomeres, discover and localize noncanonical telomere sequence motifs (both previously reported, as well as novel), and validate them in short-read sequence data. These data reveal extensive intra- and inter-population diversity of repeats in telomeric haplotypes, reveal higher paternal inheritance of telomeric variants, and represent the first motif composition maps of multi-kilobase-pair human telomeric haplotypes across three distinct ancestries (Ashkenazi, Chinese, and Utah), which can aid in future studies of genetic variation, aging, and genome biology. Cold Spring Harbor Laboratory Press 2021-07 /pmc/articles/PMC8256856/ /pubmed/34162698 http://dx.doi.org/10.1101/gr.274639.120 Text en © 2021 Grigorev et al.; Published by Cold Spring Harbor Laboratory Press https://creativecommons.org/licenses/by-nc/4.0/This article, published in Genome Research, is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Method
Grigorev, Kirill
Foox, Jonathan
Bezdan, Daniela
Butler, Daniel
Luxton, Jared J.
Reed, Jake
McKenna, Miles J.
Taylor, Lynn
George, Kerry A.
Meydan, Cem
Bailey, Susan M.
Mason, Christopher E.
Haplotype diversity and sequence heterogeneity of human telomeres
title Haplotype diversity and sequence heterogeneity of human telomeres
title_full Haplotype diversity and sequence heterogeneity of human telomeres
title_fullStr Haplotype diversity and sequence heterogeneity of human telomeres
title_full_unstemmed Haplotype diversity and sequence heterogeneity of human telomeres
title_short Haplotype diversity and sequence heterogeneity of human telomeres
title_sort haplotype diversity and sequence heterogeneity of human telomeres
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8256856/
https://www.ncbi.nlm.nih.gov/pubmed/34162698
http://dx.doi.org/10.1101/gr.274639.120
work_keys_str_mv AT grigorevkirill haplotypediversityandsequenceheterogeneityofhumantelomeres
AT fooxjonathan haplotypediversityandsequenceheterogeneityofhumantelomeres
AT bezdandaniela haplotypediversityandsequenceheterogeneityofhumantelomeres
AT butlerdaniel haplotypediversityandsequenceheterogeneityofhumantelomeres
AT luxtonjaredj haplotypediversityandsequenceheterogeneityofhumantelomeres
AT reedjake haplotypediversityandsequenceheterogeneityofhumantelomeres
AT mckennamilesj haplotypediversityandsequenceheterogeneityofhumantelomeres
AT taylorlynn haplotypediversityandsequenceheterogeneityofhumantelomeres
AT georgekerrya haplotypediversityandsequenceheterogeneityofhumantelomeres
AT meydancem haplotypediversityandsequenceheterogeneityofhumantelomeres
AT baileysusanm haplotypediversityandsequenceheterogeneityofhumantelomeres
AT masonchristophere haplotypediversityandsequenceheterogeneityofhumantelomeres