Cargando…

Large-scale analysis of tandem repeat variability in the human genome

Tandem repeats are short DNA sequences that are repeated head-to-tail with a propensity to be variable. They constitute a significant proportion of the human genome, also occurring within coding and regulatory regions. Variation in these repeats can alter the function and/or expression of genes allo...

Descripción completa

Detalles Bibliográficos
Autores principales: Duitama, Jorge, Zablotskaya, Alena, Gemayel, Rita, Jansen, An, Belet, Stefanie, Vermeesch, Joris R., Verstrepen, Kevin J., Froyen, Guy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4027155/
https://www.ncbi.nlm.nih.gov/pubmed/24682812
http://dx.doi.org/10.1093/nar/gku212
_version_ 1782316955136425984
author Duitama, Jorge
Zablotskaya, Alena
Gemayel, Rita
Jansen, An
Belet, Stefanie
Vermeesch, Joris R.
Verstrepen, Kevin J.
Froyen, Guy
author_facet Duitama, Jorge
Zablotskaya, Alena
Gemayel, Rita
Jansen, An
Belet, Stefanie
Vermeesch, Joris R.
Verstrepen, Kevin J.
Froyen, Guy
author_sort Duitama, Jorge
collection PubMed
description Tandem repeats are short DNA sequences that are repeated head-to-tail with a propensity to be variable. They constitute a significant proportion of the human genome, also occurring within coding and regulatory regions. Variation in these repeats can alter the function and/or expression of genes allowing organisms to swiftly adapt to novel environments. Importantly, some repeat expansions have also been linked to certain neurodegenerative diseases. Therefore, accurate sequencing of tandem repeats could contribute to our understanding of common phenotypic variability and might uncover missing genetic factors in idiopathic clinical conditions. However, despite long-standing evidence for the functional role of repeats, they are largely ignored because of technical limitations in sequencing, mapping and typing. Here, we report on a novel capture technique and data filtering protocol that allowed simultaneous sequencing of thousands of tandem repeats in the human genomes of a three generation family using GS-FLX-plus Titanium technology. Our results demonstrated that up to 7.6% of tandem repeats in this family (4% in coding sequences) differ from the reference sequence, and identified a de novo variation in the family tree. The method opens new routes to look at this underappreciated type of genetic variability, including the identification of novel disease-related repeats.
format Online
Article
Text
id pubmed-4027155
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-40271552014-05-28 Large-scale analysis of tandem repeat variability in the human genome Duitama, Jorge Zablotskaya, Alena Gemayel, Rita Jansen, An Belet, Stefanie Vermeesch, Joris R. Verstrepen, Kevin J. Froyen, Guy Nucleic Acids Res Genomics Tandem repeats are short DNA sequences that are repeated head-to-tail with a propensity to be variable. They constitute a significant proportion of the human genome, also occurring within coding and regulatory regions. Variation in these repeats can alter the function and/or expression of genes allowing organisms to swiftly adapt to novel environments. Importantly, some repeat expansions have also been linked to certain neurodegenerative diseases. Therefore, accurate sequencing of tandem repeats could contribute to our understanding of common phenotypic variability and might uncover missing genetic factors in idiopathic clinical conditions. However, despite long-standing evidence for the functional role of repeats, they are largely ignored because of technical limitations in sequencing, mapping and typing. Here, we report on a novel capture technique and data filtering protocol that allowed simultaneous sequencing of thousands of tandem repeats in the human genomes of a three generation family using GS-FLX-plus Titanium technology. Our results demonstrated that up to 7.6% of tandem repeats in this family (4% in coding sequences) differ from the reference sequence, and identified a de novo variation in the family tree. The method opens new routes to look at this underappreciated type of genetic variability, including the identification of novel disease-related repeats. Oxford University Press 2014-05-01 2014-03-20 /pmc/articles/PMC4027155/ /pubmed/24682812 http://dx.doi.org/10.1093/nar/gku212 Text en © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Genomics
Duitama, Jorge
Zablotskaya, Alena
Gemayel, Rita
Jansen, An
Belet, Stefanie
Vermeesch, Joris R.
Verstrepen, Kevin J.
Froyen, Guy
Large-scale analysis of tandem repeat variability in the human genome
title Large-scale analysis of tandem repeat variability in the human genome
title_full Large-scale analysis of tandem repeat variability in the human genome
title_fullStr Large-scale analysis of tandem repeat variability in the human genome
title_full_unstemmed Large-scale analysis of tandem repeat variability in the human genome
title_short Large-scale analysis of tandem repeat variability in the human genome
title_sort large-scale analysis of tandem repeat variability in the human genome
topic Genomics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4027155/
https://www.ncbi.nlm.nih.gov/pubmed/24682812
http://dx.doi.org/10.1093/nar/gku212
work_keys_str_mv AT duitamajorge largescaleanalysisoftandemrepeatvariabilityinthehumangenome
AT zablotskayaalena largescaleanalysisoftandemrepeatvariabilityinthehumangenome
AT gemayelrita largescaleanalysisoftandemrepeatvariabilityinthehumangenome
AT jansenan largescaleanalysisoftandemrepeatvariabilityinthehumangenome
AT beletstefanie largescaleanalysisoftandemrepeatvariabilityinthehumangenome
AT vermeeschjorisr largescaleanalysisoftandemrepeatvariabilityinthehumangenome
AT verstrepenkevinj largescaleanalysisoftandemrepeatvariabilityinthehumangenome
AT froyenguy largescaleanalysisoftandemrepeatvariabilityinthehumangenome