Cargando…

Digital Genotyping of Macrosatellites and Multicopy Genes Reveals Novel Biological Functions Associated with Copy Number Variation of Large Tandem Repeats

Tandem repeats are common in eukaryotic genomes, but due to difficulties in assaying them remain poorly studied. Here, we demonstrate the utility of Nanostring technology as a targeted approach to perform accurate measurement of tandem repeats even at extremely high copy number, and apply this techn...

Descripción completa

Detalles Bibliográficos
Autores principales: Brahmachary, Manisha, Guilmatre, Audrey, Quilez, Javier, Hasson, Dan, Borel, Christelle, Warburton, Peter, Sharp, Andrew J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4063668/
https://www.ncbi.nlm.nih.gov/pubmed/24945355
http://dx.doi.org/10.1371/journal.pgen.1004418
_version_ 1782321833809281024
author Brahmachary, Manisha
Guilmatre, Audrey
Quilez, Javier
Hasson, Dan
Borel, Christelle
Warburton, Peter
Sharp, Andrew J.
author_facet Brahmachary, Manisha
Guilmatre, Audrey
Quilez, Javier
Hasson, Dan
Borel, Christelle
Warburton, Peter
Sharp, Andrew J.
author_sort Brahmachary, Manisha
collection PubMed
description Tandem repeats are common in eukaryotic genomes, but due to difficulties in assaying them remain poorly studied. Here, we demonstrate the utility of Nanostring technology as a targeted approach to perform accurate measurement of tandem repeats even at extremely high copy number, and apply this technology to genotype 165 HapMap samples from three different populations and five species of non-human primates. We observed extreme variability in copy number of tandemly repeated genes, with many loci showing 5–10 fold variation in copy number among humans. Many of these loci show hallmarks of genome assembly errors, and the true copy number of many large tandem repeats is significantly under-represented even in the high quality ‘finished’ human reference assembly. Importantly, we demonstrate that most large tandem repeat variations are not tagged by nearby SNPs, and are therefore essentially invisible to SNP-based GWAS approaches. Using association analysis we identify many cis correlations of large tandem repeat variants with nearby gene expression and DNA methylation levels, indicating that variations of tandem repeat length are associated with functional effects on the local genomic environment. This includes an example where expansion of a macrosatellite repeat is associated with increased DNA methylation and suppression of nearby gene expression, suggesting a mechanism termed “repeat induced gene silencing”, which has previously been observed only in transgenic organisms. We also observed multiple signatures consistent with altered selective pressures at tandemly repeated loci, suggesting important biological functions. Our studies show that tandemly repeated loci represent a highly variable fraction of the genome that have been systematically ignored by most previous studies, copy number variation of which can exert functionally significant effects. We suggest that future studies of tandem repeat loci will lead to many novel insights into their role in modulating both genomic and phenotypic diversity.
format Online
Article
Text
id pubmed-4063668
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-40636682014-06-25 Digital Genotyping of Macrosatellites and Multicopy Genes Reveals Novel Biological Functions Associated with Copy Number Variation of Large Tandem Repeats Brahmachary, Manisha Guilmatre, Audrey Quilez, Javier Hasson, Dan Borel, Christelle Warburton, Peter Sharp, Andrew J. PLoS Genet Research Article Tandem repeats are common in eukaryotic genomes, but due to difficulties in assaying them remain poorly studied. Here, we demonstrate the utility of Nanostring technology as a targeted approach to perform accurate measurement of tandem repeats even at extremely high copy number, and apply this technology to genotype 165 HapMap samples from three different populations and five species of non-human primates. We observed extreme variability in copy number of tandemly repeated genes, with many loci showing 5–10 fold variation in copy number among humans. Many of these loci show hallmarks of genome assembly errors, and the true copy number of many large tandem repeats is significantly under-represented even in the high quality ‘finished’ human reference assembly. Importantly, we demonstrate that most large tandem repeat variations are not tagged by nearby SNPs, and are therefore essentially invisible to SNP-based GWAS approaches. Using association analysis we identify many cis correlations of large tandem repeat variants with nearby gene expression and DNA methylation levels, indicating that variations of tandem repeat length are associated with functional effects on the local genomic environment. This includes an example where expansion of a macrosatellite repeat is associated with increased DNA methylation and suppression of nearby gene expression, suggesting a mechanism termed “repeat induced gene silencing”, which has previously been observed only in transgenic organisms. We also observed multiple signatures consistent with altered selective pressures at tandemly repeated loci, suggesting important biological functions. Our studies show that tandemly repeated loci represent a highly variable fraction of the genome that have been systematically ignored by most previous studies, copy number variation of which can exert functionally significant effects. We suggest that future studies of tandem repeat loci will lead to many novel insights into their role in modulating both genomic and phenotypic diversity. Public Library of Science 2014-06-19 /pmc/articles/PMC4063668/ /pubmed/24945355 http://dx.doi.org/10.1371/journal.pgen.1004418 Text en © 2014 Brahmachary et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Brahmachary, Manisha
Guilmatre, Audrey
Quilez, Javier
Hasson, Dan
Borel, Christelle
Warburton, Peter
Sharp, Andrew J.
Digital Genotyping of Macrosatellites and Multicopy Genes Reveals Novel Biological Functions Associated with Copy Number Variation of Large Tandem Repeats
title Digital Genotyping of Macrosatellites and Multicopy Genes Reveals Novel Biological Functions Associated with Copy Number Variation of Large Tandem Repeats
title_full Digital Genotyping of Macrosatellites and Multicopy Genes Reveals Novel Biological Functions Associated with Copy Number Variation of Large Tandem Repeats
title_fullStr Digital Genotyping of Macrosatellites and Multicopy Genes Reveals Novel Biological Functions Associated with Copy Number Variation of Large Tandem Repeats
title_full_unstemmed Digital Genotyping of Macrosatellites and Multicopy Genes Reveals Novel Biological Functions Associated with Copy Number Variation of Large Tandem Repeats
title_short Digital Genotyping of Macrosatellites and Multicopy Genes Reveals Novel Biological Functions Associated with Copy Number Variation of Large Tandem Repeats
title_sort digital genotyping of macrosatellites and multicopy genes reveals novel biological functions associated with copy number variation of large tandem repeats
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4063668/
https://www.ncbi.nlm.nih.gov/pubmed/24945355
http://dx.doi.org/10.1371/journal.pgen.1004418
work_keys_str_mv AT brahmacharymanisha digitalgenotypingofmacrosatellitesandmulticopygenesrevealsnovelbiologicalfunctionsassociatedwithcopynumbervariationoflargetandemrepeats
AT guilmatreaudrey digitalgenotypingofmacrosatellitesandmulticopygenesrevealsnovelbiologicalfunctionsassociatedwithcopynumbervariationoflargetandemrepeats
AT quilezjavier digitalgenotypingofmacrosatellitesandmulticopygenesrevealsnovelbiologicalfunctionsassociatedwithcopynumbervariationoflargetandemrepeats
AT hassondan digitalgenotypingofmacrosatellitesandmulticopygenesrevealsnovelbiologicalfunctionsassociatedwithcopynumbervariationoflargetandemrepeats
AT borelchristelle digitalgenotypingofmacrosatellitesandmulticopygenesrevealsnovelbiologicalfunctionsassociatedwithcopynumbervariationoflargetandemrepeats
AT warburtonpeter digitalgenotypingofmacrosatellitesandmulticopygenesrevealsnovelbiologicalfunctionsassociatedwithcopynumbervariationoflargetandemrepeats
AT sharpandrewj digitalgenotypingofmacrosatellitesandmulticopygenesrevealsnovelbiologicalfunctionsassociatedwithcopynumbervariationoflargetandemrepeats