Thousands of high-quality sequencing samples fail to show meaningful correlation between 5S and 45S ribosomal DNA arrays in humans

The ribosomal RNA genes (rDNA) are tandemly arrayed in most eukaryotes and exhibit vast copy number variation. There is growing interest in integrating this variation into genotype–phenotype associations. Here, we explored a possible association of rDNA copy number variation with autism spectrum dis...

Bibliographic Details
Main authors: Hall, Ashley N., Turner, Tychele N., Queitsch, Christine
Format: Online Article Text
Language: English
Published: Nature Publishing Group UK 2021
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7801704/
https://www.ncbi.nlm.nih.gov/pubmed/33432083
http://dx.doi.org/10.1038/s41598-020-80049-y
author Hall, Ashley N.
Turner, Tychele N.
Queitsch, Christine
collection PubMed
description The ribosomal RNA genes (rDNA) are tandemly arrayed in most eukaryotes and exhibit vast copy number variation. There is growing interest in integrating this variation into genotype–phenotype associations. Here, we explored a possible association of rDNA copy number variation with autism spectrum disorder and found no difference between probands and unaffected siblings. Because short-read sequencing estimates of rDNA copy number are error prone, we sought to validate our 45S estimates. Previous studies reported tightly correlated, concerted copy number variation between the 45S and 5S arrays, which should enable the validation of 45S copy number estimates with pulsed-field gel-verified 5S copy numbers. Here, we show that the previously reported strong concerted copy number variation may be an artifact of variable data quality in the earlier published 1000 Genomes Project sequences. We failed to detect a meaningful correlation between 45S and 5S copy numbers in thousands of samples from the high-coverage Simons Simplex Collection dataset as well as in the recent high-coverage 1000 Genomes Project sequences. Our findings illustrate the challenge of genotyping repetitive DNA regions accurately and call into question the accuracy of recently published studies of rDNA copy number variation in cancer that relied on diverse publicly available resources for sequence data.
format Online
Article
Text
id pubmed-7801704
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-7801704 2021-01-13 Sci Rep Article
Nature Publishing Group UK 2021-01-11 /pmc/articles/PMC7801704/ /pubmed/33432083 http://dx.doi.org/10.1038/s41598-020-80049-y Text en © The Author(s) 2021 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
title Thousands of high-quality sequencing samples fail to show meaningful correlation between 5S and 45S ribosomal DNA arrays in humans
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7801704/
https://www.ncbi.nlm.nih.gov/pubmed/33432083
http://dx.doi.org/10.1038/s41598-020-80049-y