Cargando…

Concerning the eXclusion in human genomics: The choice of sex chromosome representation in the human genome drastically affects number of identified variants

Over the past 30 years, a community of scientists have pieced together every base pair of the human reference genome from telomere-to-telomere. Interestingly, most human genomics studies omit more than 5% of the genome from their analyses. Under ‘normal’ circumstances, omitting any chromosome(s) fro...

Descripción completa

Detalles Bibliográficos
Autores principales: Pinto, Brendan J., O’Connor, Brian, Schatz, Michael C., Zarate, Samantha, Wilson, Melissa A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9980147/
https://www.ncbi.nlm.nih.gov/pubmed/36865318
http://dx.doi.org/10.1101/2023.02.22.529542
_version_ 1784899857004101632
author Pinto, Brendan J.
O’Connor, Brian
Schatz, Michael C.
Zarate, Samantha
Wilson, Melissa A.
author_facet Pinto, Brendan J.
O’Connor, Brian
Schatz, Michael C.
Zarate, Samantha
Wilson, Melissa A.
author_sort Pinto, Brendan J.
collection PubMed
description Over the past 30 years, a community of scientists have pieced together every base pair of the human reference genome from telomere-to-telomere. Interestingly, most human genomics studies omit more than 5% of the genome from their analyses. Under ‘normal’ circumstances, omitting any chromosome(s) from analysis of the human genome would be reason for concern—the exception being the sex chromosomes. Sex chromosomes in eutherians share an evolutionary origin as an ancestral pair of autosomes. In humans, they share three regions of high sequence identity (~98–100%), which—along with the unique transmission patterns of the sex chromosomes—introduce technical artifacts into genomic analyses. However, the human X chromosome bears numerous important genes—including more “immune response” genes than any other chromosome—which makes its exclusion irresponsible when sex differences across human diseases are widespread. To better characterize the effect that including/excluding the X chromosome may have on variants called, we conducted a pilot study on the Terra cloud platform to replicate a subset of standard genomic practices using both the CHM13 reference genome and sex chromosome complement-aware (SCC-aware) reference genome. We compared quality of variant calling, expression quantification, and allele-specific expression using these two reference genome versions across 50 human samples from the Genotype-Tissue-Expression consortium annotated as females. We found that after correction, the whole X chromosome (100%) can generate reliable variant calls—allowing for the inclusion of the whole genome in human genomics analyses as a departure from the status quo of omitting the sex chromosomes from empirical and clinical genomics studies.
format Online
Article
Text
id pubmed-9980147
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-99801472023-03-03 Concerning the eXclusion in human genomics: The choice of sex chromosome representation in the human genome drastically affects number of identified variants Pinto, Brendan J. O’Connor, Brian Schatz, Michael C. Zarate, Samantha Wilson, Melissa A. bioRxiv Article Over the past 30 years, a community of scientists have pieced together every base pair of the human reference genome from telomere-to-telomere. Interestingly, most human genomics studies omit more than 5% of the genome from their analyses. Under ‘normal’ circumstances, omitting any chromosome(s) from analysis of the human genome would be reason for concern—the exception being the sex chromosomes. Sex chromosomes in eutherians share an evolutionary origin as an ancestral pair of autosomes. In humans, they share three regions of high sequence identity (~98–100%), which—along with the unique transmission patterns of the sex chromosomes—introduce technical artifacts into genomic analyses. However, the human X chromosome bears numerous important genes—including more “immune response” genes than any other chromosome—which makes its exclusion irresponsible when sex differences across human diseases are widespread. To better characterize the effect that including/excluding the X chromosome may have on variants called, we conducted a pilot study on the Terra cloud platform to replicate a subset of standard genomic practices using both the CHM13 reference genome and sex chromosome complement-aware (SCC-aware) reference genome. We compared quality of variant calling, expression quantification, and allele-specific expression using these two reference genome versions across 50 human samples from the Genotype-Tissue-Expression consortium annotated as females. We found that after correction, the whole X chromosome (100%) can generate reliable variant calls—allowing for the inclusion of the whole genome in human genomics analyses as a departure from the status quo of omitting the sex chromosomes from empirical and clinical genomics studies. Cold Spring Harbor Laboratory 2023-02-22 /pmc/articles/PMC9980147/ /pubmed/36865318 http://dx.doi.org/10.1101/2023.02.22.529542 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Pinto, Brendan J.
O’Connor, Brian
Schatz, Michael C.
Zarate, Samantha
Wilson, Melissa A.
Concerning the eXclusion in human genomics: The choice of sex chromosome representation in the human genome drastically affects number of identified variants
title Concerning the eXclusion in human genomics: The choice of sex chromosome representation in the human genome drastically affects number of identified variants
title_full Concerning the eXclusion in human genomics: The choice of sex chromosome representation in the human genome drastically affects number of identified variants
title_fullStr Concerning the eXclusion in human genomics: The choice of sex chromosome representation in the human genome drastically affects number of identified variants
title_full_unstemmed Concerning the eXclusion in human genomics: The choice of sex chromosome representation in the human genome drastically affects number of identified variants
title_short Concerning the eXclusion in human genomics: The choice of sex chromosome representation in the human genome drastically affects number of identified variants
title_sort concerning the exclusion in human genomics: the choice of sex chromosome representation in the human genome drastically affects number of identified variants
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9980147/
https://www.ncbi.nlm.nih.gov/pubmed/36865318
http://dx.doi.org/10.1101/2023.02.22.529542
work_keys_str_mv AT pintobrendanj concerningtheexclusioninhumangenomicsthechoiceofsexchromosomerepresentationinthehumangenomedrasticallyaffectsnumberofidentifiedvariants
AT oconnorbrian concerningtheexclusioninhumangenomicsthechoiceofsexchromosomerepresentationinthehumangenomedrasticallyaffectsnumberofidentifiedvariants
AT schatzmichaelc concerningtheexclusioninhumangenomicsthechoiceofsexchromosomerepresentationinthehumangenomedrasticallyaffectsnumberofidentifiedvariants
AT zaratesamantha concerningtheexclusioninhumangenomicsthechoiceofsexchromosomerepresentationinthehumangenomedrasticallyaffectsnumberofidentifiedvariants
AT wilsonmelissaa concerningtheexclusioninhumangenomicsthechoiceofsexchromosomerepresentationinthehumangenomedrasticallyaffectsnumberofidentifiedvariants