Cargando…

Virus-derived variation in diverse human genomes

Acquisition of genetic material from viruses by their hosts can generate inter-host structural genome variation. We developed computational tools enabling us to study virus-derived structural variants (SVs) in population-scale whole genome sequencing (WGS) datasets and applied them to 3,332 humans....

Descripción completa

Detalles Bibliográficos
Autores principales: Kojima, Shohei, Kamada, Anselmo Jiro, Parrish, Nicholas F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8101998/
https://www.ncbi.nlm.nih.gov/pubmed/33901175
http://dx.doi.org/10.1371/journal.pgen.1009324
_version_ 1783689041098571776
author Kojima, Shohei
Kamada, Anselmo Jiro
Parrish, Nicholas F.
author_facet Kojima, Shohei
Kamada, Anselmo Jiro
Parrish, Nicholas F.
author_sort Kojima, Shohei
collection PubMed
description Acquisition of genetic material from viruses by their hosts can generate inter-host structural genome variation. We developed computational tools enabling us to study virus-derived structural variants (SVs) in population-scale whole genome sequencing (WGS) datasets and applied them to 3,332 humans. Although SVs had already been cataloged in these subjects, we found previously-overlooked virus-derived SVs. We detected non-germline SVs derived from squirrel monkey retrovirus (SMRV), human immunodeficiency virus 1 (HIV-1), and human T lymphotropic virus (HTLV-1); these variants are attributable to infection of the sequenced lymphoblastoid cell lines (LCLs) or their progenitor cells and may impact gene expression results and the biosafety of experiments using these cells. In addition, we detected new heritable SVs derived from human herpesvirus 6 (HHV-6) and human endogenous retrovirus-K (HERV-K). We report the first solo-direct repeat (DR) HHV-6 likely to reflect DR rearrangement of a known full-length endogenous HHV-6. We used linkage disequilibrium between single nucleotide variants (SNVs) and variants in reads that align to HERV-K, which often cannot be mapped uniquely using conventional short-read sequencing analysis methods, to locate previously-unknown polymorphic HERV-K loci. Some of these loci are tightly linked to trait-associated SNVs, some are in complex genome regions inaccessible by prior methods, and some contain novel HERV-K haplotypes likely derived from gene conversion from an unknown source or introgression. These tools and results broaden our perspective on the coevolution between viruses and humans, including ongoing virus-to-human gene transfer contributing to genetic variation between humans.
format Online
Article
Text
id pubmed-8101998
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-81019982021-05-17 Virus-derived variation in diverse human genomes Kojima, Shohei Kamada, Anselmo Jiro Parrish, Nicholas F. PLoS Genet Research Article Acquisition of genetic material from viruses by their hosts can generate inter-host structural genome variation. We developed computational tools enabling us to study virus-derived structural variants (SVs) in population-scale whole genome sequencing (WGS) datasets and applied them to 3,332 humans. Although SVs had already been cataloged in these subjects, we found previously-overlooked virus-derived SVs. We detected non-germline SVs derived from squirrel monkey retrovirus (SMRV), human immunodeficiency virus 1 (HIV-1), and human T lymphotropic virus (HTLV-1); these variants are attributable to infection of the sequenced lymphoblastoid cell lines (LCLs) or their progenitor cells and may impact gene expression results and the biosafety of experiments using these cells. In addition, we detected new heritable SVs derived from human herpesvirus 6 (HHV-6) and human endogenous retrovirus-K (HERV-K). We report the first solo-direct repeat (DR) HHV-6 likely to reflect DR rearrangement of a known full-length endogenous HHV-6. We used linkage disequilibrium between single nucleotide variants (SNVs) and variants in reads that align to HERV-K, which often cannot be mapped uniquely using conventional short-read sequencing analysis methods, to locate previously-unknown polymorphic HERV-K loci. Some of these loci are tightly linked to trait-associated SNVs, some are in complex genome regions inaccessible by prior methods, and some contain novel HERV-K haplotypes likely derived from gene conversion from an unknown source or introgression. These tools and results broaden our perspective on the coevolution between viruses and humans, including ongoing virus-to-human gene transfer contributing to genetic variation between humans. Public Library of Science 2021-04-26 /pmc/articles/PMC8101998/ /pubmed/33901175 http://dx.doi.org/10.1371/journal.pgen.1009324 Text en © 2021 Kojima et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Kojima, Shohei
Kamada, Anselmo Jiro
Parrish, Nicholas F.
Virus-derived variation in diverse human genomes
title Virus-derived variation in diverse human genomes
title_full Virus-derived variation in diverse human genomes
title_fullStr Virus-derived variation in diverse human genomes
title_full_unstemmed Virus-derived variation in diverse human genomes
title_short Virus-derived variation in diverse human genomes
title_sort virus-derived variation in diverse human genomes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8101998/
https://www.ncbi.nlm.nih.gov/pubmed/33901175
http://dx.doi.org/10.1371/journal.pgen.1009324
work_keys_str_mv AT kojimashohei virusderivedvariationindiversehumangenomes
AT kamadaanselmojiro virusderivedvariationindiversehumangenomes
AT parrishnicholasf virusderivedvariationindiversehumangenomes