Cargando…
Comprehensive detection of structural variation and transposable element differences between wild type laboratory lineages of C. elegans
Genomic structural variations (SVs) and transposable elements (TEs) can be significant contributors to genome evolution, altered gene expression, and risk of genetic diseases. Recent advancements in long-read sequencing have greatly improved the quality of de novo genome assemblies and enhanced the...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10634987/ https://www.ncbi.nlm.nih.gov/pubmed/37961628 http://dx.doi.org/10.1101/2023.01.13.523974 |
_version_ | 1785146271218008064 |
---|---|
author | Bush, Zachary D. Naftaly, Alice F. S. Dinwiddie, Devin Albers, Cora Hillers, Kenneth J. Libuda, Diana E. |
author_facet | Bush, Zachary D. Naftaly, Alice F. S. Dinwiddie, Devin Albers, Cora Hillers, Kenneth J. Libuda, Diana E. |
author_sort | Bush, Zachary D. |
collection | PubMed |
description | Genomic structural variations (SVs) and transposable elements (TEs) can be significant contributors to genome evolution, altered gene expression, and risk of genetic diseases. Recent advancements in long-read sequencing have greatly improved the quality of de novo genome assemblies and enhanced the detection of sequence variants at the scale of hundreds or thousands of bases. Comparisons between two diverged wild isolates of Caenorhabditis elegans, the Bristol and Hawaiian strains, have been widely utilized in the analysis of small genetic variations. Genetic drift, including SVs and rearrangements of repeated sequences such as TEs, can occur over time from long-term maintenance of wild type isolates within the laboratory. To comprehensively detect both large and small structural variations as well as TEs due to genetic drift, we generated de novo genome assemblies and annotations for each strain from our lab collection using both long- and short-read sequencing and compared our assemblies and annotations with that of other lab wild type strains. Within our lab assemblies, we annotate over 3.1Mb of sequence divergence between the Bristol and Hawaiian isolates: 337,584 SNPs, 94,503 small insertion-deletions (<50bp), and 4,334 structural variations (>50bp). Further, we define the location and movement of specific DNA TEs between N2 Bristol and CB4856 Hawaiian wild type isolates. Specifically, we find the N2 Bristol genome has 20.6% more TEs from the Tc1/mariner family than the CB4856 Hawaiian genome. Moreover, we identified Zator elements as the most abundant and mobile TE family in the genome. Using specific TE sequences with unique SNPs, we also identify 38 TEs that moved intrachromosomally and 9 TEs that moved interchromosomally between the N2 Bristol and CB4856 Hawaiian genomes. By comparing the de novo genome assembly of our lab collection Bristol isolate to the VC2010 Bristol assembly, we also reveal that lab lineages display over 2 Mb of total variation: 1,162 SNPs, 1,528 indels, and 897 SVs with 95% of the variation due to SVs. Overall, our work demonstrates the unique contribution of SVs and TEs to variation and genetic drift between wild type laboratory strains assumed to be isogenic despite growing evidence of genetic drift and phenotypic variation. |
format | Online Article Text |
id | pubmed-10634987 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-106349872023-11-13 Comprehensive detection of structural variation and transposable element differences between wild type laboratory lineages of C. elegans Bush, Zachary D. Naftaly, Alice F. S. Dinwiddie, Devin Albers, Cora Hillers, Kenneth J. Libuda, Diana E. bioRxiv Article Genomic structural variations (SVs) and transposable elements (TEs) can be significant contributors to genome evolution, altered gene expression, and risk of genetic diseases. Recent advancements in long-read sequencing have greatly improved the quality of de novo genome assemblies and enhanced the detection of sequence variants at the scale of hundreds or thousands of bases. Comparisons between two diverged wild isolates of Caenorhabditis elegans, the Bristol and Hawaiian strains, have been widely utilized in the analysis of small genetic variations. Genetic drift, including SVs and rearrangements of repeated sequences such as TEs, can occur over time from long-term maintenance of wild type isolates within the laboratory. To comprehensively detect both large and small structural variations as well as TEs due to genetic drift, we generated de novo genome assemblies and annotations for each strain from our lab collection using both long- and short-read sequencing and compared our assemblies and annotations with that of other lab wild type strains. Within our lab assemblies, we annotate over 3.1Mb of sequence divergence between the Bristol and Hawaiian isolates: 337,584 SNPs, 94,503 small insertion-deletions (<50bp), and 4,334 structural variations (>50bp). Further, we define the location and movement of specific DNA TEs between N2 Bristol and CB4856 Hawaiian wild type isolates. Specifically, we find the N2 Bristol genome has 20.6% more TEs from the Tc1/mariner family than the CB4856 Hawaiian genome. Moreover, we identified Zator elements as the most abundant and mobile TE family in the genome. Using specific TE sequences with unique SNPs, we also identify 38 TEs that moved intrachromosomally and 9 TEs that moved interchromosomally between the N2 Bristol and CB4856 Hawaiian genomes. By comparing the de novo genome assembly of our lab collection Bristol isolate to the VC2010 Bristol assembly, we also reveal that lab lineages display over 2 Mb of total variation: 1,162 SNPs, 1,528 indels, and 897 SVs with 95% of the variation due to SVs. Overall, our work demonstrates the unique contribution of SVs and TEs to variation and genetic drift between wild type laboratory strains assumed to be isogenic despite growing evidence of genetic drift and phenotypic variation. Cold Spring Harbor Laboratory 2023-11-03 /pmc/articles/PMC10634987/ /pubmed/37961628 http://dx.doi.org/10.1101/2023.01.13.523974 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use. |
spellingShingle | Article Bush, Zachary D. Naftaly, Alice F. S. Dinwiddie, Devin Albers, Cora Hillers, Kenneth J. Libuda, Diana E. Comprehensive detection of structural variation and transposable element differences between wild type laboratory lineages of C. elegans |
title | Comprehensive detection of structural variation and transposable element differences between wild type laboratory lineages of C. elegans |
title_full | Comprehensive detection of structural variation and transposable element differences between wild type laboratory lineages of C. elegans |
title_fullStr | Comprehensive detection of structural variation and transposable element differences between wild type laboratory lineages of C. elegans |
title_full_unstemmed | Comprehensive detection of structural variation and transposable element differences between wild type laboratory lineages of C. elegans |
title_short | Comprehensive detection of structural variation and transposable element differences between wild type laboratory lineages of C. elegans |
title_sort | comprehensive detection of structural variation and transposable element differences between wild type laboratory lineages of c. elegans |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10634987/ https://www.ncbi.nlm.nih.gov/pubmed/37961628 http://dx.doi.org/10.1101/2023.01.13.523974 |
work_keys_str_mv | AT bushzacharyd comprehensivedetectionofstructuralvariationandtransposableelementdifferencesbetweenwildtypelaboratorylineagesofcelegans AT naftalyalicefs comprehensivedetectionofstructuralvariationandtransposableelementdifferencesbetweenwildtypelaboratorylineagesofcelegans AT dinwiddiedevin comprehensivedetectionofstructuralvariationandtransposableelementdifferencesbetweenwildtypelaboratorylineagesofcelegans AT alberscora comprehensivedetectionofstructuralvariationandtransposableelementdifferencesbetweenwildtypelaboratorylineagesofcelegans AT hillerskennethj comprehensivedetectionofstructuralvariationandtransposableelementdifferencesbetweenwildtypelaboratorylineagesofcelegans AT libudadianae comprehensivedetectionofstructuralvariationandtransposableelementdifferencesbetweenwildtypelaboratorylineagesofcelegans |