Cargando…

Identification and characterisation of de novo germline structural variants in two commercial pig lines using trio-based whole genome sequencing

BACKGROUND: De novo mutations arising in the germline are a source of genetic variation and their discovery broadens our understanding of genetic disorders and evolutionary patterns. Although the number of de novo single nucleotide variants (dnSNVs) has been studied in a number of species, relativel...

Descripción completa

Detalles Bibliográficos
Autores principales: Steensma, Marije J., Lee, Y. L., Bouwman, A. C., Pita Barros, C., Derks, M. F.L., Bink, M. C.A.M., Harlizius, B., Huisman, A. E., Crooijmans, R. P.M.A., Groenen, M. A.M., Mulder, H. A., Rochus, C. M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10114323/
https://www.ncbi.nlm.nih.gov/pubmed/37072725
http://dx.doi.org/10.1186/s12864-023-09296-3
_version_ 1785027991182508032
author Steensma, Marije J.
Lee, Y. L.
Bouwman, A. C.
Pita Barros, C.
Derks, M. F.L.
Bink, M. C.A.M.
Harlizius, B.
Huisman, A. E.
Crooijmans, R. P.M.A.
Groenen, M. A.M.
Mulder, H. A.
Rochus, C. M.
author_facet Steensma, Marije J.
Lee, Y. L.
Bouwman, A. C.
Pita Barros, C.
Derks, M. F.L.
Bink, M. C.A.M.
Harlizius, B.
Huisman, A. E.
Crooijmans, R. P.M.A.
Groenen, M. A.M.
Mulder, H. A.
Rochus, C. M.
author_sort Steensma, Marije J.
collection PubMed
description BACKGROUND: De novo mutations arising in the germline are a source of genetic variation and their discovery broadens our understanding of genetic disorders and evolutionary patterns. Although the number of de novo single nucleotide variants (dnSNVs) has been studied in a number of species, relatively little is known about the occurrence of de novo structural variants (dnSVs). In this study, we investigated 37 deeply sequenced pig trios from two commercial lines to identify dnSVs present in the offspring. The identified dnSVs were characterised by identifying their parent of origin, their functional annotations and characterizing sequence homology at the breakpoints. RESULTS: We identified four swine germline dnSVs, all located in intronic regions of protein-coding genes. Our conservative, first estimate of the swine germline dnSV rate is 0.108 (95% CI 0.038–0.255) per generation (one dnSV per nine offspring), detected using short-read sequencing. Two detected dnSVs are clusters of mutations. Mutation cluster 1 contains a de novo duplication, a dnSNV and a de novo deletion. Mutation cluster 2 contains a de novo deletion and three de novo duplications, of which one is inverted. Mutation cluster 2 is 25 kb in size, whereas mutation cluster 1 (197 bp) and the other two individual dnSVs (64 and 573 bp) are smaller. Only mutation cluster 2 could be phased and is located on the paternal haplotype. Mutation cluster 2 originates from both micro-homology as well as non-homology mutation mechanisms, where mutation cluster 1 and the other two dnSVs are caused by mutation mechanisms lacking sequence homology. The 64 bp deletion and mutation cluster 1 were validated through PCR. Lastly, the 64 bp deletion and the 573 bp duplication were validated in sequenced offspring of probands with three generations of sequence data. CONCLUSIONS: Our estimate of 0.108 dnSVs per generation in the swine germline is conservative, due to our small sample size and restricted possibilities of dnSV detection from short-read sequencing. The current study highlights the complexity of dnSVs and shows the potential of breeding programs for pigs and livestock species in general, to provide a suitable population structure for identification and characterisation of dnSVs. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-023-09296-3.
format Online
Article
Text
id pubmed-10114323
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-101143232023-04-20 Identification and characterisation of de novo germline structural variants in two commercial pig lines using trio-based whole genome sequencing Steensma, Marije J. Lee, Y. L. Bouwman, A. C. Pita Barros, C. Derks, M. F.L. Bink, M. C.A.M. Harlizius, B. Huisman, A. E. Crooijmans, R. P.M.A. Groenen, M. A.M. Mulder, H. A. Rochus, C. M. BMC Genomics Research BACKGROUND: De novo mutations arising in the germline are a source of genetic variation and their discovery broadens our understanding of genetic disorders and evolutionary patterns. Although the number of de novo single nucleotide variants (dnSNVs) has been studied in a number of species, relatively little is known about the occurrence of de novo structural variants (dnSVs). In this study, we investigated 37 deeply sequenced pig trios from two commercial lines to identify dnSVs present in the offspring. The identified dnSVs were characterised by identifying their parent of origin, their functional annotations and characterizing sequence homology at the breakpoints. RESULTS: We identified four swine germline dnSVs, all located in intronic regions of protein-coding genes. Our conservative, first estimate of the swine germline dnSV rate is 0.108 (95% CI 0.038–0.255) per generation (one dnSV per nine offspring), detected using short-read sequencing. Two detected dnSVs are clusters of mutations. Mutation cluster 1 contains a de novo duplication, a dnSNV and a de novo deletion. Mutation cluster 2 contains a de novo deletion and three de novo duplications, of which one is inverted. Mutation cluster 2 is 25 kb in size, whereas mutation cluster 1 (197 bp) and the other two individual dnSVs (64 and 573 bp) are smaller. Only mutation cluster 2 could be phased and is located on the paternal haplotype. Mutation cluster 2 originates from both micro-homology as well as non-homology mutation mechanisms, where mutation cluster 1 and the other two dnSVs are caused by mutation mechanisms lacking sequence homology. The 64 bp deletion and mutation cluster 1 were validated through PCR. Lastly, the 64 bp deletion and the 573 bp duplication were validated in sequenced offspring of probands with three generations of sequence data. CONCLUSIONS: Our estimate of 0.108 dnSVs per generation in the swine germline is conservative, due to our small sample size and restricted possibilities of dnSV detection from short-read sequencing. The current study highlights the complexity of dnSVs and shows the potential of breeding programs for pigs and livestock species in general, to provide a suitable population structure for identification and characterisation of dnSVs. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-023-09296-3. BioMed Central 2023-04-18 /pmc/articles/PMC10114323/ /pubmed/37072725 http://dx.doi.org/10.1186/s12864-023-09296-3 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Steensma, Marije J.
Lee, Y. L.
Bouwman, A. C.
Pita Barros, C.
Derks, M. F.L.
Bink, M. C.A.M.
Harlizius, B.
Huisman, A. E.
Crooijmans, R. P.M.A.
Groenen, M. A.M.
Mulder, H. A.
Rochus, C. M.
Identification and characterisation of de novo germline structural variants in two commercial pig lines using trio-based whole genome sequencing
title Identification and characterisation of de novo germline structural variants in two commercial pig lines using trio-based whole genome sequencing
title_full Identification and characterisation of de novo germline structural variants in two commercial pig lines using trio-based whole genome sequencing
title_fullStr Identification and characterisation of de novo germline structural variants in two commercial pig lines using trio-based whole genome sequencing
title_full_unstemmed Identification and characterisation of de novo germline structural variants in two commercial pig lines using trio-based whole genome sequencing
title_short Identification and characterisation of de novo germline structural variants in two commercial pig lines using trio-based whole genome sequencing
title_sort identification and characterisation of de novo germline structural variants in two commercial pig lines using trio-based whole genome sequencing
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10114323/
https://www.ncbi.nlm.nih.gov/pubmed/37072725
http://dx.doi.org/10.1186/s12864-023-09296-3
work_keys_str_mv AT steensmamarijej identificationandcharacterisationofdenovogermlinestructuralvariantsintwocommercialpiglinesusingtriobasedwholegenomesequencing
AT leeyl identificationandcharacterisationofdenovogermlinestructuralvariantsintwocommercialpiglinesusingtriobasedwholegenomesequencing
AT bouwmanac identificationandcharacterisationofdenovogermlinestructuralvariantsintwocommercialpiglinesusingtriobasedwholegenomesequencing
AT pitabarrosc identificationandcharacterisationofdenovogermlinestructuralvariantsintwocommercialpiglinesusingtriobasedwholegenomesequencing
AT derksmfl identificationandcharacterisationofdenovogermlinestructuralvariantsintwocommercialpiglinesusingtriobasedwholegenomesequencing
AT binkmcam identificationandcharacterisationofdenovogermlinestructuralvariantsintwocommercialpiglinesusingtriobasedwholegenomesequencing
AT harliziusb identificationandcharacterisationofdenovogermlinestructuralvariantsintwocommercialpiglinesusingtriobasedwholegenomesequencing
AT huismanae identificationandcharacterisationofdenovogermlinestructuralvariantsintwocommercialpiglinesusingtriobasedwholegenomesequencing
AT crooijmansrpma identificationandcharacterisationofdenovogermlinestructuralvariantsintwocommercialpiglinesusingtriobasedwholegenomesequencing
AT groenenmam identificationandcharacterisationofdenovogermlinestructuralvariantsintwocommercialpiglinesusingtriobasedwholegenomesequencing
AT mulderha identificationandcharacterisationofdenovogermlinestructuralvariantsintwocommercialpiglinesusingtriobasedwholegenomesequencing
AT rochuscm identificationandcharacterisationofdenovogermlinestructuralvariantsintwocommercialpiglinesusingtriobasedwholegenomesequencing