Cargando…

High-resolution structural variants catalogue in a large-scale whole genome sequenced bovine family cohort data

BACKGROUND: Structural variants (SVs) are chromosomal segments that differ between genomes, such as deletions, duplications, insertions, inversions and translocations. The genomics revolution enabled the discovery of sub-microscopic SVs via array and whole-genome sequencing (WGS) data, paving the wa...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Young-Lim, Bosse, Mirte, Takeda, Haruko, Moreira, Gabriel Costa Monteiro, Karim, Latifa, Druet, Tom, Oget-Ebrad, Claire, Coppieters, Wouter, Veerkamp, Roel F., Groenen, Martien A. M., Georges, Michel, Bouwman, Aniek C., Charlier, Carole
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10152703/
https://www.ncbi.nlm.nih.gov/pubmed/37127590
http://dx.doi.org/10.1186/s12864-023-09259-8
_version_ 1785035793054564352
author Lee, Young-Lim
Bosse, Mirte
Takeda, Haruko
Moreira, Gabriel Costa Monteiro
Karim, Latifa
Druet, Tom
Oget-Ebrad, Claire
Coppieters, Wouter
Veerkamp, Roel F.
Groenen, Martien A. M.
Georges, Michel
Bouwman, Aniek C.
Charlier, Carole
author_facet Lee, Young-Lim
Bosse, Mirte
Takeda, Haruko
Moreira, Gabriel Costa Monteiro
Karim, Latifa
Druet, Tom
Oget-Ebrad, Claire
Coppieters, Wouter
Veerkamp, Roel F.
Groenen, Martien A. M.
Georges, Michel
Bouwman, Aniek C.
Charlier, Carole
author_sort Lee, Young-Lim
collection PubMed
description BACKGROUND: Structural variants (SVs) are chromosomal segments that differ between genomes, such as deletions, duplications, insertions, inversions and translocations. The genomics revolution enabled the discovery of sub-microscopic SVs via array and whole-genome sequencing (WGS) data, paving the way to unravel the functional impact of SVs. Recent human expression QTL mapping studies demonstrated that SVs play a disproportionally large role in altering gene expression, underlining the importance of including SVs in genetic analyses. Therefore, this study aimed to generate and explore a high-quality bovine SV catalogue exploiting a unique cattle family cohort data (total 266 samples, forming 127 trios). RESULTS: We curated 13,731 SVs segregating in the population, consisting of 12,201 deletions, 1,509 duplications, and 21 multi-allelic CNVs (> 50-bp). Of these, we validated a subset of copy number variants (CNVs) utilising a direct genotyping approach in an independent cohort, indicating that at least 62% of the CNVs are true variants, segregating in the population. Among gene-disrupting SVs, we prioritised two likely high impact duplications, encompassing ORM1 and POPDC3 genes, respectively. Liver expression QTL mapping results revealed that these duplications are likely causing altered gene expression, confirming the functional importance of SVs. Although most of the accurately genotyped CNVs are tagged by single nucleotide polymorphisms (SNPs) ascertained in WGS data, most CNVs were not captured by individual SNPs obtained from a 50K genotyping array. CONCLUSION: We generated a high-quality SV catalogue exploiting unique whole genome sequenced bovine family cohort data. Two high impact duplications upregulating the ORM1 and POPDC3 are putative candidates for postpartum feed intake and hoof health traits, thus warranting further investigation. Generally, CNVs were in low LD with SNPs on the 50K array. Hence, it remains crucial to incorporate CNVs via means other than tagging SNPs, such as investigation of tagging haplotypes, direct imputation of CNVs, or direct genotyping as done in the current study. The SV catalogue and the custom genotyping array generated in the current study will serve as valuable resources accelerating utilisation of full spectrum of genetic variants in bovine genomes. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-023-09259-8.
format Online
Article
Text
id pubmed-10152703
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-101527032023-05-03 High-resolution structural variants catalogue in a large-scale whole genome sequenced bovine family cohort data Lee, Young-Lim Bosse, Mirte Takeda, Haruko Moreira, Gabriel Costa Monteiro Karim, Latifa Druet, Tom Oget-Ebrad, Claire Coppieters, Wouter Veerkamp, Roel F. Groenen, Martien A. M. Georges, Michel Bouwman, Aniek C. Charlier, Carole BMC Genomics Research BACKGROUND: Structural variants (SVs) are chromosomal segments that differ between genomes, such as deletions, duplications, insertions, inversions and translocations. The genomics revolution enabled the discovery of sub-microscopic SVs via array and whole-genome sequencing (WGS) data, paving the way to unravel the functional impact of SVs. Recent human expression QTL mapping studies demonstrated that SVs play a disproportionally large role in altering gene expression, underlining the importance of including SVs in genetic analyses. Therefore, this study aimed to generate and explore a high-quality bovine SV catalogue exploiting a unique cattle family cohort data (total 266 samples, forming 127 trios). RESULTS: We curated 13,731 SVs segregating in the population, consisting of 12,201 deletions, 1,509 duplications, and 21 multi-allelic CNVs (> 50-bp). Of these, we validated a subset of copy number variants (CNVs) utilising a direct genotyping approach in an independent cohort, indicating that at least 62% of the CNVs are true variants, segregating in the population. Among gene-disrupting SVs, we prioritised two likely high impact duplications, encompassing ORM1 and POPDC3 genes, respectively. Liver expression QTL mapping results revealed that these duplications are likely causing altered gene expression, confirming the functional importance of SVs. Although most of the accurately genotyped CNVs are tagged by single nucleotide polymorphisms (SNPs) ascertained in WGS data, most CNVs were not captured by individual SNPs obtained from a 50K genotyping array. CONCLUSION: We generated a high-quality SV catalogue exploiting unique whole genome sequenced bovine family cohort data. Two high impact duplications upregulating the ORM1 and POPDC3 are putative candidates for postpartum feed intake and hoof health traits, thus warranting further investigation. Generally, CNVs were in low LD with SNPs on the 50K array. Hence, it remains crucial to incorporate CNVs via means other than tagging SNPs, such as investigation of tagging haplotypes, direct imputation of CNVs, or direct genotyping as done in the current study. The SV catalogue and the custom genotyping array generated in the current study will serve as valuable resources accelerating utilisation of full spectrum of genetic variants in bovine genomes. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-023-09259-8. BioMed Central 2023-05-01 /pmc/articles/PMC10152703/ /pubmed/37127590 http://dx.doi.org/10.1186/s12864-023-09259-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Lee, Young-Lim
Bosse, Mirte
Takeda, Haruko
Moreira, Gabriel Costa Monteiro
Karim, Latifa
Druet, Tom
Oget-Ebrad, Claire
Coppieters, Wouter
Veerkamp, Roel F.
Groenen, Martien A. M.
Georges, Michel
Bouwman, Aniek C.
Charlier, Carole
High-resolution structural variants catalogue in a large-scale whole genome sequenced bovine family cohort data
title High-resolution structural variants catalogue in a large-scale whole genome sequenced bovine family cohort data
title_full High-resolution structural variants catalogue in a large-scale whole genome sequenced bovine family cohort data
title_fullStr High-resolution structural variants catalogue in a large-scale whole genome sequenced bovine family cohort data
title_full_unstemmed High-resolution structural variants catalogue in a large-scale whole genome sequenced bovine family cohort data
title_short High-resolution structural variants catalogue in a large-scale whole genome sequenced bovine family cohort data
title_sort high-resolution structural variants catalogue in a large-scale whole genome sequenced bovine family cohort data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10152703/
https://www.ncbi.nlm.nih.gov/pubmed/37127590
http://dx.doi.org/10.1186/s12864-023-09259-8
work_keys_str_mv AT leeyounglim highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata
AT bossemirte highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata
AT takedaharuko highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata
AT moreiragabrielcostamonteiro highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata
AT karimlatifa highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata
AT druettom highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata
AT ogetebradclaire highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata
AT coppieterswouter highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata
AT veerkamproelf highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata
AT groenenmartienam highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata
AT georgesmichel highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata
AT bouwmananiekc highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata
AT charliercarole highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata