Cargando…
High-resolution structural variants catalogue in a large-scale whole genome sequenced bovine family cohort data
BACKGROUND: Structural variants (SVs) are chromosomal segments that differ between genomes, such as deletions, duplications, insertions, inversions and translocations. The genomics revolution enabled the discovery of sub-microscopic SVs via array and whole-genome sequencing (WGS) data, paving the wa...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10152703/ https://www.ncbi.nlm.nih.gov/pubmed/37127590 http://dx.doi.org/10.1186/s12864-023-09259-8 |
_version_ | 1785035793054564352 |
---|---|
author | Lee, Young-Lim Bosse, Mirte Takeda, Haruko Moreira, Gabriel Costa Monteiro Karim, Latifa Druet, Tom Oget-Ebrad, Claire Coppieters, Wouter Veerkamp, Roel F. Groenen, Martien A. M. Georges, Michel Bouwman, Aniek C. Charlier, Carole |
author_facet | Lee, Young-Lim Bosse, Mirte Takeda, Haruko Moreira, Gabriel Costa Monteiro Karim, Latifa Druet, Tom Oget-Ebrad, Claire Coppieters, Wouter Veerkamp, Roel F. Groenen, Martien A. M. Georges, Michel Bouwman, Aniek C. Charlier, Carole |
author_sort | Lee, Young-Lim |
collection | PubMed |
description | BACKGROUND: Structural variants (SVs) are chromosomal segments that differ between genomes, such as deletions, duplications, insertions, inversions and translocations. The genomics revolution enabled the discovery of sub-microscopic SVs via array and whole-genome sequencing (WGS) data, paving the way to unravel the functional impact of SVs. Recent human expression QTL mapping studies demonstrated that SVs play a disproportionally large role in altering gene expression, underlining the importance of including SVs in genetic analyses. Therefore, this study aimed to generate and explore a high-quality bovine SV catalogue exploiting a unique cattle family cohort data (total 266 samples, forming 127 trios). RESULTS: We curated 13,731 SVs segregating in the population, consisting of 12,201 deletions, 1,509 duplications, and 21 multi-allelic CNVs (> 50-bp). Of these, we validated a subset of copy number variants (CNVs) utilising a direct genotyping approach in an independent cohort, indicating that at least 62% of the CNVs are true variants, segregating in the population. Among gene-disrupting SVs, we prioritised two likely high impact duplications, encompassing ORM1 and POPDC3 genes, respectively. Liver expression QTL mapping results revealed that these duplications are likely causing altered gene expression, confirming the functional importance of SVs. Although most of the accurately genotyped CNVs are tagged by single nucleotide polymorphisms (SNPs) ascertained in WGS data, most CNVs were not captured by individual SNPs obtained from a 50K genotyping array. CONCLUSION: We generated a high-quality SV catalogue exploiting unique whole genome sequenced bovine family cohort data. Two high impact duplications upregulating the ORM1 and POPDC3 are putative candidates for postpartum feed intake and hoof health traits, thus warranting further investigation. Generally, CNVs were in low LD with SNPs on the 50K array. Hence, it remains crucial to incorporate CNVs via means other than tagging SNPs, such as investigation of tagging haplotypes, direct imputation of CNVs, or direct genotyping as done in the current study. The SV catalogue and the custom genotyping array generated in the current study will serve as valuable resources accelerating utilisation of full spectrum of genetic variants in bovine genomes. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-023-09259-8. |
format | Online Article Text |
id | pubmed-10152703 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-101527032023-05-03 High-resolution structural variants catalogue in a large-scale whole genome sequenced bovine family cohort data Lee, Young-Lim Bosse, Mirte Takeda, Haruko Moreira, Gabriel Costa Monteiro Karim, Latifa Druet, Tom Oget-Ebrad, Claire Coppieters, Wouter Veerkamp, Roel F. Groenen, Martien A. M. Georges, Michel Bouwman, Aniek C. Charlier, Carole BMC Genomics Research BACKGROUND: Structural variants (SVs) are chromosomal segments that differ between genomes, such as deletions, duplications, insertions, inversions and translocations. The genomics revolution enabled the discovery of sub-microscopic SVs via array and whole-genome sequencing (WGS) data, paving the way to unravel the functional impact of SVs. Recent human expression QTL mapping studies demonstrated that SVs play a disproportionally large role in altering gene expression, underlining the importance of including SVs in genetic analyses. Therefore, this study aimed to generate and explore a high-quality bovine SV catalogue exploiting a unique cattle family cohort data (total 266 samples, forming 127 trios). RESULTS: We curated 13,731 SVs segregating in the population, consisting of 12,201 deletions, 1,509 duplications, and 21 multi-allelic CNVs (> 50-bp). Of these, we validated a subset of copy number variants (CNVs) utilising a direct genotyping approach in an independent cohort, indicating that at least 62% of the CNVs are true variants, segregating in the population. Among gene-disrupting SVs, we prioritised two likely high impact duplications, encompassing ORM1 and POPDC3 genes, respectively. Liver expression QTL mapping results revealed that these duplications are likely causing altered gene expression, confirming the functional importance of SVs. Although most of the accurately genotyped CNVs are tagged by single nucleotide polymorphisms (SNPs) ascertained in WGS data, most CNVs were not captured by individual SNPs obtained from a 50K genotyping array. CONCLUSION: We generated a high-quality SV catalogue exploiting unique whole genome sequenced bovine family cohort data. Two high impact duplications upregulating the ORM1 and POPDC3 are putative candidates for postpartum feed intake and hoof health traits, thus warranting further investigation. Generally, CNVs were in low LD with SNPs on the 50K array. Hence, it remains crucial to incorporate CNVs via means other than tagging SNPs, such as investigation of tagging haplotypes, direct imputation of CNVs, or direct genotyping as done in the current study. The SV catalogue and the custom genotyping array generated in the current study will serve as valuable resources accelerating utilisation of full spectrum of genetic variants in bovine genomes. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-023-09259-8. BioMed Central 2023-05-01 /pmc/articles/PMC10152703/ /pubmed/37127590 http://dx.doi.org/10.1186/s12864-023-09259-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Lee, Young-Lim Bosse, Mirte Takeda, Haruko Moreira, Gabriel Costa Monteiro Karim, Latifa Druet, Tom Oget-Ebrad, Claire Coppieters, Wouter Veerkamp, Roel F. Groenen, Martien A. M. Georges, Michel Bouwman, Aniek C. Charlier, Carole High-resolution structural variants catalogue in a large-scale whole genome sequenced bovine family cohort data |
title | High-resolution structural variants catalogue in a large-scale whole genome sequenced bovine family cohort data |
title_full | High-resolution structural variants catalogue in a large-scale whole genome sequenced bovine family cohort data |
title_fullStr | High-resolution structural variants catalogue in a large-scale whole genome sequenced bovine family cohort data |
title_full_unstemmed | High-resolution structural variants catalogue in a large-scale whole genome sequenced bovine family cohort data |
title_short | High-resolution structural variants catalogue in a large-scale whole genome sequenced bovine family cohort data |
title_sort | high-resolution structural variants catalogue in a large-scale whole genome sequenced bovine family cohort data |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10152703/ https://www.ncbi.nlm.nih.gov/pubmed/37127590 http://dx.doi.org/10.1186/s12864-023-09259-8 |
work_keys_str_mv | AT leeyounglim highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata AT bossemirte highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata AT takedaharuko highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata AT moreiragabrielcostamonteiro highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata AT karimlatifa highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata AT druettom highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata AT ogetebradclaire highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata AT coppieterswouter highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata AT veerkamproelf highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata AT groenenmartienam highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata AT georgesmichel highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata AT bouwmananiekc highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata AT charliercarole highresolutionstructuralvariantscatalogueinalargescalewholegenomesequencedbovinefamilycohortdata |