Cargando…

Assembly of a pangenome for global cattle reveals missing sequences and novel structural variations, providing new insights into their diversity and evolutionary history

A cattle pangenome representation was created based on the genome sequences of 898 cattle representing 57 breeds. The pangenome identified 83 Mb of sequence not found in the cattle reference genome, representing 3.1% novel sequence compared with the 2.71-Gb reference. A catalog of structural variant...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Yang, Yang, Lv, Han, Xiaotao, Han, Jiazheng, Hu, Yan, Li, Fan, Xia, Han, Peng, Lingwei, Boschiero, Clarissa, Rosen, Benjamin D., Bickhart, Derek M., Zhang, Shujun, Guo, Aizhen, Van Tassell, Curtis P., Smith, Timothy P.L., Yang, Liguo, Liu, George E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9435747/
https://www.ncbi.nlm.nih.gov/pubmed/35977842
http://dx.doi.org/10.1101/gr.276550.122
_version_ 1784781217710735360
author Zhou, Yang
Yang, Lv
Han, Xiaotao
Han, Jiazheng
Hu, Yan
Li, Fan
Xia, Han
Peng, Lingwei
Boschiero, Clarissa
Rosen, Benjamin D.
Bickhart, Derek M.
Zhang, Shujun
Guo, Aizhen
Van Tassell, Curtis P.
Smith, Timothy P.L.
Yang, Liguo
Liu, George E.
author_facet Zhou, Yang
Yang, Lv
Han, Xiaotao
Han, Jiazheng
Hu, Yan
Li, Fan
Xia, Han
Peng, Lingwei
Boschiero, Clarissa
Rosen, Benjamin D.
Bickhart, Derek M.
Zhang, Shujun
Guo, Aizhen
Van Tassell, Curtis P.
Smith, Timothy P.L.
Yang, Liguo
Liu, George E.
author_sort Zhou, Yang
collection PubMed
description A cattle pangenome representation was created based on the genome sequences of 898 cattle representing 57 breeds. The pangenome identified 83 Mb of sequence not found in the cattle reference genome, representing 3.1% novel sequence compared with the 2.71-Gb reference. A catalog of structural variants developed from this cattle population identified 3.3 million deletions, 0.12 million inversions, and 0.18 million duplications. Estimates of breed ancestry and hybridization between cattle breeds using insertion/deletions as markers were similar to those produced by single nucleotide polymorphism–based analysis. Hundreds of deletions were observed to have stratification based on subspecies and breed. For example, an insertion of a Bov-tA1 repeat element was identified in the first intron of the APPL2 gene and correlated with cattle breed geographic distribution. This insertion falls within a segment overlapping predicted enhancer and promoter regions of the gene, and could affect important traits such as immune response, olfactory functions, cell proliferation, and glucose metabolism in muscle. The results indicate that pangenomes are a valuable resource for studying diversity and evolutionary history, and help to delineate how domestication, trait-based breeding, and adaptive introgression have shaped the cattle genome.
format Online
Article
Text
id pubmed-9435747
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-94357472023-02-01 Assembly of a pangenome for global cattle reveals missing sequences and novel structural variations, providing new insights into their diversity and evolutionary history Zhou, Yang Yang, Lv Han, Xiaotao Han, Jiazheng Hu, Yan Li, Fan Xia, Han Peng, Lingwei Boschiero, Clarissa Rosen, Benjamin D. Bickhart, Derek M. Zhang, Shujun Guo, Aizhen Van Tassell, Curtis P. Smith, Timothy P.L. Yang, Liguo Liu, George E. Genome Res Resource A cattle pangenome representation was created based on the genome sequences of 898 cattle representing 57 breeds. The pangenome identified 83 Mb of sequence not found in the cattle reference genome, representing 3.1% novel sequence compared with the 2.71-Gb reference. A catalog of structural variants developed from this cattle population identified 3.3 million deletions, 0.12 million inversions, and 0.18 million duplications. Estimates of breed ancestry and hybridization between cattle breeds using insertion/deletions as markers were similar to those produced by single nucleotide polymorphism–based analysis. Hundreds of deletions were observed to have stratification based on subspecies and breed. For example, an insertion of a Bov-tA1 repeat element was identified in the first intron of the APPL2 gene and correlated with cattle breed geographic distribution. This insertion falls within a segment overlapping predicted enhancer and promoter regions of the gene, and could affect important traits such as immune response, olfactory functions, cell proliferation, and glucose metabolism in muscle. The results indicate that pangenomes are a valuable resource for studying diversity and evolutionary history, and help to delineate how domestication, trait-based breeding, and adaptive introgression have shaped the cattle genome. Cold Spring Harbor Laboratory Press 2022-08 /pmc/articles/PMC9435747/ /pubmed/35977842 http://dx.doi.org/10.1101/gr.276550.122 Text en © 2022 Zhou et al.; Published by Cold Spring Harbor Laboratory Press https://creativecommons.org/licenses/by-nc/4.0/This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see https://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Resource
Zhou, Yang
Yang, Lv
Han, Xiaotao
Han, Jiazheng
Hu, Yan
Li, Fan
Xia, Han
Peng, Lingwei
Boschiero, Clarissa
Rosen, Benjamin D.
Bickhart, Derek M.
Zhang, Shujun
Guo, Aizhen
Van Tassell, Curtis P.
Smith, Timothy P.L.
Yang, Liguo
Liu, George E.
Assembly of a pangenome for global cattle reveals missing sequences and novel structural variations, providing new insights into their diversity and evolutionary history
title Assembly of a pangenome for global cattle reveals missing sequences and novel structural variations, providing new insights into their diversity and evolutionary history
title_full Assembly of a pangenome for global cattle reveals missing sequences and novel structural variations, providing new insights into their diversity and evolutionary history
title_fullStr Assembly of a pangenome for global cattle reveals missing sequences and novel structural variations, providing new insights into their diversity and evolutionary history
title_full_unstemmed Assembly of a pangenome for global cattle reveals missing sequences and novel structural variations, providing new insights into their diversity and evolutionary history
title_short Assembly of a pangenome for global cattle reveals missing sequences and novel structural variations, providing new insights into their diversity and evolutionary history
title_sort assembly of a pangenome for global cattle reveals missing sequences and novel structural variations, providing new insights into their diversity and evolutionary history
topic Resource
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9435747/
https://www.ncbi.nlm.nih.gov/pubmed/35977842
http://dx.doi.org/10.1101/gr.276550.122
work_keys_str_mv AT zhouyang assemblyofapangenomeforglobalcattlerevealsmissingsequencesandnovelstructuralvariationsprovidingnewinsightsintotheirdiversityandevolutionaryhistory
AT yanglv assemblyofapangenomeforglobalcattlerevealsmissingsequencesandnovelstructuralvariationsprovidingnewinsightsintotheirdiversityandevolutionaryhistory
AT hanxiaotao assemblyofapangenomeforglobalcattlerevealsmissingsequencesandnovelstructuralvariationsprovidingnewinsightsintotheirdiversityandevolutionaryhistory
AT hanjiazheng assemblyofapangenomeforglobalcattlerevealsmissingsequencesandnovelstructuralvariationsprovidingnewinsightsintotheirdiversityandevolutionaryhistory
AT huyan assemblyofapangenomeforglobalcattlerevealsmissingsequencesandnovelstructuralvariationsprovidingnewinsightsintotheirdiversityandevolutionaryhistory
AT lifan assemblyofapangenomeforglobalcattlerevealsmissingsequencesandnovelstructuralvariationsprovidingnewinsightsintotheirdiversityandevolutionaryhistory
AT xiahan assemblyofapangenomeforglobalcattlerevealsmissingsequencesandnovelstructuralvariationsprovidingnewinsightsintotheirdiversityandevolutionaryhistory
AT penglingwei assemblyofapangenomeforglobalcattlerevealsmissingsequencesandnovelstructuralvariationsprovidingnewinsightsintotheirdiversityandevolutionaryhistory
AT boschieroclarissa assemblyofapangenomeforglobalcattlerevealsmissingsequencesandnovelstructuralvariationsprovidingnewinsightsintotheirdiversityandevolutionaryhistory
AT rosenbenjamind assemblyofapangenomeforglobalcattlerevealsmissingsequencesandnovelstructuralvariationsprovidingnewinsightsintotheirdiversityandevolutionaryhistory
AT bickhartderekm assemblyofapangenomeforglobalcattlerevealsmissingsequencesandnovelstructuralvariationsprovidingnewinsightsintotheirdiversityandevolutionaryhistory
AT zhangshujun assemblyofapangenomeforglobalcattlerevealsmissingsequencesandnovelstructuralvariationsprovidingnewinsightsintotheirdiversityandevolutionaryhistory
AT guoaizhen assemblyofapangenomeforglobalcattlerevealsmissingsequencesandnovelstructuralvariationsprovidingnewinsightsintotheirdiversityandevolutionaryhistory
AT vantassellcurtisp assemblyofapangenomeforglobalcattlerevealsmissingsequencesandnovelstructuralvariationsprovidingnewinsightsintotheirdiversityandevolutionaryhistory
AT smithtimothypl assemblyofapangenomeforglobalcattlerevealsmissingsequencesandnovelstructuralvariationsprovidingnewinsightsintotheirdiversityandevolutionaryhistory
AT yangliguo assemblyofapangenomeforglobalcattlerevealsmissingsequencesandnovelstructuralvariationsprovidingnewinsightsintotheirdiversityandevolutionaryhistory
AT liugeorgee assemblyofapangenomeforglobalcattlerevealsmissingsequencesandnovelstructuralvariationsprovidingnewinsightsintotheirdiversityandevolutionaryhistory