Towards the Complete Goat Pan-Genome by Recovering Missing Genomic Segments From the Reference Genome

It is broadly expected that next-generation sequencing will ultimately generate a complete genome, as with the latest goat reference genome (ARS1), which is considered one of the most continuous assemblies in livestock. However, the rich diversity of worldwide goat breeds indicates that a genome...

Bibliographic Details
Main authors: Li, Ran, Fu, Weiwei, Su, Rui, Tian, Xiaomeng, Du, Duo, Zhao, Yue, Zheng, Zhuqing, Chen, Qiuming, Gao, Shan, Cai, Yudong, Wang, Xihong, Li, Jinquan, Jiang, Yu
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2019
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6874019/
https://www.ncbi.nlm.nih.gov/pubmed/31803240
http://dx.doi.org/10.3389/fgene.2019.01169
_version_ 1783472764116533248
author Li, Ran
Fu, Weiwei
Su, Rui
Tian, Xiaomeng
Du, Duo
Zhao, Yue
Zheng, Zhuqing
Chen, Qiuming
Gao, Shan
Cai, Yudong
Wang, Xihong
Li, Jinquan
Jiang, Yu
collection PubMed
description It is broadly expected that next-generation sequencing will ultimately generate a complete genome, as with the latest goat reference genome (ARS1), which is considered one of the most continuous assemblies in livestock. However, the rich diversity of worldwide goat breeds indicates that a genome from one individual is insufficient to represent the whole genomic content of goats. By comparing nine de novo assemblies from seven sibling species of domestic goat with ARS1, and using resequencing and transcriptome data from goats for verification, we identified a total of 38.3 Mb of sequence that was absent from ARS1. The pan-sequences contain genic fractions with considerable expression. Using the pan-genome (ARS1 together with the pan-sequences) as a reference genome appreciably improves variant calling efficacy. On average, 56,657 spurious SNPs per individual were removed and 24,414 novel SNPs per individual were recovered as a result of better read mapping quality. The transcriptomic mapping rate also increased by ∼1.15%. Our study demonstrates that comparing de novo assemblies from closely related species is an efficient and reliable strategy for finding sequences missing from the reference genome and could be applicable to other species. A pan-genome can serve as an improved reference genome in animals for better exploration of the underlying genomic variation, and could increase the probability of finding genotype-phenotype associations by providing a comprehensive variation database that captures many more differences between individuals. We have constructed a goat pan-genome web interface for data visualization (http://animal.nwsuaf.edu.cn/panGoat).
format Online
Article
Text
id pubmed-6874019
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-6874019 2019-12-04 Front Genet Genetics Frontiers Media S.A. 2019-11-15 /pmc/articles/PMC6874019/ /pubmed/31803240 http://dx.doi.org/10.3389/fgene.2019.01169 Text en Copyright © 2019 Li, Fu, Su, Tian, Du, Zhao, Zheng, Chen, Gao, Cai, Wang, Li and Jiang http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
title Towards the Complete Goat Pan-Genome by Recovering Missing Genomic Segments From the Reference Genome
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6874019/
https://www.ncbi.nlm.nih.gov/pubmed/31803240
http://dx.doi.org/10.3389/fgene.2019.01169