Towards the Complete Goat Pan-Genome by Recovering Missing Genomic Segments From the Reference Genome
It is broadly expected that next generation sequencing will ultimately generate a complete genome as is the latest goat reference genome (ARS1), which is considered to be one of the most continuous assemblies in livestock. However, the rich diversity of worldwide goat breeds indicates that a genome...
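Main Authors: | Li, Ran; Fu, Weiwei; Su, Rui; Tian, Xiaomeng; Du, Duo; Zhao, Yue; Zheng, Zhuqing; Chen, Qiuming; Gao, Shan; Cai, Yudong; Wang, Xihong; Li, Jinquan; Jiang, Yu |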
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Frontiers Media S.A. 2019 |
Subjects: | Genetics |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6874019/ https://www.ncbi.nlm.nih.gov/pubmed/31803240 http://dx.doi.org/10.3389/fgene.2019.01169 |
_version_ | 1783472764116533248 |
---|---|
author | Li, Ran Fu, Weiwei Su, Rui Tian, Xiaomeng Du, Duo Zhao, Yue Zheng, Zhuqing Chen, Qiuming Gao, Shan Cai, Yudong Wang, Xihong Li, Jinquan Jiang, Yu |
author_facet | Li, Ran Fu, Weiwei Su, Rui Tian, Xiaomeng Du, Duo Zhao, Yue Zheng, Zhuqing Chen, Qiuming Gao, Shan Cai, Yudong Wang, Xihong Li, Jinquan Jiang, Yu |
author_sort | Li, Ran |
collection | PubMed |
description | It is broadly expected that next-generation sequencing will ultimately generate complete genomes, as exemplified by the latest goat reference genome (ARS1), which is considered to be one of the most continuous assemblies in livestock. However, the rich diversity of goat breeds worldwide indicates that a genome from one individual is insufficient to represent the full genomic content of goats. By comparing nine de novo assemblies from seven sibling species of domestic goat with ARS1, and using resequencing and transcriptome data from goats for verification, we identified a total of 38.3 Mb of sequence that was absent from ARS1. These pan-sequences contain genic fractions with considerable expression. Using the pan-genome (ARS1 together with the pan-sequences) as the reference genome appreciably improves variant calling. A total of 56,657 spurious SNPs per individual were suppressed and, on average, 24,414 novel SNPs per individual were recovered as a result of better read-mapping quality. The transcriptomic mapping rate also increased by ∼1.15%. Our study demonstrates that comparing de novo assemblies from closely related species is an efficient and reliable strategy for finding sequences missing from the reference genome, and that it could be applicable to other species. A pan-genome can serve as an improved reference genome in animals for better exploration of the underlying genomic variation, and could increase the probability of finding genotype-phenotype associations by providing a comprehensive variation database that captures many more differences between individuals. We have constructed a goat pan-genome web interface for data visualization (http://animal.nwsuaf.edu.cn/panGoat). |
format | Online Article Text |
id | pubmed-6874019 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-6874019 2019-12-04 Towards the Complete Goat Pan-Genome by Recovering Missing Genomic Segments From the Reference Genome Li, Ran Fu, Weiwei Su, Rui Tian, Xiaomeng Du, Duo Zhao, Yue Zheng, Zhuqing Chen, Qiuming Gao, Shan Cai, Yudong Wang, Xihong Li, Jinquan Jiang, Yu Front Genet Genetics Frontiers Media S.A. 2019-11-15 /pmc/articles/PMC6874019/ /pubmed/31803240 http://dx.doi.org/10.3389/fgene.2019.01169 Text en Copyright © 2019 Li, Fu, Su, Tian, Du, Zhao, Zheng, Chen, Gao, Cai, Wang, Li and Jiang http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
title | Towards the Complete Goat Pan-Genome by Recovering Missing Genomic Segments From the Reference Genome |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6874019/ https://www.ncbi.nlm.nih.gov/pubmed/31803240 http://dx.doi.org/10.3389/fgene.2019.01169 |