Cargando…

CGT-seq: epigenome-guided de novo assembly of the core genome for divergent populations with large genome

Genetic diversity in plants is remarkably high. Recent whole genome sequencing (WGS) of 67 rice accessions recovered 10,872 novel genes. Comparison of the genetic architecture among divergent populations or between crops and wild relatives is essential for obtaining functional components determining...

Descripción completa

Detalles Bibliográficos
Autores principales: Qi, Meifang, Li, Zijuan, Liu, Chunmei, Hu, Wenyan, Ye, Luhuan, Xie, Yilin, Zhuang, Yili, Zhao, Fei, Teng, Wan, Zheng, Qi, Fan, Zhenjun, Xu, Lin, Lang, Zhaobo, Tong, Yiping, Zhang, Yijing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6182137/
https://www.ncbi.nlm.nih.gov/pubmed/29931324
http://dx.doi.org/10.1093/nar/gky522
_version_ 1783362497384808448
author Qi, Meifang
Li, Zijuan
Liu, Chunmei
Hu, Wenyan
Ye, Luhuan
Xie, Yilin
Zhuang, Yili
Zhao, Fei
Teng, Wan
Zheng, Qi
Fan, Zhenjun
Xu, Lin
Lang, Zhaobo
Tong, Yiping
Zhang, Yijing
author_facet Qi, Meifang
Li, Zijuan
Liu, Chunmei
Hu, Wenyan
Ye, Luhuan
Xie, Yilin
Zhuang, Yili
Zhao, Fei
Teng, Wan
Zheng, Qi
Fan, Zhenjun
Xu, Lin
Lang, Zhaobo
Tong, Yiping
Zhang, Yijing
author_sort Qi, Meifang
collection PubMed
description Genetic diversity in plants is remarkably high. Recent whole genome sequencing (WGS) of 67 rice accessions recovered 10,872 novel genes. Comparison of the genetic architecture among divergent populations or between crops and wild relatives is essential for obtaining functional components determining crucial traits. However, many major crops have gigabase-scale genomes, which are not well-suited to WGS. Existing cost-effective sequencing approaches including re-sequencing, exome-sequencing and restriction enzyme-based methods all have difficulty in obtaining long novel genomic sequences from highly divergent population with large genome size. The present study presented a reference-independent core genome targeted sequencing approach, CGT-seq, which employed epigenomic information from both active and repressive epigenetic marks to guide the assembly of the core genome mainly composed of promoter and intragenic regions. This method was relatively easily implemented, and displayed high sensitivity and specificity for capturing the core genome of bread wheat. 95% intragenic and 89% promoter region from wheat were covered by CGT-seq read. We further demonstrated in rice that CGT-seq captured hundreds of novel genes and regulatory sequences from a previously unsequenced ecotype. Together, with specific enrichment and sequencing of regions within and nearby genes, CGT-seq is a time- and resource-effective approach to profiling functionally relevant regions in sequenced and non-sequenced populations with large genomes.
format Online
Article
Text
id pubmed-6182137
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-61821372018-10-18 CGT-seq: epigenome-guided de novo assembly of the core genome for divergent populations with large genome Qi, Meifang Li, Zijuan Liu, Chunmei Hu, Wenyan Ye, Luhuan Xie, Yilin Zhuang, Yili Zhao, Fei Teng, Wan Zheng, Qi Fan, Zhenjun Xu, Lin Lang, Zhaobo Tong, Yiping Zhang, Yijing Nucleic Acids Res Methods Online Genetic diversity in plants is remarkably high. Recent whole genome sequencing (WGS) of 67 rice accessions recovered 10,872 novel genes. Comparison of the genetic architecture among divergent populations or between crops and wild relatives is essential for obtaining functional components determining crucial traits. However, many major crops have gigabase-scale genomes, which are not well-suited to WGS. Existing cost-effective sequencing approaches including re-sequencing, exome-sequencing and restriction enzyme-based methods all have difficulty in obtaining long novel genomic sequences from highly divergent population with large genome size. The present study presented a reference-independent core genome targeted sequencing approach, CGT-seq, which employed epigenomic information from both active and repressive epigenetic marks to guide the assembly of the core genome mainly composed of promoter and intragenic regions. This method was relatively easily implemented, and displayed high sensitivity and specificity for capturing the core genome of bread wheat. 95% intragenic and 89% promoter region from wheat were covered by CGT-seq read. We further demonstrated in rice that CGT-seq captured hundreds of novel genes and regulatory sequences from a previously unsequenced ecotype. Together, with specific enrichment and sequencing of regions within and nearby genes, CGT-seq is a time- and resource-effective approach to profiling functionally relevant regions in sequenced and non-sequenced populations with large genomes. Oxford University Press 2018-10-12 2018-06-21 /pmc/articles/PMC6182137/ /pubmed/29931324 http://dx.doi.org/10.1093/nar/gky522 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Qi, Meifang
Li, Zijuan
Liu, Chunmei
Hu, Wenyan
Ye, Luhuan
Xie, Yilin
Zhuang, Yili
Zhao, Fei
Teng, Wan
Zheng, Qi
Fan, Zhenjun
Xu, Lin
Lang, Zhaobo
Tong, Yiping
Zhang, Yijing
CGT-seq: epigenome-guided de novo assembly of the core genome for divergent populations with large genome
title CGT-seq: epigenome-guided de novo assembly of the core genome for divergent populations with large genome
title_full CGT-seq: epigenome-guided de novo assembly of the core genome for divergent populations with large genome
title_fullStr CGT-seq: epigenome-guided de novo assembly of the core genome for divergent populations with large genome
title_full_unstemmed CGT-seq: epigenome-guided de novo assembly of the core genome for divergent populations with large genome
title_short CGT-seq: epigenome-guided de novo assembly of the core genome for divergent populations with large genome
title_sort cgt-seq: epigenome-guided de novo assembly of the core genome for divergent populations with large genome
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6182137/
https://www.ncbi.nlm.nih.gov/pubmed/29931324
http://dx.doi.org/10.1093/nar/gky522
work_keys_str_mv AT qimeifang cgtseqepigenomeguideddenovoassemblyofthecoregenomefordivergentpopulationswithlargegenome
AT lizijuan cgtseqepigenomeguideddenovoassemblyofthecoregenomefordivergentpopulationswithlargegenome
AT liuchunmei cgtseqepigenomeguideddenovoassemblyofthecoregenomefordivergentpopulationswithlargegenome
AT huwenyan cgtseqepigenomeguideddenovoassemblyofthecoregenomefordivergentpopulationswithlargegenome
AT yeluhuan cgtseqepigenomeguideddenovoassemblyofthecoregenomefordivergentpopulationswithlargegenome
AT xieyilin cgtseqepigenomeguideddenovoassemblyofthecoregenomefordivergentpopulationswithlargegenome
AT zhuangyili cgtseqepigenomeguideddenovoassemblyofthecoregenomefordivergentpopulationswithlargegenome
AT zhaofei cgtseqepigenomeguideddenovoassemblyofthecoregenomefordivergentpopulationswithlargegenome
AT tengwan cgtseqepigenomeguideddenovoassemblyofthecoregenomefordivergentpopulationswithlargegenome
AT zhengqi cgtseqepigenomeguideddenovoassemblyofthecoregenomefordivergentpopulationswithlargegenome
AT fanzhenjun cgtseqepigenomeguideddenovoassemblyofthecoregenomefordivergentpopulationswithlargegenome
AT xulin cgtseqepigenomeguideddenovoassemblyofthecoregenomefordivergentpopulationswithlargegenome
AT langzhaobo cgtseqepigenomeguideddenovoassemblyofthecoregenomefordivergentpopulationswithlargegenome
AT tongyiping cgtseqepigenomeguideddenovoassemblyofthecoregenomefordivergentpopulationswithlargegenome
AT zhangyijing cgtseqepigenomeguideddenovoassemblyofthecoregenomefordivergentpopulationswithlargegenome