Cargando…

Use of a draft genome of coffee (Coffea arabica) to identify SNPs associated with caffeine content

Arabica coffee (Coffea arabica) has a small gene pool limiting genetic improvement. Selection for caffeine content within this gene pool would be assisted by identification of the genes controlling this important trait. Sequencing of DNA bulks from 18 genotypes with extreme high‐ or low‐caffeine con...

Descripción completa

Detalles Bibliográficos
Autores principales: Tran, Hue T.M., Ramaraj, Thiruvarangan, Furtado, Agnelo, Lee, Leonard Slade, Henry, Robert J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6131422/
https://www.ncbi.nlm.nih.gov/pubmed/29509991
http://dx.doi.org/10.1111/pbi.12912
_version_ 1783354101260615680
author Tran, Hue T.M.
Ramaraj, Thiruvarangan
Furtado, Agnelo
Lee, Leonard Slade
Henry, Robert J.
author_facet Tran, Hue T.M.
Ramaraj, Thiruvarangan
Furtado, Agnelo
Lee, Leonard Slade
Henry, Robert J.
author_sort Tran, Hue T.M.
collection PubMed
description Arabica coffee (Coffea arabica) has a small gene pool limiting genetic improvement. Selection for caffeine content within this gene pool would be assisted by identification of the genes controlling this important trait. Sequencing of DNA bulks from 18 genotypes with extreme high‐ or low‐caffeine content from a population of 232 genotypes was used to identify linked polymorphisms. To obtain a reference genome, a whole genome assembly of arabica coffee (variety K7) was achieved by sequencing using short read (Illumina) and long‐read (PacBio) technology. Assembly was performed using a range of assembly tools resulting in 76 409 scaffolds with a scaffold N50 of 54 544 bp and a total scaffold length of 1448 Mb. Validation of the genome assembly using different tools showed high completeness of the genome. More than 99% of transcriptome sequences mapped to the C. arabica draft genome, and 89% of BUSCOs were present. The assembled genome annotated using AUGUSTUS yielded 99 829 gene models. Using the draft arabica genome as reference in mapping and variant calling allowed the detection of 1444 nonsynonymous single nucleotide polymorphisms (SNPs) associated with caffeine content. Based on Kyoto Encyclopaedia of Genes and Genomes pathway‐based analysis, 65 caffeine‐associated SNPs were discovered, among which 11 SNPs were associated with genes encoding enzymes involved in the conversion of substrates, which participate in the caffeine biosynthesis pathways. This analysis demonstrated the complex genetic control of this key trait in coffee.
format Online
Article
Text
id pubmed-6131422
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-61314222018-09-13 Use of a draft genome of coffee (Coffea arabica) to identify SNPs associated with caffeine content Tran, Hue T.M. Ramaraj, Thiruvarangan Furtado, Agnelo Lee, Leonard Slade Henry, Robert J. Plant Biotechnol J Research Articles Arabica coffee (Coffea arabica) has a small gene pool limiting genetic improvement. Selection for caffeine content within this gene pool would be assisted by identification of the genes controlling this important trait. Sequencing of DNA bulks from 18 genotypes with extreme high‐ or low‐caffeine content from a population of 232 genotypes was used to identify linked polymorphisms. To obtain a reference genome, a whole genome assembly of arabica coffee (variety K7) was achieved by sequencing using short read (Illumina) and long‐read (PacBio) technology. Assembly was performed using a range of assembly tools resulting in 76 409 scaffolds with a scaffold N50 of 54 544 bp and a total scaffold length of 1448 Mb. Validation of the genome assembly using different tools showed high completeness of the genome. More than 99% of transcriptome sequences mapped to the C. arabica draft genome, and 89% of BUSCOs were present. The assembled genome annotated using AUGUSTUS yielded 99 829 gene models. Using the draft arabica genome as reference in mapping and variant calling allowed the detection of 1444 nonsynonymous single nucleotide polymorphisms (SNPs) associated with caffeine content. Based on Kyoto Encyclopaedia of Genes and Genomes pathway‐based analysis, 65 caffeine‐associated SNPs were discovered, among which 11 SNPs were associated with genes encoding enzymes involved in the conversion of substrates, which participate in the caffeine biosynthesis pathways. This analysis demonstrated the complex genetic control of this key trait in coffee. John Wiley and Sons Inc. 2018-04-13 2018-10 /pmc/articles/PMC6131422/ /pubmed/29509991 http://dx.doi.org/10.1111/pbi.12912 Text en © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Tran, Hue T.M.
Ramaraj, Thiruvarangan
Furtado, Agnelo
Lee, Leonard Slade
Henry, Robert J.
Use of a draft genome of coffee (Coffea arabica) to identify SNPs associated with caffeine content
title Use of a draft genome of coffee (Coffea arabica) to identify SNPs associated with caffeine content
title_full Use of a draft genome of coffee (Coffea arabica) to identify SNPs associated with caffeine content
title_fullStr Use of a draft genome of coffee (Coffea arabica) to identify SNPs associated with caffeine content
title_full_unstemmed Use of a draft genome of coffee (Coffea arabica) to identify SNPs associated with caffeine content
title_short Use of a draft genome of coffee (Coffea arabica) to identify SNPs associated with caffeine content
title_sort use of a draft genome of coffee (coffea arabica) to identify snps associated with caffeine content
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6131422/
https://www.ncbi.nlm.nih.gov/pubmed/29509991
http://dx.doi.org/10.1111/pbi.12912
work_keys_str_mv AT tranhuetm useofadraftgenomeofcoffeecoffeaarabicatoidentifysnpsassociatedwithcaffeinecontent
AT ramarajthiruvarangan useofadraftgenomeofcoffeecoffeaarabicatoidentifysnpsassociatedwithcaffeinecontent
AT furtadoagnelo useofadraftgenomeofcoffeecoffeaarabicatoidentifysnpsassociatedwithcaffeinecontent
AT leeleonardslade useofadraftgenomeofcoffeecoffeaarabicatoidentifysnpsassociatedwithcaffeinecontent
AT henryrobertj useofadraftgenomeofcoffeecoffeaarabicatoidentifysnpsassociatedwithcaffeinecontent