
Chloroplast genome assembly of Serjania erecta Raldk: comparative analysis reveals gene number variation and selection in protein-coding plastid genes of Sapindaceae

Bibliographic Details
Main Authors: Corvalán, Leonardo C. J., Sobreiro, Mariane B., Carvalho, Larissa R., Dias, Renata O., Braga-Ferreira, Ramilla S., Targueta, Cintia P., Silva-Neto, Carlos M. e, Berton, Bianca W., Pereira, Ana Maria S., Diniz-filho, José A. F., Telles, Mariana P. C., Nunes, Rhewter
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2023
Subjects: Plant Science
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10562606/
https://www.ncbi.nlm.nih.gov/pubmed/37822334
http://dx.doi.org/10.3389/fpls.2023.1258794
collection PubMed
description Serjania erecta Raldk is an essential genetic resource due to its anti-inflammatory, gastroprotective, and anti-Alzheimer properties. However, the genetic and evolutionary aspects of the species remain poorly known. Here, we sequenced and assembled the complete chloroplast genome of S. erecta and used it in a comparative analysis within the Sapindaceae family. S. erecta has a chloroplast genome (cpDNA) of 159,297 bp, divided into a Large Single Copy region (LSC) of 84,556 bp and a Small Single Copy region (SSC) of 18,057 bp, separated by two Inverted Repeat regions (IRa and IRb) of 28,342 bp each. Among the 12 species used in the comparative analysis, S. erecta has the fewest long and microsatellite repeats. The genome structure of Sapindaceae species is relatively conserved; the number of genes varies from 128 to 132, and this variation is associated with three main factors: (1) expansion and contraction of the IRs, resulting in variation in the number of rpl22, rps19, and rps3 genes; (2) pseudogenization of the rps2 gene; and (3) loss or duplication of genes encoding tRNAs, associated with the duplication of trnH-GUG in X. sorbifolium and the absence of trnT-CGU in the Dodonaeoideae subfamily. We identified 10 and 11 mutational hotspots for Sapindaceae and Sapindoideae, respectively. Six highly diverse regions (tRNA-Lys – rps16, ndhC – tRNA-Val, petA – psbJ, ndhF, rpl32 – ccsA, and ycf1) are found in both groups and show potential for the development of DNA barcode markers for molecular taxonomic identification of Serjania. The psaI gene evolves under neutrality in Sapindaceae, while all other chloroplast genes are under strong negative selection. However, local positive selection exists in the ndhF, rpoC2, ycf1, and ycf2 genes. The genes ndhF and ycf1 combine high nucleotide diversity with local positive selection, demonstrating significant potential as markers. Our findings provide the first chloroplast genome of a member of the Paullinieae tribe. Furthermore, we identified patterns of variation in gene number and of selection in genes that are possibly associated with the family’s evolutionary history.
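The region sizes reported in the description can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the record or the authors' pipeline: it assumes the usual quadripartite layout (LSC + IRa + SSC + IRb) and assumes that the reported 28,342 bp is the length of each inverted repeat. The function name `quadripartite_total` is hypothetical.

```python
# Region lengths in bp, taken from the description above.
LSC = 84_556   # Large Single Copy region
SSC = 18_057   # Small Single Copy region
IR = 28_342    # assumed: length of EACH inverted repeat (IRa and IRb)


def quadripartite_total(lsc: int, ssc: int, ir: int) -> int:
    """Total plastome length for an LSC + IRa + SSC + IRb layout."""
    return lsc + ssc + 2 * ir


total = quadripartite_total(LSC, SSC, IR)
print(f"Total length: {total:,} bp")
print(f"IR share of genome: {2 * IR / total:.1%}")
```

Under these assumptions the four regions sum to the reported genome size of 159,297 bp, which supports reading the IR figure as a per-copy length.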
format Online
Article
Text
id pubmed-10562606
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-10562606 2023-10-11 Front Plant Sci Plant Science Frontiers Media S.A. 2023-09-26 /pmc/articles/PMC10562606/ /pubmed/37822334 http://dx.doi.org/10.3389/fpls.2023.1258794 Text en Copyright © 2023 Corvalán, Sobreiro, Carvalho, Dias, Braga-Ferreira, Targueta, Silva-Neto, Berton, Pereira, Diniz-filho, Telles and Nunes https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
topic Plant Science