Cargando…

Graph-based models of the Oenothera mitochondrial genome capture the enormous complexity of higher plant mitochondrial DNA organization

Plant mitochondrial genomes display an enormous structural complexity, as recombining repeat-pairs lead to the generation of various sub-genomic molecules, rendering these genomes extremely challenging to assemble. We present a novel bioinformatic data-processing pipeline called SAGBAC (Semi-Automat...

Descripción completa

Detalles Bibliográficos
Autores principales: Fischer, Axel, Dotzek, Jana, Walther, Dirk, Greiner, Stephan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8969700/
https://www.ncbi.nlm.nih.gov/pubmed/35372837
http://dx.doi.org/10.1093/nargab/lqac027
_version_ 1784679311978004480
author Fischer, Axel
Dotzek, Jana
Walther, Dirk
Greiner, Stephan
author_facet Fischer, Axel
Dotzek, Jana
Walther, Dirk
Greiner, Stephan
author_sort Fischer, Axel
collection PubMed
description Plant mitochondrial genomes display an enormous structural complexity, as recombining repeat-pairs lead to the generation of various sub-genomic molecules, rendering these genomes extremely challenging to assemble. We present a novel bioinformatic data-processing pipeline called SAGBAC (Semi-Automated Graph-Based Assembly Curator) that identifies recombinogenic repeat-pairs and reconstructs plant mitochondrial genomes. SAGBAC processes assembly outputs and applies our novel ISEIS (Iterative Sequence Ends Identity Search) algorithm to obtain a graph-based visualization. We applied this approach to three mitochondrial genomes of evening primrose (Oenothera), a plant genus used for cytoplasmic genetics studies. All identified repeat pairs were found to be flanked by two alternative and unique sequence-contigs defining so-called ‘double forks’, resulting in four possible contig-repeat‐contig combinations for each repeat pair. Based on the inferred structural models, the stoichiometry of the different contig-repeat-contig combinations was analyzed using Illumina mate-pair and PacBio RSII data. This uncovered a remarkable structural diversity of the three closely related mitochondrial genomes, as well as substantial phylogenetic variation of the underlying repeats. Our model allows predicting all recombination events and, thus, all possible sub-genomes. In future work, the proposed methodology may prove useful for the investigation of the sub-genome organization and dynamics in different tissues and at various developmental stages.
format Online
Article
Text
id pubmed-8969700
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-89697002022-04-01 Graph-based models of the Oenothera mitochondrial genome capture the enormous complexity of higher plant mitochondrial DNA organization Fischer, Axel Dotzek, Jana Walther, Dirk Greiner, Stephan NAR Genom Bioinform Standard Article Plant mitochondrial genomes display an enormous structural complexity, as recombining repeat-pairs lead to the generation of various sub-genomic molecules, rendering these genomes extremely challenging to assemble. We present a novel bioinformatic data-processing pipeline called SAGBAC (Semi-Automated Graph-Based Assembly Curator) that identifies recombinogenic repeat-pairs and reconstructs plant mitochondrial genomes. SAGBAC processes assembly outputs and applies our novel ISEIS (Iterative Sequence Ends Identity Search) algorithm to obtain a graph-based visualization. We applied this approach to three mitochondrial genomes of evening primrose (Oenothera), a plant genus used for cytoplasmic genetics studies. All identified repeat pairs were found to be flanked by two alternative and unique sequence-contigs defining so-called ‘double forks’, resulting in four possible contig-repeat‐contig combinations for each repeat pair. Based on the inferred structural models, the stoichiometry of the different contig-repeat-contig combinations was analyzed using Illumina mate-pair and PacBio RSII data. This uncovered a remarkable structural diversity of the three closely related mitochondrial genomes, as well as substantial phylogenetic variation of the underlying repeats. Our model allows predicting all recombination events and, thus, all possible sub-genomes. In future work, the proposed methodology may prove useful for the investigation of the sub-genome organization and dynamics in different tissues and at various developmental stages. Oxford University Press 2022-03-31 /pmc/articles/PMC8969700/ /pubmed/35372837 http://dx.doi.org/10.1093/nargab/lqac027 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Standard Article
Fischer, Axel
Dotzek, Jana
Walther, Dirk
Greiner, Stephan
Graph-based models of the Oenothera mitochondrial genome capture the enormous complexity of higher plant mitochondrial DNA organization
title Graph-based models of the Oenothera mitochondrial genome capture the enormous complexity of higher plant mitochondrial DNA organization
title_full Graph-based models of the Oenothera mitochondrial genome capture the enormous complexity of higher plant mitochondrial DNA organization
title_fullStr Graph-based models of the Oenothera mitochondrial genome capture the enormous complexity of higher plant mitochondrial DNA organization
title_full_unstemmed Graph-based models of the Oenothera mitochondrial genome capture the enormous complexity of higher plant mitochondrial DNA organization
title_short Graph-based models of the Oenothera mitochondrial genome capture the enormous complexity of higher plant mitochondrial DNA organization
title_sort graph-based models of the oenothera mitochondrial genome capture the enormous complexity of higher plant mitochondrial dna organization
topic Standard Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8969700/
https://www.ncbi.nlm.nih.gov/pubmed/35372837
http://dx.doi.org/10.1093/nargab/lqac027
work_keys_str_mv AT fischeraxel graphbasedmodelsoftheoenotheramitochondrialgenomecapturetheenormouscomplexityofhigherplantmitochondrialdnaorganization
AT dotzekjana graphbasedmodelsoftheoenotheramitochondrialgenomecapturetheenormouscomplexityofhigherplantmitochondrialdnaorganization
AT waltherdirk graphbasedmodelsoftheoenotheramitochondrialgenomecapturetheenormouscomplexityofhigherplantmitochondrialdnaorganization
AT greinerstephan graphbasedmodelsoftheoenotheramitochondrialgenomecapturetheenormouscomplexityofhigherplantmitochondrialdnaorganization