Cargando…
Graph-based models of the Oenothera mitochondrial genome capture the enormous complexity of higher plant mitochondrial DNA organization
Plant mitochondrial genomes display an enormous structural complexity, as recombining repeat-pairs lead to the generation of various sub-genomic molecules, rendering these genomes extremely challenging to assemble. We present a novel bioinformatic data-processing pipeline called SAGBAC (Semi-Automat...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8969700/ https://www.ncbi.nlm.nih.gov/pubmed/35372837 http://dx.doi.org/10.1093/nargab/lqac027 |
_version_ | 1784679311978004480 |
---|---|
author | Fischer, Axel Dotzek, Jana Walther, Dirk Greiner, Stephan |
author_facet | Fischer, Axel Dotzek, Jana Walther, Dirk Greiner, Stephan |
author_sort | Fischer, Axel |
collection | PubMed |
description | Plant mitochondrial genomes display an enormous structural complexity, as recombining repeat-pairs lead to the generation of various sub-genomic molecules, rendering these genomes extremely challenging to assemble. We present a novel bioinformatic data-processing pipeline called SAGBAC (Semi-Automated Graph-Based Assembly Curator) that identifies recombinogenic repeat-pairs and reconstructs plant mitochondrial genomes. SAGBAC processes assembly outputs and applies our novel ISEIS (Iterative Sequence Ends Identity Search) algorithm to obtain a graph-based visualization. We applied this approach to three mitochondrial genomes of evening primrose (Oenothera), a plant genus used for cytoplasmic genetics studies. All identified repeat pairs were found to be flanked by two alternative and unique sequence-contigs defining so-called ‘double forks’, resulting in four possible contig-repeat‐contig combinations for each repeat pair. Based on the inferred structural models, the stoichiometry of the different contig-repeat-contig combinations was analyzed using Illumina mate-pair and PacBio RSII data. This uncovered a remarkable structural diversity of the three closely related mitochondrial genomes, as well as substantial phylogenetic variation of the underlying repeats. Our model allows predicting all recombination events and, thus, all possible sub-genomes. In future work, the proposed methodology may prove useful for the investigation of the sub-genome organization and dynamics in different tissues and at various developmental stages. |
format | Online Article Text |
id | pubmed-8969700 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-89697002022-04-01 Graph-based models of the Oenothera mitochondrial genome capture the enormous complexity of higher plant mitochondrial DNA organization Fischer, Axel Dotzek, Jana Walther, Dirk Greiner, Stephan NAR Genom Bioinform Standard Article Plant mitochondrial genomes display an enormous structural complexity, as recombining repeat-pairs lead to the generation of various sub-genomic molecules, rendering these genomes extremely challenging to assemble. We present a novel bioinformatic data-processing pipeline called SAGBAC (Semi-Automated Graph-Based Assembly Curator) that identifies recombinogenic repeat-pairs and reconstructs plant mitochondrial genomes. SAGBAC processes assembly outputs and applies our novel ISEIS (Iterative Sequence Ends Identity Search) algorithm to obtain a graph-based visualization. We applied this approach to three mitochondrial genomes of evening primrose (Oenothera), a plant genus used for cytoplasmic genetics studies. All identified repeat pairs were found to be flanked by two alternative and unique sequence-contigs defining so-called ‘double forks’, resulting in four possible contig-repeat‐contig combinations for each repeat pair. Based on the inferred structural models, the stoichiometry of the different contig-repeat-contig combinations was analyzed using Illumina mate-pair and PacBio RSII data. This uncovered a remarkable structural diversity of the three closely related mitochondrial genomes, as well as substantial phylogenetic variation of the underlying repeats. Our model allows predicting all recombination events and, thus, all possible sub-genomes. In future work, the proposed methodology may prove useful for the investigation of the sub-genome organization and dynamics in different tissues and at various developmental stages. Oxford University Press 2022-03-31 /pmc/articles/PMC8969700/ /pubmed/35372837 http://dx.doi.org/10.1093/nargab/lqac027 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Standard Article Fischer, Axel Dotzek, Jana Walther, Dirk Greiner, Stephan Graph-based models of the Oenothera mitochondrial genome capture the enormous complexity of higher plant mitochondrial DNA organization |
title | Graph-based models of the Oenothera mitochondrial genome capture the enormous complexity of higher plant mitochondrial DNA organization |
title_full | Graph-based models of the Oenothera mitochondrial genome capture the enormous complexity of higher plant mitochondrial DNA organization |
title_fullStr | Graph-based models of the Oenothera mitochondrial genome capture the enormous complexity of higher plant mitochondrial DNA organization |
title_full_unstemmed | Graph-based models of the Oenothera mitochondrial genome capture the enormous complexity of higher plant mitochondrial DNA organization |
title_short | Graph-based models of the Oenothera mitochondrial genome capture the enormous complexity of higher plant mitochondrial DNA organization |
title_sort | graph-based models of the oenothera mitochondrial genome capture the enormous complexity of higher plant mitochondrial dna organization |
topic | Standard Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8969700/ https://www.ncbi.nlm.nih.gov/pubmed/35372837 http://dx.doi.org/10.1093/nargab/lqac027 |
work_keys_str_mv | AT fischeraxel graphbasedmodelsoftheoenotheramitochondrialgenomecapturetheenormouscomplexityofhigherplantmitochondrialdnaorganization AT dotzekjana graphbasedmodelsoftheoenotheramitochondrialgenomecapturetheenormouscomplexityofhigherplantmitochondrialdnaorganization AT waltherdirk graphbasedmodelsoftheoenotheramitochondrialgenomecapturetheenormouscomplexityofhigherplantmitochondrialdnaorganization AT greinerstephan graphbasedmodelsoftheoenotheramitochondrialgenomecapturetheenormouscomplexityofhigherplantmitochondrialdnaorganization |