Cargando…

rKOMICS: an R package for processing mitochondrial minicircle assemblies in population-scale genome projects

BACKGROUND: The advent of population-scale genome projects has revolutionized our biological understanding of parasitic protozoa. However, while hundreds to thousands of nuclear genomes of parasitic protozoa have been generated and analyzed, information about the diversity, structure and evolution o...

Descripción completa

Detalles Bibliográficos
Autores principales: Geerts, Manon, Schnaufer, Achim, Van den Broeck, Frederik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8479924/
https://www.ncbi.nlm.nih.gov/pubmed/34583651
http://dx.doi.org/10.1186/s12859-021-04384-1
_version_ 1784576363571707904
author Geerts, Manon
Schnaufer, Achim
Van den Broeck, Frederik
author_facet Geerts, Manon
Schnaufer, Achim
Van den Broeck, Frederik
author_sort Geerts, Manon
collection PubMed
description BACKGROUND: The advent of population-scale genome projects has revolutionized our biological understanding of parasitic protozoa. However, while hundreds to thousands of nuclear genomes of parasitic protozoa have been generated and analyzed, information about the diversity, structure and evolution of their mitochondrial genomes remains fragmentary, mainly because of their extraordinary complexity. Indeed, unicellular flagellates of the order Kinetoplastida contain structurally the most complex mitochondrial genome of all eukaryotes, organized as a giant network of homogeneous maxicircles and heterogeneous minicircles. We recently developed KOMICS, an analysis toolkit that automates the assembly and circularization of the mitochondrial genomes of Kinetoplastid parasites. While this tool overcomes the limitation of extracting mitochondrial assemblies from Next-Generation Sequencing datasets, interpreting and visualizing the genetic (dis)similarity within and between samples remains a time-consuming process. RESULTS: Here, we present a new analysis toolkit—rKOMICS—to streamline the analyses of minicircle sequence diversity in population-scale genome projects. rKOMICS is a user-friendly R package that has simple installation requirements and that is applicable to all 27 trypanosomatid genera. Once minicircle sequence alignments are generated, rKOMICS allows to examine, summarize and visualize minicircle sequence diversity within and between samples through the analyses of minicircle sequence clusters. We showcase the functionalities of the (r)KOMICS tool suite using a whole-genome sequencing dataset from a recently published study on the history of diversification of the Leishmania braziliensis species complex in Peru. Analyses of population diversity and structure highlighted differences in minicircle sequence richness and composition between Leishmania subspecies, and between subpopulations within subspecies. CONCLUSION: The rKOMICS package establishes a critical framework to manipulate, explore and extract biologically relevant information from mitochondrial minicircle assemblies in tens to hundreds of samples simultaneously and efficiently. This should facilitate research that aims to develop new molecular markers for identifying species-specific minicircles, or to study the ancestry of parasites for complementary insights into their evolutionary history.
format Online
Article
Text
id pubmed-8479924
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-84799242021-09-29 rKOMICS: an R package for processing mitochondrial minicircle assemblies in population-scale genome projects Geerts, Manon Schnaufer, Achim Van den Broeck, Frederik BMC Bioinformatics Software BACKGROUND: The advent of population-scale genome projects has revolutionized our biological understanding of parasitic protozoa. However, while hundreds to thousands of nuclear genomes of parasitic protozoa have been generated and analyzed, information about the diversity, structure and evolution of their mitochondrial genomes remains fragmentary, mainly because of their extraordinary complexity. Indeed, unicellular flagellates of the order Kinetoplastida contain structurally the most complex mitochondrial genome of all eukaryotes, organized as a giant network of homogeneous maxicircles and heterogeneous minicircles. We recently developed KOMICS, an analysis toolkit that automates the assembly and circularization of the mitochondrial genomes of Kinetoplastid parasites. While this tool overcomes the limitation of extracting mitochondrial assemblies from Next-Generation Sequencing datasets, interpreting and visualizing the genetic (dis)similarity within and between samples remains a time-consuming process. RESULTS: Here, we present a new analysis toolkit—rKOMICS—to streamline the analyses of minicircle sequence diversity in population-scale genome projects. rKOMICS is a user-friendly R package that has simple installation requirements and that is applicable to all 27 trypanosomatid genera. Once minicircle sequence alignments are generated, rKOMICS allows to examine, summarize and visualize minicircle sequence diversity within and between samples through the analyses of minicircle sequence clusters. We showcase the functionalities of the (r)KOMICS tool suite using a whole-genome sequencing dataset from a recently published study on the history of diversification of the Leishmania braziliensis species complex in Peru. Analyses of population diversity and structure highlighted differences in minicircle sequence richness and composition between Leishmania subspecies, and between subpopulations within subspecies. CONCLUSION: The rKOMICS package establishes a critical framework to manipulate, explore and extract biologically relevant information from mitochondrial minicircle assemblies in tens to hundreds of samples simultaneously and efficiently. This should facilitate research that aims to develop new molecular markers for identifying species-specific minicircles, or to study the ancestry of parasites for complementary insights into their evolutionary history. BioMed Central 2021-09-28 /pmc/articles/PMC8479924/ /pubmed/34583651 http://dx.doi.org/10.1186/s12859-021-04384-1 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Geerts, Manon
Schnaufer, Achim
Van den Broeck, Frederik
rKOMICS: an R package for processing mitochondrial minicircle assemblies in population-scale genome projects
title rKOMICS: an R package for processing mitochondrial minicircle assemblies in population-scale genome projects
title_full rKOMICS: an R package for processing mitochondrial minicircle assemblies in population-scale genome projects
title_fullStr rKOMICS: an R package for processing mitochondrial minicircle assemblies in population-scale genome projects
title_full_unstemmed rKOMICS: an R package for processing mitochondrial minicircle assemblies in population-scale genome projects
title_short rKOMICS: an R package for processing mitochondrial minicircle assemblies in population-scale genome projects
title_sort rkomics: an r package for processing mitochondrial minicircle assemblies in population-scale genome projects
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8479924/
https://www.ncbi.nlm.nih.gov/pubmed/34583651
http://dx.doi.org/10.1186/s12859-021-04384-1
work_keys_str_mv AT geertsmanon rkomicsanrpackageforprocessingmitochondrialminicircleassembliesinpopulationscalegenomeprojects
AT schnauferachim rkomicsanrpackageforprocessingmitochondrialminicircleassembliesinpopulationscalegenomeprojects
AT vandenbroeckfrederik rkomicsanrpackageforprocessingmitochondrialminicircleassembliesinpopulationscalegenomeprojects