Cargando…

MitoFinder: Efficient automated large‐scale extraction of mitogenomic data in target enrichment phylogenomics

Thanks to the development of high‐throughput sequencing technologies, target enrichment sequencing of nuclear ultraconserved DNA elements (UCEs) now allows routine inference of phylogenetic relationships from thousands of genomic markers. Recently, it has been shown that mitochondrial DNA (mtDNA) is...

Descripción completa

Detalles Bibliográficos
Autores principales: Allio, Rémi, Schomaker‐Bastos, Alex, Romiguier, Jonathan, Prosdocimi, Francisco, Nabholz, Benoit, Delsuc, Frédéric
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7497042/
https://www.ncbi.nlm.nih.gov/pubmed/32243090
http://dx.doi.org/10.1111/1755-0998.13160
_version_ 1783583231296143360
author Allio, Rémi
Schomaker‐Bastos, Alex
Romiguier, Jonathan
Prosdocimi, Francisco
Nabholz, Benoit
Delsuc, Frédéric
author_facet Allio, Rémi
Schomaker‐Bastos, Alex
Romiguier, Jonathan
Prosdocimi, Francisco
Nabholz, Benoit
Delsuc, Frédéric
author_sort Allio, Rémi
collection PubMed
description Thanks to the development of high‐throughput sequencing technologies, target enrichment sequencing of nuclear ultraconserved DNA elements (UCEs) now allows routine inference of phylogenetic relationships from thousands of genomic markers. Recently, it has been shown that mitochondrial DNA (mtDNA) is frequently sequenced alongside the targeted loci in such capture experiments. Despite its broad evolutionary interest, mtDNA is rarely assembled and used in conjunction with nuclear markers in capture‐based studies. Here, we developed MitoFinder, a user‐friendly bioinformatic pipeline, to efficiently assemble and annotate mitogenomic data from hundreds of UCE libraries. As a case study, we used ants (Formicidae) for which 501 UCE libraries have been sequenced whereas only 29 mitogenomes are available. We compared the efficiency of four different assemblers (IDBA‐UD, MEGAHIT, MetaSPAdes, and Trinity) for assembling both UCE and mtDNA loci. Using MitoFinder, we show that metagenomic assemblers, in particular MetaSPAdes, are well suited to assemble both UCEs and mtDNA. Mitogenomic signal was successfully extracted from all 501 UCE libraries, allowing us to confirm species identification using CO1 barcoding. Moreover, our automated procedure retrieved 296 cases in which the mitochondrial genome was assembled in a single contig, thus increasing the number of available ant mitogenomes by an order of magnitude. By utilizing the power of metagenomic assemblers, MitoFinder provides an efficient tool to extract complementary mitogenomic data from UCE libraries, allowing testing for potential mitonuclear discordance. Our approach is potentially applicable to other sequence capture methods, transcriptomic data and whole genome shotgun sequencing in diverse taxa. The MitoFinder software is available from GitHub (https://github.com/RemiAllio/MitoFinder).
format Online
Article
Text
id pubmed-7497042
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-74970422020-09-25 MitoFinder: Efficient automated large‐scale extraction of mitogenomic data in target enrichment phylogenomics Allio, Rémi Schomaker‐Bastos, Alex Romiguier, Jonathan Prosdocimi, Francisco Nabholz, Benoit Delsuc, Frédéric Mol Ecol Resour RESOURCE ARTICLES Thanks to the development of high‐throughput sequencing technologies, target enrichment sequencing of nuclear ultraconserved DNA elements (UCEs) now allows routine inference of phylogenetic relationships from thousands of genomic markers. Recently, it has been shown that mitochondrial DNA (mtDNA) is frequently sequenced alongside the targeted loci in such capture experiments. Despite its broad evolutionary interest, mtDNA is rarely assembled and used in conjunction with nuclear markers in capture‐based studies. Here, we developed MitoFinder, a user‐friendly bioinformatic pipeline, to efficiently assemble and annotate mitogenomic data from hundreds of UCE libraries. As a case study, we used ants (Formicidae) for which 501 UCE libraries have been sequenced whereas only 29 mitogenomes are available. We compared the efficiency of four different assemblers (IDBA‐UD, MEGAHIT, MetaSPAdes, and Trinity) for assembling both UCE and mtDNA loci. Using MitoFinder, we show that metagenomic assemblers, in particular MetaSPAdes, are well suited to assemble both UCEs and mtDNA. Mitogenomic signal was successfully extracted from all 501 UCE libraries, allowing us to confirm species identification using CO1 barcoding. Moreover, our automated procedure retrieved 296 cases in which the mitochondrial genome was assembled in a single contig, thus increasing the number of available ant mitogenomes by an order of magnitude. By utilizing the power of metagenomic assemblers, MitoFinder provides an efficient tool to extract complementary mitogenomic data from UCE libraries, allowing testing for potential mitonuclear discordance. Our approach is potentially applicable to other sequence capture methods, transcriptomic data and whole genome shotgun sequencing in diverse taxa. The MitoFinder software is available from GitHub (https://github.com/RemiAllio/MitoFinder). John Wiley and Sons Inc. 2020-04-25 2020-07 /pmc/articles/PMC7497042/ /pubmed/32243090 http://dx.doi.org/10.1111/1755-0998.13160 Text en © 2020 The Authors. Molecular Ecology Resources published by John Wiley & Sons Ltd This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made.
spellingShingle RESOURCE ARTICLES
Allio, Rémi
Schomaker‐Bastos, Alex
Romiguier, Jonathan
Prosdocimi, Francisco
Nabholz, Benoit
Delsuc, Frédéric
MitoFinder: Efficient automated large‐scale extraction of mitogenomic data in target enrichment phylogenomics
title MitoFinder: Efficient automated large‐scale extraction of mitogenomic data in target enrichment phylogenomics
title_full MitoFinder: Efficient automated large‐scale extraction of mitogenomic data in target enrichment phylogenomics
title_fullStr MitoFinder: Efficient automated large‐scale extraction of mitogenomic data in target enrichment phylogenomics
title_full_unstemmed MitoFinder: Efficient automated large‐scale extraction of mitogenomic data in target enrichment phylogenomics
title_short MitoFinder: Efficient automated large‐scale extraction of mitogenomic data in target enrichment phylogenomics
title_sort mitofinder: efficient automated large‐scale extraction of mitogenomic data in target enrichment phylogenomics
topic RESOURCE ARTICLES
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7497042/
https://www.ncbi.nlm.nih.gov/pubmed/32243090
http://dx.doi.org/10.1111/1755-0998.13160
work_keys_str_mv AT allioremi mitofinderefficientautomatedlargescaleextractionofmitogenomicdataintargetenrichmentphylogenomics
AT schomakerbastosalex mitofinderefficientautomatedlargescaleextractionofmitogenomicdataintargetenrichmentphylogenomics
AT romiguierjonathan mitofinderefficientautomatedlargescaleextractionofmitogenomicdataintargetenrichmentphylogenomics
AT prosdocimifrancisco mitofinderefficientautomatedlargescaleextractionofmitogenomicdataintargetenrichmentphylogenomics
AT nabholzbenoit mitofinderefficientautomatedlargescaleextractionofmitogenomicdataintargetenrichmentphylogenomics
AT delsucfrederic mitofinderefficientautomatedlargescaleextractionofmitogenomicdataintargetenrichmentphylogenomics