Cargando…

Complete vertebrate mitogenomes reveal widespread repeats and gene duplications

BACKGROUND: Modern sequencing technologies should make the assembly of the relatively small mitochondrial genomes an easy undertaking. However, few tools exist that address mitochondrial assembly directly. RESULTS: As part of the Vertebrate Genomes Project (VGP) we develop mitoVGP, a fully automated...

Descripción completa

Detalles Bibliográficos
Autores principales: Formenti, Giulio, Rhie, Arang, Balacco, Jennifer, Haase, Bettina, Mountcastle, Jacquelyn, Fedrigo, Olivier, Brown, Samara, Capodiferro, Marco Rosario, Al-Ajli, Farooq O., Ambrosini, Roberto, Houde, Peter, Koren, Sergey, Oliver, Karen, Smith, Michelle, Skelton, Jason, Betteridge, Emma, Dolucan, Jale, Corton, Craig, Bista, Iliana, Torrance, James, Tracey, Alan, Wood, Jonathan, Uliano-Silva, Marcela, Howe, Kerstin, McCarthy, Shane, Winkler, Sylke, Kwak, Woori, Korlach, Jonas, Fungtammasan, Arkarachai, Fordham, Daniel, Costa, Vania, Mayes, Simon, Chiara, Matteo, Horner, David S., Myers, Eugene, Durbin, Richard, Achilli, Alessandro, Braun, Edward L., Phillippy, Adam M., Jarvis, Erich D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8082918/
https://www.ncbi.nlm.nih.gov/pubmed/33910595
http://dx.doi.org/10.1186/s13059-021-02336-9
_version_ 1783685930356310016
author Formenti, Giulio
Rhie, Arang
Balacco, Jennifer
Haase, Bettina
Mountcastle, Jacquelyn
Fedrigo, Olivier
Brown, Samara
Capodiferro, Marco Rosario
Al-Ajli, Farooq O.
Ambrosini, Roberto
Houde, Peter
Koren, Sergey
Oliver, Karen
Smith, Michelle
Skelton, Jason
Betteridge, Emma
Dolucan, Jale
Corton, Craig
Bista, Iliana
Torrance, James
Tracey, Alan
Wood, Jonathan
Uliano-Silva, Marcela
Howe, Kerstin
McCarthy, Shane
Winkler, Sylke
Kwak, Woori
Korlach, Jonas
Fungtammasan, Arkarachai
Fordham, Daniel
Costa, Vania
Mayes, Simon
Chiara, Matteo
Horner, David S.
Myers, Eugene
Durbin, Richard
Achilli, Alessandro
Braun, Edward L.
Phillippy, Adam M.
Jarvis, Erich D.
author_facet Formenti, Giulio
Rhie, Arang
Balacco, Jennifer
Haase, Bettina
Mountcastle, Jacquelyn
Fedrigo, Olivier
Brown, Samara
Capodiferro, Marco Rosario
Al-Ajli, Farooq O.
Ambrosini, Roberto
Houde, Peter
Koren, Sergey
Oliver, Karen
Smith, Michelle
Skelton, Jason
Betteridge, Emma
Dolucan, Jale
Corton, Craig
Bista, Iliana
Torrance, James
Tracey, Alan
Wood, Jonathan
Uliano-Silva, Marcela
Howe, Kerstin
McCarthy, Shane
Winkler, Sylke
Kwak, Woori
Korlach, Jonas
Fungtammasan, Arkarachai
Fordham, Daniel
Costa, Vania
Mayes, Simon
Chiara, Matteo
Horner, David S.
Myers, Eugene
Durbin, Richard
Achilli, Alessandro
Braun, Edward L.
Phillippy, Adam M.
Jarvis, Erich D.
author_sort Formenti, Giulio
collection PubMed
description BACKGROUND: Modern sequencing technologies should make the assembly of the relatively small mitochondrial genomes an easy undertaking. However, few tools exist that address mitochondrial assembly directly. RESULTS: As part of the Vertebrate Genomes Project (VGP) we develop mitoVGP, a fully automated pipeline for similarity-based identification of mitochondrial reads and de novo assembly of mitochondrial genomes that incorporates both long (> 10 kbp, PacBio or Nanopore) and short (100–300 bp, Illumina) reads. Our pipeline leads to successful complete mitogenome assemblies of 100 vertebrate species of the VGP. We observe that tissue type and library size selection have considerable impact on mitogenome sequencing and assembly. Comparing our assemblies to purportedly complete reference mitogenomes based on short-read sequencing, we identify errors, missing sequences, and incomplete genes in those references, particularly in repetitive regions. Our assemblies also identify novel gene region duplications. The presence of repeats and duplications in over half of the species herein assembled indicates that their occurrence is a principle of mitochondrial structure rather than an exception, shedding new light on mitochondrial genome evolution and organization. CONCLUSIONS: Our results indicate that even in the “simple” case of vertebrate mitogenomes the completeness of many currently available reference sequences can be further improved, and caution should be exercised before claiming the complete assembly of a mitogenome, particularly from short reads alone. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-021-02336-9.
format Online
Article
Text
id pubmed-8082918
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-80829182021-04-29 Complete vertebrate mitogenomes reveal widespread repeats and gene duplications Formenti, Giulio Rhie, Arang Balacco, Jennifer Haase, Bettina Mountcastle, Jacquelyn Fedrigo, Olivier Brown, Samara Capodiferro, Marco Rosario Al-Ajli, Farooq O. Ambrosini, Roberto Houde, Peter Koren, Sergey Oliver, Karen Smith, Michelle Skelton, Jason Betteridge, Emma Dolucan, Jale Corton, Craig Bista, Iliana Torrance, James Tracey, Alan Wood, Jonathan Uliano-Silva, Marcela Howe, Kerstin McCarthy, Shane Winkler, Sylke Kwak, Woori Korlach, Jonas Fungtammasan, Arkarachai Fordham, Daniel Costa, Vania Mayes, Simon Chiara, Matteo Horner, David S. Myers, Eugene Durbin, Richard Achilli, Alessandro Braun, Edward L. Phillippy, Adam M. Jarvis, Erich D. Genome Biol Research BACKGROUND: Modern sequencing technologies should make the assembly of the relatively small mitochondrial genomes an easy undertaking. However, few tools exist that address mitochondrial assembly directly. RESULTS: As part of the Vertebrate Genomes Project (VGP) we develop mitoVGP, a fully automated pipeline for similarity-based identification of mitochondrial reads and de novo assembly of mitochondrial genomes that incorporates both long (> 10 kbp, PacBio or Nanopore) and short (100–300 bp, Illumina) reads. Our pipeline leads to successful complete mitogenome assemblies of 100 vertebrate species of the VGP. We observe that tissue type and library size selection have considerable impact on mitogenome sequencing and assembly. Comparing our assemblies to purportedly complete reference mitogenomes based on short-read sequencing, we identify errors, missing sequences, and incomplete genes in those references, particularly in repetitive regions. Our assemblies also identify novel gene region duplications. The presence of repeats and duplications in over half of the species herein assembled indicates that their occurrence is a principle of mitochondrial structure rather than an exception, shedding new light on mitochondrial genome evolution and organization. CONCLUSIONS: Our results indicate that even in the “simple” case of vertebrate mitogenomes the completeness of many currently available reference sequences can be further improved, and caution should be exercised before claiming the complete assembly of a mitogenome, particularly from short reads alone. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-021-02336-9. BioMed Central 2021-04-29 /pmc/articles/PMC8082918/ /pubmed/33910595 http://dx.doi.org/10.1186/s13059-021-02336-9 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Formenti, Giulio
Rhie, Arang
Balacco, Jennifer
Haase, Bettina
Mountcastle, Jacquelyn
Fedrigo, Olivier
Brown, Samara
Capodiferro, Marco Rosario
Al-Ajli, Farooq O.
Ambrosini, Roberto
Houde, Peter
Koren, Sergey
Oliver, Karen
Smith, Michelle
Skelton, Jason
Betteridge, Emma
Dolucan, Jale
Corton, Craig
Bista, Iliana
Torrance, James
Tracey, Alan
Wood, Jonathan
Uliano-Silva, Marcela
Howe, Kerstin
McCarthy, Shane
Winkler, Sylke
Kwak, Woori
Korlach, Jonas
Fungtammasan, Arkarachai
Fordham, Daniel
Costa, Vania
Mayes, Simon
Chiara, Matteo
Horner, David S.
Myers, Eugene
Durbin, Richard
Achilli, Alessandro
Braun, Edward L.
Phillippy, Adam M.
Jarvis, Erich D.
Complete vertebrate mitogenomes reveal widespread repeats and gene duplications
title Complete vertebrate mitogenomes reveal widespread repeats and gene duplications
title_full Complete vertebrate mitogenomes reveal widespread repeats and gene duplications
title_fullStr Complete vertebrate mitogenomes reveal widespread repeats and gene duplications
title_full_unstemmed Complete vertebrate mitogenomes reveal widespread repeats and gene duplications
title_short Complete vertebrate mitogenomes reveal widespread repeats and gene duplications
title_sort complete vertebrate mitogenomes reveal widespread repeats and gene duplications
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8082918/
https://www.ncbi.nlm.nih.gov/pubmed/33910595
http://dx.doi.org/10.1186/s13059-021-02336-9
work_keys_str_mv AT formentigiulio completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT rhiearang completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT balaccojennifer completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT haasebettina completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT mountcastlejacquelyn completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT fedrigoolivier completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT brownsamara completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT capodiferromarcorosario completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT alajlifarooqo completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT ambrosiniroberto completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT houdepeter completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT korensergey completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT oliverkaren completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT smithmichelle completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT skeltonjason completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT betteridgeemma completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT dolucanjale completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT cortoncraig completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT bistailiana completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT torrancejames completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT traceyalan completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT woodjonathan completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT ulianosilvamarcela completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT howekerstin completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT mccarthyshane completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT winklersylke completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT kwakwoori completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT korlachjonas completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT fungtammasanarkarachai completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT fordhamdaniel completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT costavania completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT mayessimon completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT chiaramatteo completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT hornerdavids completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT myerseugene completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT durbinrichard completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT achillialessandro completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT braunedwardl completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT phillippyadamm completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT jarviserichd completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications
AT completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications