Cargando…
Complete vertebrate mitogenomes reveal widespread repeats and gene duplications
BACKGROUND: Modern sequencing technologies should make the assembly of the relatively small mitochondrial genomes an easy undertaking. However, few tools exist that address mitochondrial assembly directly. RESULTS: As part of the Vertebrate Genomes Project (VGP) we develop mitoVGP, a fully automated...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8082918/ https://www.ncbi.nlm.nih.gov/pubmed/33910595 http://dx.doi.org/10.1186/s13059-021-02336-9 |
_version_ | 1783685930356310016 |
---|---|
author | Formenti, Giulio Rhie, Arang Balacco, Jennifer Haase, Bettina Mountcastle, Jacquelyn Fedrigo, Olivier Brown, Samara Capodiferro, Marco Rosario Al-Ajli, Farooq O. Ambrosini, Roberto Houde, Peter Koren, Sergey Oliver, Karen Smith, Michelle Skelton, Jason Betteridge, Emma Dolucan, Jale Corton, Craig Bista, Iliana Torrance, James Tracey, Alan Wood, Jonathan Uliano-Silva, Marcela Howe, Kerstin McCarthy, Shane Winkler, Sylke Kwak, Woori Korlach, Jonas Fungtammasan, Arkarachai Fordham, Daniel Costa, Vania Mayes, Simon Chiara, Matteo Horner, David S. Myers, Eugene Durbin, Richard Achilli, Alessandro Braun, Edward L. Phillippy, Adam M. Jarvis, Erich D. |
author_facet | Formenti, Giulio Rhie, Arang Balacco, Jennifer Haase, Bettina Mountcastle, Jacquelyn Fedrigo, Olivier Brown, Samara Capodiferro, Marco Rosario Al-Ajli, Farooq O. Ambrosini, Roberto Houde, Peter Koren, Sergey Oliver, Karen Smith, Michelle Skelton, Jason Betteridge, Emma Dolucan, Jale Corton, Craig Bista, Iliana Torrance, James Tracey, Alan Wood, Jonathan Uliano-Silva, Marcela Howe, Kerstin McCarthy, Shane Winkler, Sylke Kwak, Woori Korlach, Jonas Fungtammasan, Arkarachai Fordham, Daniel Costa, Vania Mayes, Simon Chiara, Matteo Horner, David S. Myers, Eugene Durbin, Richard Achilli, Alessandro Braun, Edward L. Phillippy, Adam M. Jarvis, Erich D. |
author_sort | Formenti, Giulio |
collection | PubMed |
description | BACKGROUND: Modern sequencing technologies should make the assembly of the relatively small mitochondrial genomes an easy undertaking. However, few tools exist that address mitochondrial assembly directly. RESULTS: As part of the Vertebrate Genomes Project (VGP) we develop mitoVGP, a fully automated pipeline for similarity-based identification of mitochondrial reads and de novo assembly of mitochondrial genomes that incorporates both long (> 10 kbp, PacBio or Nanopore) and short (100–300 bp, Illumina) reads. Our pipeline leads to successful complete mitogenome assemblies of 100 vertebrate species of the VGP. We observe that tissue type and library size selection have considerable impact on mitogenome sequencing and assembly. Comparing our assemblies to purportedly complete reference mitogenomes based on short-read sequencing, we identify errors, missing sequences, and incomplete genes in those references, particularly in repetitive regions. Our assemblies also identify novel gene region duplications. The presence of repeats and duplications in over half of the species herein assembled indicates that their occurrence is a principle of mitochondrial structure rather than an exception, shedding new light on mitochondrial genome evolution and organization. CONCLUSIONS: Our results indicate that even in the “simple” case of vertebrate mitogenomes the completeness of many currently available reference sequences can be further improved, and caution should be exercised before claiming the complete assembly of a mitogenome, particularly from short reads alone. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-021-02336-9. |
format | Online Article Text |
id | pubmed-8082918 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-80829182021-04-29 Complete vertebrate mitogenomes reveal widespread repeats and gene duplications Formenti, Giulio Rhie, Arang Balacco, Jennifer Haase, Bettina Mountcastle, Jacquelyn Fedrigo, Olivier Brown, Samara Capodiferro, Marco Rosario Al-Ajli, Farooq O. Ambrosini, Roberto Houde, Peter Koren, Sergey Oliver, Karen Smith, Michelle Skelton, Jason Betteridge, Emma Dolucan, Jale Corton, Craig Bista, Iliana Torrance, James Tracey, Alan Wood, Jonathan Uliano-Silva, Marcela Howe, Kerstin McCarthy, Shane Winkler, Sylke Kwak, Woori Korlach, Jonas Fungtammasan, Arkarachai Fordham, Daniel Costa, Vania Mayes, Simon Chiara, Matteo Horner, David S. Myers, Eugene Durbin, Richard Achilli, Alessandro Braun, Edward L. Phillippy, Adam M. Jarvis, Erich D. Genome Biol Research BACKGROUND: Modern sequencing technologies should make the assembly of the relatively small mitochondrial genomes an easy undertaking. However, few tools exist that address mitochondrial assembly directly. RESULTS: As part of the Vertebrate Genomes Project (VGP) we develop mitoVGP, a fully automated pipeline for similarity-based identification of mitochondrial reads and de novo assembly of mitochondrial genomes that incorporates both long (> 10 kbp, PacBio or Nanopore) and short (100–300 bp, Illumina) reads. Our pipeline leads to successful complete mitogenome assemblies of 100 vertebrate species of the VGP. We observe that tissue type and library size selection have considerable impact on mitogenome sequencing and assembly. Comparing our assemblies to purportedly complete reference mitogenomes based on short-read sequencing, we identify errors, missing sequences, and incomplete genes in those references, particularly in repetitive regions. Our assemblies also identify novel gene region duplications. The presence of repeats and duplications in over half of the species herein assembled indicates that their occurrence is a principle of mitochondrial structure rather than an exception, shedding new light on mitochondrial genome evolution and organization. CONCLUSIONS: Our results indicate that even in the “simple” case of vertebrate mitogenomes the completeness of many currently available reference sequences can be further improved, and caution should be exercised before claiming the complete assembly of a mitogenome, particularly from short reads alone. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-021-02336-9. BioMed Central 2021-04-29 /pmc/articles/PMC8082918/ /pubmed/33910595 http://dx.doi.org/10.1186/s13059-021-02336-9 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Formenti, Giulio Rhie, Arang Balacco, Jennifer Haase, Bettina Mountcastle, Jacquelyn Fedrigo, Olivier Brown, Samara Capodiferro, Marco Rosario Al-Ajli, Farooq O. Ambrosini, Roberto Houde, Peter Koren, Sergey Oliver, Karen Smith, Michelle Skelton, Jason Betteridge, Emma Dolucan, Jale Corton, Craig Bista, Iliana Torrance, James Tracey, Alan Wood, Jonathan Uliano-Silva, Marcela Howe, Kerstin McCarthy, Shane Winkler, Sylke Kwak, Woori Korlach, Jonas Fungtammasan, Arkarachai Fordham, Daniel Costa, Vania Mayes, Simon Chiara, Matteo Horner, David S. Myers, Eugene Durbin, Richard Achilli, Alessandro Braun, Edward L. Phillippy, Adam M. Jarvis, Erich D. Complete vertebrate mitogenomes reveal widespread repeats and gene duplications |
title | Complete vertebrate mitogenomes reveal widespread repeats and gene duplications |
title_full | Complete vertebrate mitogenomes reveal widespread repeats and gene duplications |
title_fullStr | Complete vertebrate mitogenomes reveal widespread repeats and gene duplications |
title_full_unstemmed | Complete vertebrate mitogenomes reveal widespread repeats and gene duplications |
title_short | Complete vertebrate mitogenomes reveal widespread repeats and gene duplications |
title_sort | complete vertebrate mitogenomes reveal widespread repeats and gene duplications |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8082918/ https://www.ncbi.nlm.nih.gov/pubmed/33910595 http://dx.doi.org/10.1186/s13059-021-02336-9 |
work_keys_str_mv | AT formentigiulio completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT rhiearang completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT balaccojennifer completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT haasebettina completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT mountcastlejacquelyn completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT fedrigoolivier completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT brownsamara completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT capodiferromarcorosario completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT alajlifarooqo completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT ambrosiniroberto completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT houdepeter completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT korensergey completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT oliverkaren completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT smithmichelle completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT skeltonjason completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT betteridgeemma completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT dolucanjale completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT cortoncraig completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT bistailiana completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT torrancejames completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT traceyalan completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT woodjonathan completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT ulianosilvamarcela completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT howekerstin completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT mccarthyshane completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT winklersylke completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT kwakwoori completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT korlachjonas completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT fungtammasanarkarachai completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT fordhamdaniel completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT costavania completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT mayessimon completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT chiaramatteo completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT hornerdavids completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT myerseugene completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT durbinrichard completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT achillialessandro completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT braunedwardl completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT phillippyadamm completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT jarviserichd completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications AT completevertebratemitogenomesrevealwidespreadrepeatsandgeneduplications |