Cargando…
Visualization of consensus genome structure without using a reference genome
BACKGROUND: Standard graphical tools for whole genome comparison require a reference genome. However, any reference is also subject to annotation biases and rearrangements, and may not serve as the standard except for those of extensively studied model species. To fully exploit the rapidly accumulat...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374595/ https://www.ncbi.nlm.nih.gov/pubmed/28361716 http://dx.doi.org/10.1186/s12864-017-3499-7 |
_version_ | 1782518921265414144 |
---|---|
author | Tada, Ipputa Tanizawa, Yasuhiro Arita, Masanori |
author_facet | Tada, Ipputa Tanizawa, Yasuhiro Arita, Masanori |
author_sort | Tada, Ipputa |
collection | PubMed |
description | BACKGROUND: Standard graphical tools for whole genome comparison require a reference genome. However, any reference is also subject to annotation biases and rearrangements, and may not serve as the standard except for those of extensively studied model species. To fully exploit the rapidly accumulating sequence data from the recent sequencing technologies, genome comparison without any reference has been anticipated. RESULTS: We introduce a circular genome visualizer to compare complete genomes of closely related species. This tool visualizes the position of orthologous gene clusters rather than actual sequences or their features, thereby achieving the comparative view without using a single reference genome. The essential information is the matrix of orthologous gene clusters whose positions (not sequences) are color-coded in circular graphics. As a demonstration, comparison of 14 Lactobacillus paracasei strains and one L. casei strain revealed not only large-scale rearrangements but also genomic islands that are strain-specific. Comparison of 73 Helicobacter pylori strains confirmed their genetic consistency and also revealed the three general patterns of large-scale genome inversions. CONCLUSIONS: From the ample sequence information in the GenBank/ENA/DDBJ repository, we can reconstruct a genomic consensus for particular species. By visualizing multiple strains at a glance, we can identify conserved as well as strain-specific regions in multiply sequenced genomes. Positional consistency for orthologous genes provides information orthogonal to major sequence features such as the GC content or sequence similarity of marker genes. The positional comparison is therefore useful for identifying large-scale genome rearrangements or gene transfers. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3499-7) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5374595 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-53745952017-03-31 Visualization of consensus genome structure without using a reference genome Tada, Ipputa Tanizawa, Yasuhiro Arita, Masanori BMC Genomics Research BACKGROUND: Standard graphical tools for whole genome comparison require a reference genome. However, any reference is also subject to annotation biases and rearrangements, and may not serve as the standard except for those of extensively studied model species. To fully exploit the rapidly accumulating sequence data from the recent sequencing technologies, genome comparison without any reference has been anticipated. RESULTS: We introduce a circular genome visualizer to compare complete genomes of closely related species. This tool visualizes the position of orthologous gene clusters rather than actual sequences or their features, thereby achieving the comparative view without using a single reference genome. The essential information is the matrix of orthologous gene clusters whose positions (not sequences) are color-coded in circular graphics. As a demonstration, comparison of 14 Lactobacillus paracasei strains and one L. casei strain revealed not only large-scale rearrangements but also genomic islands that are strain-specific. Comparison of 73 Helicobacter pylori strains confirmed their genetic consistency and also revealed the three general patterns of large-scale genome inversions. CONCLUSIONS: From the ample sequence information in the GenBank/ENA/DDBJ repository, we can reconstruct a genomic consensus for particular species. By visualizing multiple strains at a glance, we can identify conserved as well as strain-specific regions in multiply sequenced genomes. Positional consistency for orthologous genes provides information orthogonal to major sequence features such as the GC content or sequence similarity of marker genes. The positional comparison is therefore useful for identifying large-scale genome rearrangements or gene transfers. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3499-7) contains supplementary material, which is available to authorized users. BioMed Central 2017-03-14 /pmc/articles/PMC5374595/ /pubmed/28361716 http://dx.doi.org/10.1186/s12864-017-3499-7 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Tada, Ipputa Tanizawa, Yasuhiro Arita, Masanori Visualization of consensus genome structure without using a reference genome |
title | Visualization of consensus genome structure without using a reference genome |
title_full | Visualization of consensus genome structure without using a reference genome |
title_fullStr | Visualization of consensus genome structure without using a reference genome |
title_full_unstemmed | Visualization of consensus genome structure without using a reference genome |
title_short | Visualization of consensus genome structure without using a reference genome |
title_sort | visualization of consensus genome structure without using a reference genome |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374595/ https://www.ncbi.nlm.nih.gov/pubmed/28361716 http://dx.doi.org/10.1186/s12864-017-3499-7 |
work_keys_str_mv | AT tadaipputa visualizationofconsensusgenomestructurewithoutusingareferencegenome AT tanizawayasuhiro visualizationofconsensusgenomestructurewithoutusingareferencegenome AT aritamasanori visualizationofconsensusgenomestructurewithoutusingareferencegenome |