Cargando…

G-Anchor: a novel approach for whole-genome comparative mapping utilizing evolutionary conserved DNA sequences

BACKGROUND: Cross-species whole-genome sequence alignment is a critical first step for genome comparative analyses, ranging from the detection of sequence variants to studies of chromosome evolution. Animal genomes are large and complex, and whole-genome alignment is a computationally intense proces...

Descripción completa

Detalles Bibliográficos
Autores principales: Lenis, Vasileios Panagiotis E, Swain, Martin, Larkin, Denis M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5961299/
https://www.ncbi.nlm.nih.gov/pubmed/29618053
http://dx.doi.org/10.1093/gigascience/giy017
_version_ 1783324705107738624
author Lenis, Vasileios Panagiotis E
Swain, Martin
Larkin, Denis M
author_facet Lenis, Vasileios Panagiotis E
Swain, Martin
Larkin, Denis M
author_sort Lenis, Vasileios Panagiotis E
collection PubMed
description BACKGROUND: Cross-species whole-genome sequence alignment is a critical first step for genome comparative analyses, ranging from the detection of sequence variants to studies of chromosome evolution. Animal genomes are large and complex, and whole-genome alignment is a computationally intense process, requiring expensive high-performance computing systems due to the need to explore extensive local alignments. With hundreds of sequenced animal genomes available from multiple projects, there is an increasing demand for genome comparative analyses. RESULTS: Here, we introduce G-Anchor, a new, fast, and efficient pipeline that uses a strictly limited but highly effective set of local sequence alignments to anchor (or map) an animal genome to another species’ reference genome. G-Anchor makes novel use of a databank of highly conserved DNA sequence elements. We demonstrate how these elements may be aligned to a pair of genomes, creating anchors. These anchors enable the rapid mapping of scaffolds from a de novo assembled genome to chromosome assemblies of a reference species. Our results demonstrate that G-Anchor can successfully anchor a vertebrate genome onto a phylogenetically related reference species genome using a desktop or laptop computer within a few hours and with comparable accuracy to that achieved by a highly accurate whole-genome alignment tool such as LASTZ. G-Anchor thus makes whole-genome comparisons accessible to researchers with limited computational resources. CONCLUSIONS: G-Anchor is a ready-to-use tool for anchoring a pair of vertebrate genomes. It may be used with large genomes that contain a significant fraction of evolutionally conserved DNA sequences and that are not highly repetitive, polypoid, or excessively fragmented. G-Anchor is not a substitute for whole-genome aligning software but can be used for fast and accurate initial genome comparisons. G-Anchor is freely available and a ready-to-use tool for the pairwise comparison of two genomes.
format Online
Article
Text
id pubmed-5961299
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-59612992018-06-06 G-Anchor: a novel approach for whole-genome comparative mapping utilizing evolutionary conserved DNA sequences Lenis, Vasileios Panagiotis E Swain, Martin Larkin, Denis M Gigascience Technical Note BACKGROUND: Cross-species whole-genome sequence alignment is a critical first step for genome comparative analyses, ranging from the detection of sequence variants to studies of chromosome evolution. Animal genomes are large and complex, and whole-genome alignment is a computationally intense process, requiring expensive high-performance computing systems due to the need to explore extensive local alignments. With hundreds of sequenced animal genomes available from multiple projects, there is an increasing demand for genome comparative analyses. RESULTS: Here, we introduce G-Anchor, a new, fast, and efficient pipeline that uses a strictly limited but highly effective set of local sequence alignments to anchor (or map) an animal genome to another species’ reference genome. G-Anchor makes novel use of a databank of highly conserved DNA sequence elements. We demonstrate how these elements may be aligned to a pair of genomes, creating anchors. These anchors enable the rapid mapping of scaffolds from a de novo assembled genome to chromosome assemblies of a reference species. Our results demonstrate that G-Anchor can successfully anchor a vertebrate genome onto a phylogenetically related reference species genome using a desktop or laptop computer within a few hours and with comparable accuracy to that achieved by a highly accurate whole-genome alignment tool such as LASTZ. G-Anchor thus makes whole-genome comparisons accessible to researchers with limited computational resources. CONCLUSIONS: G-Anchor is a ready-to-use tool for anchoring a pair of vertebrate genomes. It may be used with large genomes that contain a significant fraction of evolutionally conserved DNA sequences and that are not highly repetitive, polypoid, or excessively fragmented. G-Anchor is not a substitute for whole-genome aligning software but can be used for fast and accurate initial genome comparisons. G-Anchor is freely available and a ready-to-use tool for the pairwise comparison of two genomes. Oxford University Press 2018-04-03 /pmc/articles/PMC5961299/ /pubmed/29618053 http://dx.doi.org/10.1093/gigascience/giy017 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Lenis, Vasileios Panagiotis E
Swain, Martin
Larkin, Denis M
G-Anchor: a novel approach for whole-genome comparative mapping utilizing evolutionary conserved DNA sequences
title G-Anchor: a novel approach for whole-genome comparative mapping utilizing evolutionary conserved DNA sequences
title_full G-Anchor: a novel approach for whole-genome comparative mapping utilizing evolutionary conserved DNA sequences
title_fullStr G-Anchor: a novel approach for whole-genome comparative mapping utilizing evolutionary conserved DNA sequences
title_full_unstemmed G-Anchor: a novel approach for whole-genome comparative mapping utilizing evolutionary conserved DNA sequences
title_short G-Anchor: a novel approach for whole-genome comparative mapping utilizing evolutionary conserved DNA sequences
title_sort g-anchor: a novel approach for whole-genome comparative mapping utilizing evolutionary conserved dna sequences
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5961299/
https://www.ncbi.nlm.nih.gov/pubmed/29618053
http://dx.doi.org/10.1093/gigascience/giy017
work_keys_str_mv AT lenisvasileiospanagiotise ganchoranovelapproachforwholegenomecomparativemappingutilizingevolutionaryconserveddnasequences
AT swainmartin ganchoranovelapproachforwholegenomecomparativemappingutilizingevolutionaryconserveddnasequences
AT larkindenism ganchoranovelapproachforwholegenomecomparativemappingutilizingevolutionaryconserveddnasequences