Cargando…

galign: A Tool for Rapid Genome Polymorphism Discovery

BACKGROUND: Highly parallel sequencing technologies have become important tools in the analysis of sequence polymorphisms on a genomic scale. However, the development of customized software to analyze data produced by these methods has lagged behind. METHODS/PRINCIPAL FINDINGS: Here I describe a too...

Descripción completa

Detalles Bibliográficos
Autor principal: Shaham, Shai
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2746318/
https://www.ncbi.nlm.nih.gov/pubmed/19779626
http://dx.doi.org/10.1371/journal.pone.0007188
_version_ 1782172040349876224
author Shaham, Shai
author_facet Shaham, Shai
author_sort Shaham, Shai
collection PubMed
description BACKGROUND: Highly parallel sequencing technologies have become important tools in the analysis of sequence polymorphisms on a genomic scale. However, the development of customized software to analyze data produced by these methods has lagged behind. METHODS/PRINCIPAL FINDINGS: Here I describe a tool, ‘galign’, designed to identify polymorphisms between sequence reads obtained using Illumina/Solexa technology and a reference genome. The ‘galign’ alignment tool does not use Smith-Waterman matrices for sequence comparisons. Instead, a simple algorithm comparing parsed sequence reads to parsed reference genome sequences is used. ‘galign’ output is geared towards immediate user application, displaying polymorphism locations, nucleotide changes, and relevant predicted amino-acid changes for ease of information processing. To do so, ‘galign’ requires several accessory files easily derived from an annotated reference genome. Direct sequencing as well as in silico studies demonstrate that ‘galign’ provides lesion predictions comparable in accuracy to available prediction programs, accompanied by greater processing speed and more user-friendly output. We demonstrate the use of ‘galign’ to identify mutations leading to phenotypic consequences in C. elegans. CONCLUSION/SIGNIFICANCE: Our studies suggest that ‘galign’ is a useful tool for polymorphism discovery, and is of immediate utility for sequence mining in C. elegans.
format Text
id pubmed-2746318
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-27463182009-09-25 galign: A Tool for Rapid Genome Polymorphism Discovery Shaham, Shai PLoS One Research Article BACKGROUND: Highly parallel sequencing technologies have become important tools in the analysis of sequence polymorphisms on a genomic scale. However, the development of customized software to analyze data produced by these methods has lagged behind. METHODS/PRINCIPAL FINDINGS: Here I describe a tool, ‘galign’, designed to identify polymorphisms between sequence reads obtained using Illumina/Solexa technology and a reference genome. The ‘galign’ alignment tool does not use Smith-Waterman matrices for sequence comparisons. Instead, a simple algorithm comparing parsed sequence reads to parsed reference genome sequences is used. ‘galign’ output is geared towards immediate user application, displaying polymorphism locations, nucleotide changes, and relevant predicted amino-acid changes for ease of information processing. To do so, ‘galign’ requires several accessory files easily derived from an annotated reference genome. Direct sequencing as well as in silico studies demonstrate that ‘galign’ provides lesion predictions comparable in accuracy to available prediction programs, accompanied by greater processing speed and more user-friendly output. We demonstrate the use of ‘galign’ to identify mutations leading to phenotypic consequences in C. elegans. CONCLUSION/SIGNIFICANCE: Our studies suggest that ‘galign’ is a useful tool for polymorphism discovery, and is of immediate utility for sequence mining in C. elegans. Public Library of Science 2009-09-25 /pmc/articles/PMC2746318/ /pubmed/19779626 http://dx.doi.org/10.1371/journal.pone.0007188 Text en Shai Shaham. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Shaham, Shai
galign: A Tool for Rapid Genome Polymorphism Discovery
title galign: A Tool for Rapid Genome Polymorphism Discovery
title_full galign: A Tool for Rapid Genome Polymorphism Discovery
title_fullStr galign: A Tool for Rapid Genome Polymorphism Discovery
title_full_unstemmed galign: A Tool for Rapid Genome Polymorphism Discovery
title_short galign: A Tool for Rapid Genome Polymorphism Discovery
title_sort galign: a tool for rapid genome polymorphism discovery
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2746318/
https://www.ncbi.nlm.nih.gov/pubmed/19779626
http://dx.doi.org/10.1371/journal.pone.0007188
work_keys_str_mv AT shahamshai galignatoolforrapidgenomepolymorphismdiscovery