Cargando…
galign: A Tool for Rapid Genome Polymorphism Discovery
BACKGROUND: Highly parallel sequencing technologies have become important tools in the analysis of sequence polymorphisms on a genomic scale. However, the development of customized software to analyze data produced by these methods has lagged behind. METHODS/PRINCIPAL FINDINGS: Here I describe a too...
Autor principal: | |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2746318/ https://www.ncbi.nlm.nih.gov/pubmed/19779626 http://dx.doi.org/10.1371/journal.pone.0007188 |
_version_ | 1782172040349876224 |
---|---|
author | Shaham, Shai |
author_facet | Shaham, Shai |
author_sort | Shaham, Shai |
collection | PubMed |
description | BACKGROUND: Highly parallel sequencing technologies have become important tools in the analysis of sequence polymorphisms on a genomic scale. However, the development of customized software to analyze data produced by these methods has lagged behind. METHODS/PRINCIPAL FINDINGS: Here I describe a tool, ‘galign’, designed to identify polymorphisms between sequence reads obtained using Illumina/Solexa technology and a reference genome. The ‘galign’ alignment tool does not use Smith-Waterman matrices for sequence comparisons. Instead, a simple algorithm comparing parsed sequence reads to parsed reference genome sequences is used. ‘galign’ output is geared towards immediate user application, displaying polymorphism locations, nucleotide changes, and relevant predicted amino-acid changes for ease of information processing. To do so, ‘galign’ requires several accessory files easily derived from an annotated reference genome. Direct sequencing as well as in silico studies demonstrate that ‘galign’ provides lesion predictions comparable in accuracy to available prediction programs, accompanied by greater processing speed and more user-friendly output. We demonstrate the use of ‘galign’ to identify mutations leading to phenotypic consequences in C. elegans. CONCLUSION/SIGNIFICANCE: Our studies suggest that ‘galign’ is a useful tool for polymorphism discovery, and is of immediate utility for sequence mining in C. elegans. |
format | Text |
id | pubmed-2746318 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-27463182009-09-25 galign: A Tool for Rapid Genome Polymorphism Discovery Shaham, Shai PLoS One Research Article BACKGROUND: Highly parallel sequencing technologies have become important tools in the analysis of sequence polymorphisms on a genomic scale. However, the development of customized software to analyze data produced by these methods has lagged behind. METHODS/PRINCIPAL FINDINGS: Here I describe a tool, ‘galign’, designed to identify polymorphisms between sequence reads obtained using Illumina/Solexa technology and a reference genome. The ‘galign’ alignment tool does not use Smith-Waterman matrices for sequence comparisons. Instead, a simple algorithm comparing parsed sequence reads to parsed reference genome sequences is used. ‘galign’ output is geared towards immediate user application, displaying polymorphism locations, nucleotide changes, and relevant predicted amino-acid changes for ease of information processing. To do so, ‘galign’ requires several accessory files easily derived from an annotated reference genome. Direct sequencing as well as in silico studies demonstrate that ‘galign’ provides lesion predictions comparable in accuracy to available prediction programs, accompanied by greater processing speed and more user-friendly output. We demonstrate the use of ‘galign’ to identify mutations leading to phenotypic consequences in C. elegans. CONCLUSION/SIGNIFICANCE: Our studies suggest that ‘galign’ is a useful tool for polymorphism discovery, and is of immediate utility for sequence mining in C. elegans. Public Library of Science 2009-09-25 /pmc/articles/PMC2746318/ /pubmed/19779626 http://dx.doi.org/10.1371/journal.pone.0007188 Text en Shai Shaham. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Shaham, Shai galign: A Tool for Rapid Genome Polymorphism Discovery |
title | galign: A Tool for Rapid Genome Polymorphism Discovery |
title_full | galign: A Tool for Rapid Genome Polymorphism Discovery |
title_fullStr | galign: A Tool for Rapid Genome Polymorphism Discovery |
title_full_unstemmed | galign: A Tool for Rapid Genome Polymorphism Discovery |
title_short | galign: A Tool for Rapid Genome Polymorphism Discovery |
title_sort | galign: a tool for rapid genome polymorphism discovery |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2746318/ https://www.ncbi.nlm.nih.gov/pubmed/19779626 http://dx.doi.org/10.1371/journal.pone.0007188 |
work_keys_str_mv | AT shahamshai galignatoolforrapidgenomepolymorphismdiscovery |