Cargando…
A tool for mapping Single Nucleotide Polymorphisms using Graphics Processing Units
BACKGROUND: Single Nucleotide Polymorphism (SNP) genotyping analysis is very susceptible to SNPs chromosomal position errors. As it is known, SNPs mapping data are provided along the SNP arrays without any necessary information to assess in advance their accuracy. Moreover, these mapping data are re...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4015528/ https://www.ncbi.nlm.nih.gov/pubmed/24564714 http://dx.doi.org/10.1186/1471-2105-15-S1-S10 |
_version_ | 1782315349717286912 |
---|---|
author | Manconi, Andrea Orro, Alessandro Manca, Emanuele Armano, Giuliano Milanesi, Luciano |
author_facet | Manconi, Andrea Orro, Alessandro Manca, Emanuele Armano, Giuliano Milanesi, Luciano |
author_sort | Manconi, Andrea |
collection | PubMed |
description | BACKGROUND: Single Nucleotide Polymorphism (SNP) genotyping analysis is very susceptible to SNPs chromosomal position errors. As it is known, SNPs mapping data are provided along the SNP arrays without any necessary information to assess in advance their accuracy. Moreover, these mapping data are related to a given build of a genome and need to be updated when a new build is available. As a consequence, researchers often plan to remap SNPs with the aim to obtain more up-to-date SNPs chromosomal positions. In this work, we present G-SNPM a GPU (Graphics Processing Unit) based tool to map SNPs on a genome. METHODS: G-SNPM is a tool that maps a short sequence representative of a SNP against a reference DNA sequence in order to find the physical position of the SNP in that sequence. In G-SNPM each SNP is mapped on its related chromosome by means of an automatic three-stage pipeline. In the first stage, G-SNPM uses the GPU-based short-read mapping tool SOAP3-dp to parallel align on a reference chromosome its related sequences representative of a SNP. In the second stage G-SNPM uses another short-read mapping tool to remap the sequences unaligned or ambiguously aligned by SOAP3-dp (in this stage SHRiMP2 is used, which exploits specialized vector computing hardware to speed-up the dynamic programming algorithm of Smith-Waterman). In the last stage, G-SNPM analyzes the alignments obtained by SOAP3-dp and SHRiMP2 to identify the absolute position of each SNP. RESULTS AND CONCLUSIONS: To assess G-SNPM, we used it to remap the SNPs of some commercial chips. Experimental results shown that G-SNPM has been able to remap without ambiguity almost all SNPs. Based on modern GPUs, G-SNPM provides fast mappings without worsening the accuracy of the results. G-SNPM can be used to deal with specialized Genome Wide Association Studies (GWAS), as well as in annotation tasks that require to update the SNP mapping probes. |
format | Online Article Text |
id | pubmed-4015528 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40155282014-05-23 A tool for mapping Single Nucleotide Polymorphisms using Graphics Processing Units Manconi, Andrea Orro, Alessandro Manca, Emanuele Armano, Giuliano Milanesi, Luciano BMC Bioinformatics Research BACKGROUND: Single Nucleotide Polymorphism (SNP) genotyping analysis is very susceptible to SNPs chromosomal position errors. As it is known, SNPs mapping data are provided along the SNP arrays without any necessary information to assess in advance their accuracy. Moreover, these mapping data are related to a given build of a genome and need to be updated when a new build is available. As a consequence, researchers often plan to remap SNPs with the aim to obtain more up-to-date SNPs chromosomal positions. In this work, we present G-SNPM a GPU (Graphics Processing Unit) based tool to map SNPs on a genome. METHODS: G-SNPM is a tool that maps a short sequence representative of a SNP against a reference DNA sequence in order to find the physical position of the SNP in that sequence. In G-SNPM each SNP is mapped on its related chromosome by means of an automatic three-stage pipeline. In the first stage, G-SNPM uses the GPU-based short-read mapping tool SOAP3-dp to parallel align on a reference chromosome its related sequences representative of a SNP. In the second stage G-SNPM uses another short-read mapping tool to remap the sequences unaligned or ambiguously aligned by SOAP3-dp (in this stage SHRiMP2 is used, which exploits specialized vector computing hardware to speed-up the dynamic programming algorithm of Smith-Waterman). In the last stage, G-SNPM analyzes the alignments obtained by SOAP3-dp and SHRiMP2 to identify the absolute position of each SNP. RESULTS AND CONCLUSIONS: To assess G-SNPM, we used it to remap the SNPs of some commercial chips. Experimental results shown that G-SNPM has been able to remap without ambiguity almost all SNPs. Based on modern GPUs, G-SNPM provides fast mappings without worsening the accuracy of the results. G-SNPM can be used to deal with specialized Genome Wide Association Studies (GWAS), as well as in annotation tasks that require to update the SNP mapping probes. BioMed Central 2014-01-10 /pmc/articles/PMC4015528/ /pubmed/24564714 http://dx.doi.org/10.1186/1471-2105-15-S1-S10 Text en Copyright © 2014 Manconi et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Manconi, Andrea Orro, Alessandro Manca, Emanuele Armano, Giuliano Milanesi, Luciano A tool for mapping Single Nucleotide Polymorphisms using Graphics Processing Units |
title | A tool for mapping Single Nucleotide Polymorphisms using Graphics Processing Units |
title_full | A tool for mapping Single Nucleotide Polymorphisms using Graphics Processing Units |
title_fullStr | A tool for mapping Single Nucleotide Polymorphisms using Graphics Processing Units |
title_full_unstemmed | A tool for mapping Single Nucleotide Polymorphisms using Graphics Processing Units |
title_short | A tool for mapping Single Nucleotide Polymorphisms using Graphics Processing Units |
title_sort | tool for mapping single nucleotide polymorphisms using graphics processing units |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4015528/ https://www.ncbi.nlm.nih.gov/pubmed/24564714 http://dx.doi.org/10.1186/1471-2105-15-S1-S10 |
work_keys_str_mv | AT manconiandrea atoolformappingsinglenucleotidepolymorphismsusinggraphicsprocessingunits AT orroalessandro atoolformappingsinglenucleotidepolymorphismsusinggraphicsprocessingunits AT mancaemanuele atoolformappingsinglenucleotidepolymorphismsusinggraphicsprocessingunits AT armanogiuliano atoolformappingsinglenucleotidepolymorphismsusinggraphicsprocessingunits AT milanesiluciano atoolformappingsinglenucleotidepolymorphismsusinggraphicsprocessingunits AT manconiandrea toolformappingsinglenucleotidepolymorphismsusinggraphicsprocessingunits AT orroalessandro toolformappingsinglenucleotidepolymorphismsusinggraphicsprocessingunits AT mancaemanuele toolformappingsinglenucleotidepolymorphismsusinggraphicsprocessingunits AT armanogiuliano toolformappingsinglenucleotidepolymorphismsusinggraphicsprocessingunits AT milanesiluciano toolformappingsinglenucleotidepolymorphismsusinggraphicsprocessingunits |