Cargando…

A tool for mapping Single Nucleotide Polymorphisms using Graphics Processing Units

BACKGROUND: Single Nucleotide Polymorphism (SNP) genotyping analysis is very susceptible to SNPs chromosomal position errors. As it is known, SNPs mapping data are provided along the SNP arrays without any necessary information to assess in advance their accuracy. Moreover, these mapping data are re...

Descripción completa

Detalles Bibliográficos
Autores principales: Manconi, Andrea, Orro, Alessandro, Manca, Emanuele, Armano, Giuliano, Milanesi, Luciano
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4015528/
https://www.ncbi.nlm.nih.gov/pubmed/24564714
http://dx.doi.org/10.1186/1471-2105-15-S1-S10
_version_ 1782315349717286912
author Manconi, Andrea
Orro, Alessandro
Manca, Emanuele
Armano, Giuliano
Milanesi, Luciano
author_facet Manconi, Andrea
Orro, Alessandro
Manca, Emanuele
Armano, Giuliano
Milanesi, Luciano
author_sort Manconi, Andrea
collection PubMed
description BACKGROUND: Single Nucleotide Polymorphism (SNP) genotyping analysis is very susceptible to SNPs chromosomal position errors. As it is known, SNPs mapping data are provided along the SNP arrays without any necessary information to assess in advance their accuracy. Moreover, these mapping data are related to a given build of a genome and need to be updated when a new build is available. As a consequence, researchers often plan to remap SNPs with the aim to obtain more up-to-date SNPs chromosomal positions. In this work, we present G-SNPM a GPU (Graphics Processing Unit) based tool to map SNPs on a genome. METHODS: G-SNPM is a tool that maps a short sequence representative of a SNP against a reference DNA sequence in order to find the physical position of the SNP in that sequence. In G-SNPM each SNP is mapped on its related chromosome by means of an automatic three-stage pipeline. In the first stage, G-SNPM uses the GPU-based short-read mapping tool SOAP3-dp to parallel align on a reference chromosome its related sequences representative of a SNP. In the second stage G-SNPM uses another short-read mapping tool to remap the sequences unaligned or ambiguously aligned by SOAP3-dp (in this stage SHRiMP2 is used, which exploits specialized vector computing hardware to speed-up the dynamic programming algorithm of Smith-Waterman). In the last stage, G-SNPM analyzes the alignments obtained by SOAP3-dp and SHRiMP2 to identify the absolute position of each SNP. RESULTS AND CONCLUSIONS: To assess G-SNPM, we used it to remap the SNPs of some commercial chips. Experimental results shown that G-SNPM has been able to remap without ambiguity almost all SNPs. Based on modern GPUs, G-SNPM provides fast mappings without worsening the accuracy of the results. G-SNPM can be used to deal with specialized Genome Wide Association Studies (GWAS), as well as in annotation tasks that require to update the SNP mapping probes.
format Online
Article
Text
id pubmed-4015528
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40155282014-05-23 A tool for mapping Single Nucleotide Polymorphisms using Graphics Processing Units Manconi, Andrea Orro, Alessandro Manca, Emanuele Armano, Giuliano Milanesi, Luciano BMC Bioinformatics Research BACKGROUND: Single Nucleotide Polymorphism (SNP) genotyping analysis is very susceptible to SNPs chromosomal position errors. As it is known, SNPs mapping data are provided along the SNP arrays without any necessary information to assess in advance their accuracy. Moreover, these mapping data are related to a given build of a genome and need to be updated when a new build is available. As a consequence, researchers often plan to remap SNPs with the aim to obtain more up-to-date SNPs chromosomal positions. In this work, we present G-SNPM a GPU (Graphics Processing Unit) based tool to map SNPs on a genome. METHODS: G-SNPM is a tool that maps a short sequence representative of a SNP against a reference DNA sequence in order to find the physical position of the SNP in that sequence. In G-SNPM each SNP is mapped on its related chromosome by means of an automatic three-stage pipeline. In the first stage, G-SNPM uses the GPU-based short-read mapping tool SOAP3-dp to parallel align on a reference chromosome its related sequences representative of a SNP. In the second stage G-SNPM uses another short-read mapping tool to remap the sequences unaligned or ambiguously aligned by SOAP3-dp (in this stage SHRiMP2 is used, which exploits specialized vector computing hardware to speed-up the dynamic programming algorithm of Smith-Waterman). In the last stage, G-SNPM analyzes the alignments obtained by SOAP3-dp and SHRiMP2 to identify the absolute position of each SNP. RESULTS AND CONCLUSIONS: To assess G-SNPM, we used it to remap the SNPs of some commercial chips. Experimental results shown that G-SNPM has been able to remap without ambiguity almost all SNPs. Based on modern GPUs, G-SNPM provides fast mappings without worsening the accuracy of the results. G-SNPM can be used to deal with specialized Genome Wide Association Studies (GWAS), as well as in annotation tasks that require to update the SNP mapping probes. BioMed Central 2014-01-10 /pmc/articles/PMC4015528/ /pubmed/24564714 http://dx.doi.org/10.1186/1471-2105-15-S1-S10 Text en Copyright © 2014 Manconi et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Manconi, Andrea
Orro, Alessandro
Manca, Emanuele
Armano, Giuliano
Milanesi, Luciano
A tool for mapping Single Nucleotide Polymorphisms using Graphics Processing Units
title A tool for mapping Single Nucleotide Polymorphisms using Graphics Processing Units
title_full A tool for mapping Single Nucleotide Polymorphisms using Graphics Processing Units
title_fullStr A tool for mapping Single Nucleotide Polymorphisms using Graphics Processing Units
title_full_unstemmed A tool for mapping Single Nucleotide Polymorphisms using Graphics Processing Units
title_short A tool for mapping Single Nucleotide Polymorphisms using Graphics Processing Units
title_sort tool for mapping single nucleotide polymorphisms using graphics processing units
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4015528/
https://www.ncbi.nlm.nih.gov/pubmed/24564714
http://dx.doi.org/10.1186/1471-2105-15-S1-S10
work_keys_str_mv AT manconiandrea atoolformappingsinglenucleotidepolymorphismsusinggraphicsprocessingunits
AT orroalessandro atoolformappingsinglenucleotidepolymorphismsusinggraphicsprocessingunits
AT mancaemanuele atoolformappingsinglenucleotidepolymorphismsusinggraphicsprocessingunits
AT armanogiuliano atoolformappingsinglenucleotidepolymorphismsusinggraphicsprocessingunits
AT milanesiluciano atoolformappingsinglenucleotidepolymorphismsusinggraphicsprocessingunits
AT manconiandrea toolformappingsinglenucleotidepolymorphismsusinggraphicsprocessingunits
AT orroalessandro toolformappingsinglenucleotidepolymorphismsusinggraphicsprocessingunits
AT mancaemanuele toolformappingsinglenucleotidepolymorphismsusinggraphicsprocessingunits
AT armanogiuliano toolformappingsinglenucleotidepolymorphismsusinggraphicsprocessingunits
AT milanesiluciano toolformappingsinglenucleotidepolymorphismsusinggraphicsprocessingunits