Cargando…

Polymorphic edge detection (PED): two efficient methods of polymorphism detection from next-generation sequencing data

BACKGROUND: Accurate detection of polymorphisms with a next generation sequencer data is an important element of current genetic analysis. However, there is still no detection pipeline that is completely reliable. RESULT: We demonstrate two new detection methods of polymorphisms focusing on the Poly...

Descripción completa

Detalles Bibliográficos
Autores principales: Miyao, Akio, Kiyomiya, Jianyu Song, Iida, Keiko, Doi, Koji, Yasue, Hiroshi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6599308/
https://www.ncbi.nlm.nih.gov/pubmed/31253084
http://dx.doi.org/10.1186/s12859-019-2955-6
_version_ 1783430936522653696
author Miyao, Akio
Kiyomiya, Jianyu Song
Iida, Keiko
Doi, Koji
Yasue, Hiroshi
author_facet Miyao, Akio
Kiyomiya, Jianyu Song
Iida, Keiko
Doi, Koji
Yasue, Hiroshi
author_sort Miyao, Akio
collection PubMed
description BACKGROUND: Accurate detection of polymorphisms with a next generation sequencer data is an important element of current genetic analysis. However, there is still no detection pipeline that is completely reliable. RESULT: We demonstrate two new detection methods of polymorphisms focusing on the Polymorphic Edge (PED). In the matching between two homologous sequences, the first mismatched base to appear is the SNP, or the edge of the structural variation. The first method is based on k-mers from short reads and detects polymorphic edges with k-mers for which there is no match between target and control, making it possible to detect SNPs by direct comparison of short-reads in two datasets (target and control) without a reference genome sequence. The second method is based on bidirectional alignment to detect polymorphic edges, not only SNPs but also insertions, deletions, inversions and translocations. Using these two methods, we succeed in making a high-quality comparison map between rice cultivars showing good match to the theoretical value of introgression, and in detecting specific large deletions across cultivars. CONCLUSIONS: Using Polymorphic Edge Detection (PED), the k-mer method is able to detect SNPs by direct comparison of short-reads in two datasets without genomic alignment step, and the bidirectional alignment method is able to detect SNPs and structural variations from even single-end short-reads. The PED is an efficient tool to obtain accurate data for both SNPs and structural variations. AVAILABILITY: The PED software is available at: https://github.com/akiomiyao/ped. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2955-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6599308
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-65993082019-07-11 Polymorphic edge detection (PED): two efficient methods of polymorphism detection from next-generation sequencing data Miyao, Akio Kiyomiya, Jianyu Song Iida, Keiko Doi, Koji Yasue, Hiroshi BMC Bioinformatics Research Article BACKGROUND: Accurate detection of polymorphisms with a next generation sequencer data is an important element of current genetic analysis. However, there is still no detection pipeline that is completely reliable. RESULT: We demonstrate two new detection methods of polymorphisms focusing on the Polymorphic Edge (PED). In the matching between two homologous sequences, the first mismatched base to appear is the SNP, or the edge of the structural variation. The first method is based on k-mers from short reads and detects polymorphic edges with k-mers for which there is no match between target and control, making it possible to detect SNPs by direct comparison of short-reads in two datasets (target and control) without a reference genome sequence. The second method is based on bidirectional alignment to detect polymorphic edges, not only SNPs but also insertions, deletions, inversions and translocations. Using these two methods, we succeed in making a high-quality comparison map between rice cultivars showing good match to the theoretical value of introgression, and in detecting specific large deletions across cultivars. CONCLUSIONS: Using Polymorphic Edge Detection (PED), the k-mer method is able to detect SNPs by direct comparison of short-reads in two datasets without genomic alignment step, and the bidirectional alignment method is able to detect SNPs and structural variations from even single-end short-reads. The PED is an efficient tool to obtain accurate data for both SNPs and structural variations. AVAILABILITY: The PED software is available at: https://github.com/akiomiyao/ped. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2955-6) contains supplementary material, which is available to authorized users. BioMed Central 2019-06-28 /pmc/articles/PMC6599308/ /pubmed/31253084 http://dx.doi.org/10.1186/s12859-019-2955-6 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Miyao, Akio
Kiyomiya, Jianyu Song
Iida, Keiko
Doi, Koji
Yasue, Hiroshi
Polymorphic edge detection (PED): two efficient methods of polymorphism detection from next-generation sequencing data
title Polymorphic edge detection (PED): two efficient methods of polymorphism detection from next-generation sequencing data
title_full Polymorphic edge detection (PED): two efficient methods of polymorphism detection from next-generation sequencing data
title_fullStr Polymorphic edge detection (PED): two efficient methods of polymorphism detection from next-generation sequencing data
title_full_unstemmed Polymorphic edge detection (PED): two efficient methods of polymorphism detection from next-generation sequencing data
title_short Polymorphic edge detection (PED): two efficient methods of polymorphism detection from next-generation sequencing data
title_sort polymorphic edge detection (ped): two efficient methods of polymorphism detection from next-generation sequencing data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6599308/
https://www.ncbi.nlm.nih.gov/pubmed/31253084
http://dx.doi.org/10.1186/s12859-019-2955-6
work_keys_str_mv AT miyaoakio polymorphicedgedetectionpedtwoefficientmethodsofpolymorphismdetectionfromnextgenerationsequencingdata
AT kiyomiyajianyusong polymorphicedgedetectionpedtwoefficientmethodsofpolymorphismdetectionfromnextgenerationsequencingdata
AT iidakeiko polymorphicedgedetectionpedtwoefficientmethodsofpolymorphismdetectionfromnextgenerationsequencingdata
AT doikoji polymorphicedgedetectionpedtwoefficientmethodsofpolymorphismdetectionfromnextgenerationsequencingdata
AT yasuehiroshi polymorphicedgedetectionpedtwoefficientmethodsofpolymorphismdetectionfromnextgenerationsequencingdata