Cargando…
Polymorphic edge detection (PED): two efficient methods of polymorphism detection from next-generation sequencing data
BACKGROUND: Accurate detection of polymorphisms with a next generation sequencer data is an important element of current genetic analysis. However, there is still no detection pipeline that is completely reliable. RESULT: We demonstrate two new detection methods of polymorphisms focusing on the Poly...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6599308/ https://www.ncbi.nlm.nih.gov/pubmed/31253084 http://dx.doi.org/10.1186/s12859-019-2955-6 |
_version_ | 1783430936522653696 |
---|---|
author | Miyao, Akio Kiyomiya, Jianyu Song Iida, Keiko Doi, Koji Yasue, Hiroshi |
author_facet | Miyao, Akio Kiyomiya, Jianyu Song Iida, Keiko Doi, Koji Yasue, Hiroshi |
author_sort | Miyao, Akio |
collection | PubMed |
description | BACKGROUND: Accurate detection of polymorphisms with a next generation sequencer data is an important element of current genetic analysis. However, there is still no detection pipeline that is completely reliable. RESULT: We demonstrate two new detection methods of polymorphisms focusing on the Polymorphic Edge (PED). In the matching between two homologous sequences, the first mismatched base to appear is the SNP, or the edge of the structural variation. The first method is based on k-mers from short reads and detects polymorphic edges with k-mers for which there is no match between target and control, making it possible to detect SNPs by direct comparison of short-reads in two datasets (target and control) without a reference genome sequence. The second method is based on bidirectional alignment to detect polymorphic edges, not only SNPs but also insertions, deletions, inversions and translocations. Using these two methods, we succeed in making a high-quality comparison map between rice cultivars showing good match to the theoretical value of introgression, and in detecting specific large deletions across cultivars. CONCLUSIONS: Using Polymorphic Edge Detection (PED), the k-mer method is able to detect SNPs by direct comparison of short-reads in two datasets without genomic alignment step, and the bidirectional alignment method is able to detect SNPs and structural variations from even single-end short-reads. The PED is an efficient tool to obtain accurate data for both SNPs and structural variations. AVAILABILITY: The PED software is available at: https://github.com/akiomiyao/ped. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2955-6) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6599308 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-65993082019-07-11 Polymorphic edge detection (PED): two efficient methods of polymorphism detection from next-generation sequencing data Miyao, Akio Kiyomiya, Jianyu Song Iida, Keiko Doi, Koji Yasue, Hiroshi BMC Bioinformatics Research Article BACKGROUND: Accurate detection of polymorphisms with a next generation sequencer data is an important element of current genetic analysis. However, there is still no detection pipeline that is completely reliable. RESULT: We demonstrate two new detection methods of polymorphisms focusing on the Polymorphic Edge (PED). In the matching between two homologous sequences, the first mismatched base to appear is the SNP, or the edge of the structural variation. The first method is based on k-mers from short reads and detects polymorphic edges with k-mers for which there is no match between target and control, making it possible to detect SNPs by direct comparison of short-reads in two datasets (target and control) without a reference genome sequence. The second method is based on bidirectional alignment to detect polymorphic edges, not only SNPs but also insertions, deletions, inversions and translocations. Using these two methods, we succeed in making a high-quality comparison map between rice cultivars showing good match to the theoretical value of introgression, and in detecting specific large deletions across cultivars. CONCLUSIONS: Using Polymorphic Edge Detection (PED), the k-mer method is able to detect SNPs by direct comparison of short-reads in two datasets without genomic alignment step, and the bidirectional alignment method is able to detect SNPs and structural variations from even single-end short-reads. The PED is an efficient tool to obtain accurate data for both SNPs and structural variations. AVAILABILITY: The PED software is available at: https://github.com/akiomiyao/ped. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2955-6) contains supplementary material, which is available to authorized users. BioMed Central 2019-06-28 /pmc/articles/PMC6599308/ /pubmed/31253084 http://dx.doi.org/10.1186/s12859-019-2955-6 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Miyao, Akio Kiyomiya, Jianyu Song Iida, Keiko Doi, Koji Yasue, Hiroshi Polymorphic edge detection (PED): two efficient methods of polymorphism detection from next-generation sequencing data |
title | Polymorphic edge detection (PED): two efficient methods of polymorphism detection from next-generation sequencing data |
title_full | Polymorphic edge detection (PED): two efficient methods of polymorphism detection from next-generation sequencing data |
title_fullStr | Polymorphic edge detection (PED): two efficient methods of polymorphism detection from next-generation sequencing data |
title_full_unstemmed | Polymorphic edge detection (PED): two efficient methods of polymorphism detection from next-generation sequencing data |
title_short | Polymorphic edge detection (PED): two efficient methods of polymorphism detection from next-generation sequencing data |
title_sort | polymorphic edge detection (ped): two efficient methods of polymorphism detection from next-generation sequencing data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6599308/ https://www.ncbi.nlm.nih.gov/pubmed/31253084 http://dx.doi.org/10.1186/s12859-019-2955-6 |
work_keys_str_mv | AT miyaoakio polymorphicedgedetectionpedtwoefficientmethodsofpolymorphismdetectionfromnextgenerationsequencingdata AT kiyomiyajianyusong polymorphicedgedetectionpedtwoefficientmethodsofpolymorphismdetectionfromnextgenerationsequencingdata AT iidakeiko polymorphicedgedetectionpedtwoefficientmethodsofpolymorphismdetectionfromnextgenerationsequencingdata AT doikoji polymorphicedgedetectionpedtwoefficientmethodsofpolymorphismdetectionfromnextgenerationsequencingdata AT yasuehiroshi polymorphicedgedetectionpedtwoefficientmethodsofpolymorphismdetectionfromnextgenerationsequencingdata |