Cargando…
A practical method to detect SNVs and indels from whole genome and exome sequencing data
The recent development of massively parallel sequencing technology has allowed the creation of comprehensive catalogs of genetic variation. However, due to the relatively high sequencing error rate for short read sequence data, sophisticated analysis methods are required to obtain high-quality varia...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3703611/ https://www.ncbi.nlm.nih.gov/pubmed/23831772 http://dx.doi.org/10.1038/srep02161 |
_version_ | 1782275928513052672 |
---|---|
author | Shigemizu, Daichi Fujimoto, Akihiro Akiyama, Shintaro Abe, Tetsuo Nakano, Kaoru Boroevich, Keith A. Yamamoto, Yujiro Furuta, Mayuko Kubo, Michiaki Nakagawa, Hidewaki Tsunoda, Tatsuhiko |
author_facet | Shigemizu, Daichi Fujimoto, Akihiro Akiyama, Shintaro Abe, Tetsuo Nakano, Kaoru Boroevich, Keith A. Yamamoto, Yujiro Furuta, Mayuko Kubo, Michiaki Nakagawa, Hidewaki Tsunoda, Tatsuhiko |
author_sort | Shigemizu, Daichi |
collection | PubMed |
description | The recent development of massively parallel sequencing technology has allowed the creation of comprehensive catalogs of genetic variation. However, due to the relatively high sequencing error rate for short read sequence data, sophisticated analysis methods are required to obtain high-quality variant calls. Here, we developed a probabilistic multinomial method for the detection of single nucleotide variants (SNVs) as well as short insertions and deletions (indels) in whole genome sequencing (WGS) and whole exome sequencing (WES) data for single sample calling. Evaluation with DNA genotyping arrays revealed a concordance rate of 99.98% for WGS calls and 99.99% for WES calls. Sanger sequencing of the discordant calls determined the false positive and false negative rates for the WGS (0.0068% and 0.17%) and WES (0.0036% and 0.0084%) datasets. Furthermore, short indels were identified with high accuracy (WGS: 94.7%, WES: 97.3%). We believe our method can contribute to the greater understanding of human diseases. |
format | Online Article Text |
id | pubmed-3703611 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-37036112013-07-08 A practical method to detect SNVs and indels from whole genome and exome sequencing data Shigemizu, Daichi Fujimoto, Akihiro Akiyama, Shintaro Abe, Tetsuo Nakano, Kaoru Boroevich, Keith A. Yamamoto, Yujiro Furuta, Mayuko Kubo, Michiaki Nakagawa, Hidewaki Tsunoda, Tatsuhiko Sci Rep Article The recent development of massively parallel sequencing technology has allowed the creation of comprehensive catalogs of genetic variation. However, due to the relatively high sequencing error rate for short read sequence data, sophisticated analysis methods are required to obtain high-quality variant calls. Here, we developed a probabilistic multinomial method for the detection of single nucleotide variants (SNVs) as well as short insertions and deletions (indels) in whole genome sequencing (WGS) and whole exome sequencing (WES) data for single sample calling. Evaluation with DNA genotyping arrays revealed a concordance rate of 99.98% for WGS calls and 99.99% for WES calls. Sanger sequencing of the discordant calls determined the false positive and false negative rates for the WGS (0.0068% and 0.17%) and WES (0.0036% and 0.0084%) datasets. Furthermore, short indels were identified with high accuracy (WGS: 94.7%, WES: 97.3%). We believe our method can contribute to the greater understanding of human diseases. Nature Publishing Group 2013-07-08 /pmc/articles/PMC3703611/ /pubmed/23831772 http://dx.doi.org/10.1038/srep02161 Text en Copyright © 2013, Macmillan Publishers Limited. All rights reserved http://creativecommons.org/licenses/by-nc-nd/3.0/ This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/3.0/ |
spellingShingle | Article Shigemizu, Daichi Fujimoto, Akihiro Akiyama, Shintaro Abe, Tetsuo Nakano, Kaoru Boroevich, Keith A. Yamamoto, Yujiro Furuta, Mayuko Kubo, Michiaki Nakagawa, Hidewaki Tsunoda, Tatsuhiko A practical method to detect SNVs and indels from whole genome and exome sequencing data |
title | A practical method to detect SNVs and indels from whole genome and exome sequencing data |
title_full | A practical method to detect SNVs and indels from whole genome and exome sequencing data |
title_fullStr | A practical method to detect SNVs and indels from whole genome and exome sequencing data |
title_full_unstemmed | A practical method to detect SNVs and indels from whole genome and exome sequencing data |
title_short | A practical method to detect SNVs and indels from whole genome and exome sequencing data |
title_sort | practical method to detect snvs and indels from whole genome and exome sequencing data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3703611/ https://www.ncbi.nlm.nih.gov/pubmed/23831772 http://dx.doi.org/10.1038/srep02161 |
work_keys_str_mv | AT shigemizudaichi apracticalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata AT fujimotoakihiro apracticalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata AT akiyamashintaro apracticalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata AT abetetsuo apracticalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata AT nakanokaoru apracticalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata AT boroevichkeitha apracticalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata AT yamamotoyujiro apracticalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata AT furutamayuko apracticalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata AT kubomichiaki apracticalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata AT nakagawahidewaki apracticalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata AT tsunodatatsuhiko apracticalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata AT shigemizudaichi practicalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata AT fujimotoakihiro practicalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata AT akiyamashintaro practicalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata AT abetetsuo practicalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata AT nakanokaoru practicalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata AT boroevichkeitha practicalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata AT yamamotoyujiro practicalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata AT furutamayuko practicalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata AT kubomichiaki practicalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata AT nakagawahidewaki practicalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata AT tsunodatatsuhiko practicalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata |