Cargando…

A practical method to detect SNVs and indels from whole genome and exome sequencing data

The recent development of massively parallel sequencing technology has allowed the creation of comprehensive catalogs of genetic variation. However, due to the relatively high sequencing error rate for short read sequence data, sophisticated analysis methods are required to obtain high-quality varia...

Descripción completa

Detalles Bibliográficos
Autores principales: Shigemizu, Daichi, Fujimoto, Akihiro, Akiyama, Shintaro, Abe, Tetsuo, Nakano, Kaoru, Boroevich, Keith A., Yamamoto, Yujiro, Furuta, Mayuko, Kubo, Michiaki, Nakagawa, Hidewaki, Tsunoda, Tatsuhiko
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3703611/
https://www.ncbi.nlm.nih.gov/pubmed/23831772
http://dx.doi.org/10.1038/srep02161
_version_ 1782275928513052672
author Shigemizu, Daichi
Fujimoto, Akihiro
Akiyama, Shintaro
Abe, Tetsuo
Nakano, Kaoru
Boroevich, Keith A.
Yamamoto, Yujiro
Furuta, Mayuko
Kubo, Michiaki
Nakagawa, Hidewaki
Tsunoda, Tatsuhiko
author_facet Shigemizu, Daichi
Fujimoto, Akihiro
Akiyama, Shintaro
Abe, Tetsuo
Nakano, Kaoru
Boroevich, Keith A.
Yamamoto, Yujiro
Furuta, Mayuko
Kubo, Michiaki
Nakagawa, Hidewaki
Tsunoda, Tatsuhiko
author_sort Shigemizu, Daichi
collection PubMed
description The recent development of massively parallel sequencing technology has allowed the creation of comprehensive catalogs of genetic variation. However, due to the relatively high sequencing error rate for short read sequence data, sophisticated analysis methods are required to obtain high-quality variant calls. Here, we developed a probabilistic multinomial method for the detection of single nucleotide variants (SNVs) as well as short insertions and deletions (indels) in whole genome sequencing (WGS) and whole exome sequencing (WES) data for single sample calling. Evaluation with DNA genotyping arrays revealed a concordance rate of 99.98% for WGS calls and 99.99% for WES calls. Sanger sequencing of the discordant calls determined the false positive and false negative rates for the WGS (0.0068% and 0.17%) and WES (0.0036% and 0.0084%) datasets. Furthermore, short indels were identified with high accuracy (WGS: 94.7%, WES: 97.3%). We believe our method can contribute to the greater understanding of human diseases.
format Online
Article
Text
id pubmed-3703611
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-37036112013-07-08 A practical method to detect SNVs and indels from whole genome and exome sequencing data Shigemizu, Daichi Fujimoto, Akihiro Akiyama, Shintaro Abe, Tetsuo Nakano, Kaoru Boroevich, Keith A. Yamamoto, Yujiro Furuta, Mayuko Kubo, Michiaki Nakagawa, Hidewaki Tsunoda, Tatsuhiko Sci Rep Article The recent development of massively parallel sequencing technology has allowed the creation of comprehensive catalogs of genetic variation. However, due to the relatively high sequencing error rate for short read sequence data, sophisticated analysis methods are required to obtain high-quality variant calls. Here, we developed a probabilistic multinomial method for the detection of single nucleotide variants (SNVs) as well as short insertions and deletions (indels) in whole genome sequencing (WGS) and whole exome sequencing (WES) data for single sample calling. Evaluation with DNA genotyping arrays revealed a concordance rate of 99.98% for WGS calls and 99.99% for WES calls. Sanger sequencing of the discordant calls determined the false positive and false negative rates for the WGS (0.0068% and 0.17%) and WES (0.0036% and 0.0084%) datasets. Furthermore, short indels were identified with high accuracy (WGS: 94.7%, WES: 97.3%). We believe our method can contribute to the greater understanding of human diseases. Nature Publishing Group 2013-07-08 /pmc/articles/PMC3703611/ /pubmed/23831772 http://dx.doi.org/10.1038/srep02161 Text en Copyright © 2013, Macmillan Publishers Limited. All rights reserved http://creativecommons.org/licenses/by-nc-nd/3.0/ This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/3.0/
spellingShingle Article
Shigemizu, Daichi
Fujimoto, Akihiro
Akiyama, Shintaro
Abe, Tetsuo
Nakano, Kaoru
Boroevich, Keith A.
Yamamoto, Yujiro
Furuta, Mayuko
Kubo, Michiaki
Nakagawa, Hidewaki
Tsunoda, Tatsuhiko
A practical method to detect SNVs and indels from whole genome and exome sequencing data
title A practical method to detect SNVs and indels from whole genome and exome sequencing data
title_full A practical method to detect SNVs and indels from whole genome and exome sequencing data
title_fullStr A practical method to detect SNVs and indels from whole genome and exome sequencing data
title_full_unstemmed A practical method to detect SNVs and indels from whole genome and exome sequencing data
title_short A practical method to detect SNVs and indels from whole genome and exome sequencing data
title_sort practical method to detect snvs and indels from whole genome and exome sequencing data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3703611/
https://www.ncbi.nlm.nih.gov/pubmed/23831772
http://dx.doi.org/10.1038/srep02161
work_keys_str_mv AT shigemizudaichi apracticalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata
AT fujimotoakihiro apracticalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata
AT akiyamashintaro apracticalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata
AT abetetsuo apracticalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata
AT nakanokaoru apracticalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata
AT boroevichkeitha apracticalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata
AT yamamotoyujiro apracticalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata
AT furutamayuko apracticalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata
AT kubomichiaki apracticalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata
AT nakagawahidewaki apracticalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata
AT tsunodatatsuhiko apracticalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata
AT shigemizudaichi practicalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata
AT fujimotoakihiro practicalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata
AT akiyamashintaro practicalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata
AT abetetsuo practicalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata
AT nakanokaoru practicalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata
AT boroevichkeitha practicalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata
AT yamamotoyujiro practicalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata
AT furutamayuko practicalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata
AT kubomichiaki practicalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata
AT nakagawahidewaki practicalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata
AT tsunodatatsuhiko practicalmethodtodetectsnvsandindelsfromwholegenomeandexomesequencingdata