Cargando…

Evaluation of exome variants using the Ion Proton Platform to sequence error-prone regions

The Ion Proton sequencer from Thermo Fisher accurately determines sequence variants from target regions with a rapid turnaround time at a low cost. However, misleading variant-calling errors can occur. We performed a systematic evaluation and manual curation of read-level alignments for the 675 ultr...

Descripción completa

Detalles Bibliográficos
Autores principales: Seo, Heewon, Park, Yoomi, Min, Byung Joo, Seo, Myung Eui, Kim, Ju Han
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5524428/
https://www.ncbi.nlm.nih.gov/pubmed/28742110
http://dx.doi.org/10.1371/journal.pone.0181304
_version_ 1783252466994774016
author Seo, Heewon
Park, Yoomi
Min, Byung Joo
Seo, Myung Eui
Kim, Ju Han
author_facet Seo, Heewon
Park, Yoomi
Min, Byung Joo
Seo, Myung Eui
Kim, Ju Han
author_sort Seo, Heewon
collection PubMed
description The Ion Proton sequencer from Thermo Fisher accurately determines sequence variants from target regions with a rapid turnaround time at a low cost. However, misleading variant-calling errors can occur. We performed a systematic evaluation and manual curation of read-level alignments for the 675 ultrarare variants reported by the Ion Proton sequencer from 27 whole-exome sequencing data but that are not present in either the 1000 Genomes Project and the Exome Aggregation Consortium. We classified positive variant calls into 393 highly likely false positives, 126 likely false positives, and 156 likely true positives, which comprised 58.2%, 18.7%, and 23.1% of the variants, respectively. We identified four distinct error patterns of variant calling that may be bioinformatically corrected when using different strategies: simplicity region, SNV cluster, peripheral sequence read, and base inversion. Local de novo assembly successfully corrected 201 (38.7%) of the 519 highly likely or likely false positives. We also demonstrate that the two sequencing kits from Thermo Fisher (the Ion PI Sequencing 200 kit V3 and the Ion PI Hi-Q kit) exhibit different error profiles across different error types. A refined calling algorithm with better polymerase may improve the performance of the Ion Proton sequencing platform.
format Online
Article
Text
id pubmed-5524428
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-55244282017-08-07 Evaluation of exome variants using the Ion Proton Platform to sequence error-prone regions Seo, Heewon Park, Yoomi Min, Byung Joo Seo, Myung Eui Kim, Ju Han PLoS One Research Article The Ion Proton sequencer from Thermo Fisher accurately determines sequence variants from target regions with a rapid turnaround time at a low cost. However, misleading variant-calling errors can occur. We performed a systematic evaluation and manual curation of read-level alignments for the 675 ultrarare variants reported by the Ion Proton sequencer from 27 whole-exome sequencing data but that are not present in either the 1000 Genomes Project and the Exome Aggregation Consortium. We classified positive variant calls into 393 highly likely false positives, 126 likely false positives, and 156 likely true positives, which comprised 58.2%, 18.7%, and 23.1% of the variants, respectively. We identified four distinct error patterns of variant calling that may be bioinformatically corrected when using different strategies: simplicity region, SNV cluster, peripheral sequence read, and base inversion. Local de novo assembly successfully corrected 201 (38.7%) of the 519 highly likely or likely false positives. We also demonstrate that the two sequencing kits from Thermo Fisher (the Ion PI Sequencing 200 kit V3 and the Ion PI Hi-Q kit) exhibit different error profiles across different error types. A refined calling algorithm with better polymerase may improve the performance of the Ion Proton sequencing platform. Public Library of Science 2017-07-24 /pmc/articles/PMC5524428/ /pubmed/28742110 http://dx.doi.org/10.1371/journal.pone.0181304 Text en © 2017 Seo et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Seo, Heewon
Park, Yoomi
Min, Byung Joo
Seo, Myung Eui
Kim, Ju Han
Evaluation of exome variants using the Ion Proton Platform to sequence error-prone regions
title Evaluation of exome variants using the Ion Proton Platform to sequence error-prone regions
title_full Evaluation of exome variants using the Ion Proton Platform to sequence error-prone regions
title_fullStr Evaluation of exome variants using the Ion Proton Platform to sequence error-prone regions
title_full_unstemmed Evaluation of exome variants using the Ion Proton Platform to sequence error-prone regions
title_short Evaluation of exome variants using the Ion Proton Platform to sequence error-prone regions
title_sort evaluation of exome variants using the ion proton platform to sequence error-prone regions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5524428/
https://www.ncbi.nlm.nih.gov/pubmed/28742110
http://dx.doi.org/10.1371/journal.pone.0181304
work_keys_str_mv AT seoheewon evaluationofexomevariantsusingtheionprotonplatformtosequenceerrorproneregions
AT parkyoomi evaluationofexomevariantsusingtheionprotonplatformtosequenceerrorproneregions
AT minbyungjoo evaluationofexomevariantsusingtheionprotonplatformtosequenceerrorproneregions
AT seomyungeui evaluationofexomevariantsusingtheionprotonplatformtosequenceerrorproneregions
AT kimjuhan evaluationofexomevariantsusingtheionprotonplatformtosequenceerrorproneregions