Cargando…

MAGERI: Computational pipeline for molecular-barcoded targeted resequencing

Unique molecular identifiers (UMIs) show outstanding performance in targeted high-throughput resequencing, being the most promising approach for the accurate identification of rare variants in complex DNA samples. This approach has application in multiple areas, including cancer diagnostics, thus de...

Descripción completa

Detalles Bibliográficos
Autores principales: Shugay, Mikhail, Zaretsky, Andrew R., Shagin, Dmitriy A., Shagina, Irina A., Volchenkov, Ivan A., Shelenkov, Andrew A., Lebedin, Mikhail Y., Bagaev, Dmitriy V., Lukyanov, Sergey, Chudakov, Dmitriy M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5419444/
https://www.ncbi.nlm.nih.gov/pubmed/28475621
http://dx.doi.org/10.1371/journal.pcbi.1005480
_version_ 1783234225343823872
author Shugay, Mikhail
Zaretsky, Andrew R.
Shagin, Dmitriy A.
Shagina, Irina A.
Volchenkov, Ivan A.
Shelenkov, Andrew A.
Lebedin, Mikhail Y.
Bagaev, Dmitriy V.
Lukyanov, Sergey
Chudakov, Dmitriy M.
author_facet Shugay, Mikhail
Zaretsky, Andrew R.
Shagin, Dmitriy A.
Shagina, Irina A.
Volchenkov, Ivan A.
Shelenkov, Andrew A.
Lebedin, Mikhail Y.
Bagaev, Dmitriy V.
Lukyanov, Sergey
Chudakov, Dmitriy M.
author_sort Shugay, Mikhail
collection PubMed
description Unique molecular identifiers (UMIs) show outstanding performance in targeted high-throughput resequencing, being the most promising approach for the accurate identification of rare variants in complex DNA samples. This approach has application in multiple areas, including cancer diagnostics, thus demanding dedicated software and algorithms. Here we introduce MAGERI, a computational pipeline that efficiently handles all caveats of UMI-based analysis to obtain high-fidelity mutation profiles and call ultra-rare variants. Using an extensive set of benchmark datasets including gold-standard biological samples with known variant frequencies, cell-free DNA from tumor patient blood samples and publicly available UMI-encoded datasets we demonstrate that our method is both robust and efficient in calling rare variants. The versatility of our software is supported by accurate results obtained for both tumor DNA and viral RNA samples in datasets prepared using three different UMI-based protocols.
format Online
Article
Text
id pubmed-5419444
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-54194442017-05-14 MAGERI: Computational pipeline for molecular-barcoded targeted resequencing Shugay, Mikhail Zaretsky, Andrew R. Shagin, Dmitriy A. Shagina, Irina A. Volchenkov, Ivan A. Shelenkov, Andrew A. Lebedin, Mikhail Y. Bagaev, Dmitriy V. Lukyanov, Sergey Chudakov, Dmitriy M. PLoS Comput Biol Research Article Unique molecular identifiers (UMIs) show outstanding performance in targeted high-throughput resequencing, being the most promising approach for the accurate identification of rare variants in complex DNA samples. This approach has application in multiple areas, including cancer diagnostics, thus demanding dedicated software and algorithms. Here we introduce MAGERI, a computational pipeline that efficiently handles all caveats of UMI-based analysis to obtain high-fidelity mutation profiles and call ultra-rare variants. Using an extensive set of benchmark datasets including gold-standard biological samples with known variant frequencies, cell-free DNA from tumor patient blood samples and publicly available UMI-encoded datasets we demonstrate that our method is both robust and efficient in calling rare variants. The versatility of our software is supported by accurate results obtained for both tumor DNA and viral RNA samples in datasets prepared using three different UMI-based protocols. Public Library of Science 2017-05-05 /pmc/articles/PMC5419444/ /pubmed/28475621 http://dx.doi.org/10.1371/journal.pcbi.1005480 Text en © 2017 Shugay et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Shugay, Mikhail
Zaretsky, Andrew R.
Shagin, Dmitriy A.
Shagina, Irina A.
Volchenkov, Ivan A.
Shelenkov, Andrew A.
Lebedin, Mikhail Y.
Bagaev, Dmitriy V.
Lukyanov, Sergey
Chudakov, Dmitriy M.
MAGERI: Computational pipeline for molecular-barcoded targeted resequencing
title MAGERI: Computational pipeline for molecular-barcoded targeted resequencing
title_full MAGERI: Computational pipeline for molecular-barcoded targeted resequencing
title_fullStr MAGERI: Computational pipeline for molecular-barcoded targeted resequencing
title_full_unstemmed MAGERI: Computational pipeline for molecular-barcoded targeted resequencing
title_short MAGERI: Computational pipeline for molecular-barcoded targeted resequencing
title_sort mageri: computational pipeline for molecular-barcoded targeted resequencing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5419444/
https://www.ncbi.nlm.nih.gov/pubmed/28475621
http://dx.doi.org/10.1371/journal.pcbi.1005480
work_keys_str_mv AT shugaymikhail magericomputationalpipelineformolecularbarcodedtargetedresequencing
AT zaretskyandrewr magericomputationalpipelineformolecularbarcodedtargetedresequencing
AT shagindmitriya magericomputationalpipelineformolecularbarcodedtargetedresequencing
AT shaginairinaa magericomputationalpipelineformolecularbarcodedtargetedresequencing
AT volchenkovivana magericomputationalpipelineformolecularbarcodedtargetedresequencing
AT shelenkovandrewa magericomputationalpipelineformolecularbarcodedtargetedresequencing
AT lebedinmikhaily magericomputationalpipelineformolecularbarcodedtargetedresequencing
AT bagaevdmitriyv magericomputationalpipelineformolecularbarcodedtargetedresequencing
AT lukyanovsergey magericomputationalpipelineformolecularbarcodedtargetedresequencing
AT chudakovdmitriym magericomputationalpipelineformolecularbarcodedtargetedresequencing