Cargando…

VaDiR: an integrated approach to Variant Detection in RNA

BACKGROUND: Advances in next-generation DNA sequencing technologies are now enabling detailed characterization of sequence variations in cancer genomes. With whole-genome sequencing, variations in coding and non-coding sequences can be discovered. But the cost associated with it is currently limitin...

Descripción completa

Detalles Bibliográficos
Autores principales: Neums, Lisa, Suenaga, Seiji, Beyerlein, Peter, Anders, Sara, Koestler, Devin, Mariani, Andrea, Chien, Jeremy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5827345/
https://www.ncbi.nlm.nih.gov/pubmed/29267927
http://dx.doi.org/10.1093/gigascience/gix122
_version_ 1783302461239328768
author Neums, Lisa
Suenaga, Seiji
Beyerlein, Peter
Anders, Sara
Koestler, Devin
Mariani, Andrea
Chien, Jeremy
author_facet Neums, Lisa
Suenaga, Seiji
Beyerlein, Peter
Anders, Sara
Koestler, Devin
Mariani, Andrea
Chien, Jeremy
author_sort Neums, Lisa
collection PubMed
description BACKGROUND: Advances in next-generation DNA sequencing technologies are now enabling detailed characterization of sequence variations in cancer genomes. With whole-genome sequencing, variations in coding and non-coding sequences can be discovered. But the cost associated with it is currently limiting its general use in research. Whole-exome sequencing is used to characterize sequence variations in coding regions, but the cost associated with capture reagents and biases in capture rate limit its full use in research. Additional limitations include uncertainty in assigning the functional significance of the mutations when these mutations are observed in the non-coding region or in genes that are not expressed in cancer tissue. RESULTS: We investigated the feasibility of uncovering mutations from expressed genes using RNA sequencing datasets with a method called Variant Detection in RNA(VaDiR) that integrates 3 variant callers, namely: SNPiR, RVBoost, and MuTect2. The combination of all 3 methods, which we called Tier 1 variants, produced the highest precision with true positive mutations from RNA-seq that could be validated at the DNA level. We also found that the integration of Tier 1 variants with those called by MuTect2 and SNPiR produced the highest recall with acceptable precision. Finally, we observed a higher rate of mutation discovery in genes that are expressed at higher levels. CONCLUSIONS: Our method, VaDiR, provides a possibility of uncovering mutations from RNA sequencing datasets that could be useful in further functional analysis. In addition, our approach allows orthogonal validation of DNA-based mutation discovery by providing complementary sequence variation analysis from paired RNA/DNA sequencing datasets.
format Online
Article
Text
id pubmed-5827345
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58273452018-03-05 VaDiR: an integrated approach to Variant Detection in RNA Neums, Lisa Suenaga, Seiji Beyerlein, Peter Anders, Sara Koestler, Devin Mariani, Andrea Chien, Jeremy Gigascience Technical Note BACKGROUND: Advances in next-generation DNA sequencing technologies are now enabling detailed characterization of sequence variations in cancer genomes. With whole-genome sequencing, variations in coding and non-coding sequences can be discovered. But the cost associated with it is currently limiting its general use in research. Whole-exome sequencing is used to characterize sequence variations in coding regions, but the cost associated with capture reagents and biases in capture rate limit its full use in research. Additional limitations include uncertainty in assigning the functional significance of the mutations when these mutations are observed in the non-coding region or in genes that are not expressed in cancer tissue. RESULTS: We investigated the feasibility of uncovering mutations from expressed genes using RNA sequencing datasets with a method called Variant Detection in RNA(VaDiR) that integrates 3 variant callers, namely: SNPiR, RVBoost, and MuTect2. The combination of all 3 methods, which we called Tier 1 variants, produced the highest precision with true positive mutations from RNA-seq that could be validated at the DNA level. We also found that the integration of Tier 1 variants with those called by MuTect2 and SNPiR produced the highest recall with acceptable precision. Finally, we observed a higher rate of mutation discovery in genes that are expressed at higher levels. CONCLUSIONS: Our method, VaDiR, provides a possibility of uncovering mutations from RNA sequencing datasets that could be useful in further functional analysis. In addition, our approach allows orthogonal validation of DNA-based mutation discovery by providing complementary sequence variation analysis from paired RNA/DNA sequencing datasets. Oxford University Press 2017-12-18 /pmc/articles/PMC5827345/ /pubmed/29267927 http://dx.doi.org/10.1093/gigascience/gix122 Text en © The Authors 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Neums, Lisa
Suenaga, Seiji
Beyerlein, Peter
Anders, Sara
Koestler, Devin
Mariani, Andrea
Chien, Jeremy
VaDiR: an integrated approach to Variant Detection in RNA
title VaDiR: an integrated approach to Variant Detection in RNA
title_full VaDiR: an integrated approach to Variant Detection in RNA
title_fullStr VaDiR: an integrated approach to Variant Detection in RNA
title_full_unstemmed VaDiR: an integrated approach to Variant Detection in RNA
title_short VaDiR: an integrated approach to Variant Detection in RNA
title_sort vadir: an integrated approach to variant detection in rna
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5827345/
https://www.ncbi.nlm.nih.gov/pubmed/29267927
http://dx.doi.org/10.1093/gigascience/gix122
work_keys_str_mv AT neumslisa vadiranintegratedapproachtovariantdetectioninrna
AT suenagaseiji vadiranintegratedapproachtovariantdetectioninrna
AT beyerleinpeter vadiranintegratedapproachtovariantdetectioninrna
AT anderssara vadiranintegratedapproachtovariantdetectioninrna
AT koestlerdevin vadiranintegratedapproachtovariantdetectioninrna
AT marianiandrea vadiranintegratedapproachtovariantdetectioninrna
AT chienjeremy vadiranintegratedapproachtovariantdetectioninrna