Cargando…

RNA variant identification discrepancy among splice-aware alignment algorithms

Next-generation sequencing (NGS) techniques have been generating various molecular maps, including transcriptomes via RNA-seq. Although the primary purpose of RNA-seq is to quantify the expression level of known genes, RNA variants are also identifiable. However, care must be taken to account for RN...

Descripción completa

Detalles Bibliográficos
Autores principales: Hong, Ji Hyung, Ko, Yoon Ho, Kang, Keunsoo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6072070/
https://www.ncbi.nlm.nih.gov/pubmed/30071094
http://dx.doi.org/10.1371/journal.pone.0201822
_version_ 1783343965786865664
author Hong, Ji Hyung
Ko, Yoon Ho
Kang, Keunsoo
author_facet Hong, Ji Hyung
Ko, Yoon Ho
Kang, Keunsoo
author_sort Hong, Ji Hyung
collection PubMed
description Next-generation sequencing (NGS) techniques have been generating various molecular maps, including transcriptomes via RNA-seq. Although the primary purpose of RNA-seq is to quantify the expression level of known genes, RNA variants are also identifiable. However, care must be taken to account for RNA’s dynamic nature. In this study, we evaluated the following popular splice-aware alignment algorithms in the context of RNA variant-calling analysis: HISAT2, STAR, STAR (two-pass mode), Subread, and Subjunc. For this, we performed RNA-seq with ten pieces of invasive ductal carcinoma from breast tissue and three pieces of adjacent normal tissue from a single patient. These RNA-seq data were used to evaluate the performance of splice-aware aligners. Surprisingly, the number of common potential RNA editing sites (pRESs) identified by all alignment algorithms was less than 2% of the total. The main cause of this difference was the mapped reads on the splice junctions. In addition, the RNA quality significantly affected the outcome. Therefore, researchers must consider these experimental and bioinformatic features during RNA variant analysis. Further investigations of common pRESs discovered that BDH1, CCDC137, and TBC1D10A transcripts contained a single non-synonymous RNA variant that was unique to breast cancer tissue compared to adjacent normal tissue; thus, further clinical validation is required.
format Online
Article
Text
id pubmed-6072070
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-60720702018-08-16 RNA variant identification discrepancy among splice-aware alignment algorithms Hong, Ji Hyung Ko, Yoon Ho Kang, Keunsoo PLoS One Research Article Next-generation sequencing (NGS) techniques have been generating various molecular maps, including transcriptomes via RNA-seq. Although the primary purpose of RNA-seq is to quantify the expression level of known genes, RNA variants are also identifiable. However, care must be taken to account for RNA’s dynamic nature. In this study, we evaluated the following popular splice-aware alignment algorithms in the context of RNA variant-calling analysis: HISAT2, STAR, STAR (two-pass mode), Subread, and Subjunc. For this, we performed RNA-seq with ten pieces of invasive ductal carcinoma from breast tissue and three pieces of adjacent normal tissue from a single patient. These RNA-seq data were used to evaluate the performance of splice-aware aligners. Surprisingly, the number of common potential RNA editing sites (pRESs) identified by all alignment algorithms was less than 2% of the total. The main cause of this difference was the mapped reads on the splice junctions. In addition, the RNA quality significantly affected the outcome. Therefore, researchers must consider these experimental and bioinformatic features during RNA variant analysis. Further investigations of common pRESs discovered that BDH1, CCDC137, and TBC1D10A transcripts contained a single non-synonymous RNA variant that was unique to breast cancer tissue compared to adjacent normal tissue; thus, further clinical validation is required. Public Library of Science 2018-08-02 /pmc/articles/PMC6072070/ /pubmed/30071094 http://dx.doi.org/10.1371/journal.pone.0201822 Text en © 2018 Hong et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Hong, Ji Hyung
Ko, Yoon Ho
Kang, Keunsoo
RNA variant identification discrepancy among splice-aware alignment algorithms
title RNA variant identification discrepancy among splice-aware alignment algorithms
title_full RNA variant identification discrepancy among splice-aware alignment algorithms
title_fullStr RNA variant identification discrepancy among splice-aware alignment algorithms
title_full_unstemmed RNA variant identification discrepancy among splice-aware alignment algorithms
title_short RNA variant identification discrepancy among splice-aware alignment algorithms
title_sort rna variant identification discrepancy among splice-aware alignment algorithms
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6072070/
https://www.ncbi.nlm.nih.gov/pubmed/30071094
http://dx.doi.org/10.1371/journal.pone.0201822
work_keys_str_mv AT hongjihyung rnavariantidentificationdiscrepancyamongspliceawarealignmentalgorithms
AT koyoonho rnavariantidentificationdiscrepancyamongspliceawarealignmentalgorithms
AT kangkeunsoo rnavariantidentificationdiscrepancyamongspliceawarealignmentalgorithms