Cargando…

RNA-seq data science: From raw data to effective interpretation

RNA sequencing (RNA-seq) has become an exemplary technology in modern biology and clinical science. Its immense popularity is due in large part to the continuous efforts of the bioinformatics community to develop accurate and scalable computational tools to analyze the enormous amounts of transcript...

Descripción completa

Detalles Bibliográficos
Autores principales: Deshpande, Dhrithi, Chhugani, Karishma, Chang, Yutong, Karlsberg, Aaron, Loeffler, Caitlin, Zhang, Jinyang, Muszyńska, Agata, Munteanu, Viorel, Yang, Harry, Rotman, Jeremy, Tao, Laura, Balliu, Brunilda, Tseng, Elizabeth, Eskin, Eleazar, Zhao, Fangqing, Mohammadi, Pejman, P. Łabaj, Paweł, Mangul, Serghei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10043755/
https://www.ncbi.nlm.nih.gov/pubmed/36999049
http://dx.doi.org/10.3389/fgene.2023.997383
_version_ 1784913223748681728
author Deshpande, Dhrithi
Chhugani, Karishma
Chang, Yutong
Karlsberg, Aaron
Loeffler, Caitlin
Zhang, Jinyang
Muszyńska, Agata
Munteanu, Viorel
Yang, Harry
Rotman, Jeremy
Tao, Laura
Balliu, Brunilda
Tseng, Elizabeth
Eskin, Eleazar
Zhao, Fangqing
Mohammadi, Pejman
P. Łabaj, Paweł
Mangul, Serghei
author_facet Deshpande, Dhrithi
Chhugani, Karishma
Chang, Yutong
Karlsberg, Aaron
Loeffler, Caitlin
Zhang, Jinyang
Muszyńska, Agata
Munteanu, Viorel
Yang, Harry
Rotman, Jeremy
Tao, Laura
Balliu, Brunilda
Tseng, Elizabeth
Eskin, Eleazar
Zhao, Fangqing
Mohammadi, Pejman
P. Łabaj, Paweł
Mangul, Serghei
author_sort Deshpande, Dhrithi
collection PubMed
description RNA sequencing (RNA-seq) has become an exemplary technology in modern biology and clinical science. Its immense popularity is due in large part to the continuous efforts of the bioinformatics community to develop accurate and scalable computational tools to analyze the enormous amounts of transcriptomic data that it produces. RNA-seq analysis enables genes and their corresponding transcripts to be probed for a variety of purposes, such as detecting novel exons or whole transcripts, assessing expression of genes and alternative transcripts, and studying alternative splicing structure. It can be a challenge, however, to obtain meaningful biological signals from raw RNA-seq data because of the enormous scale of the data as well as the inherent limitations of different sequencing technologies, such as amplification bias or biases of library preparation. The need to overcome these technical challenges has pushed the rapid development of novel computational tools, which have evolved and diversified in accordance with technological advancements, leading to the current myriad of RNA-seq tools. These tools, combined with the diverse computational skill sets of biomedical researchers, help to unlock the full potential of RNA-seq. The purpose of this review is to explain basic concepts in the computational analysis of RNA-seq data and define discipline-specific jargon.
format Online
Article
Text
id pubmed-10043755
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-100437552023-03-29 RNA-seq data science: From raw data to effective interpretation Deshpande, Dhrithi Chhugani, Karishma Chang, Yutong Karlsberg, Aaron Loeffler, Caitlin Zhang, Jinyang Muszyńska, Agata Munteanu, Viorel Yang, Harry Rotman, Jeremy Tao, Laura Balliu, Brunilda Tseng, Elizabeth Eskin, Eleazar Zhao, Fangqing Mohammadi, Pejman P. Łabaj, Paweł Mangul, Serghei Front Genet Genetics RNA sequencing (RNA-seq) has become an exemplary technology in modern biology and clinical science. Its immense popularity is due in large part to the continuous efforts of the bioinformatics community to develop accurate and scalable computational tools to analyze the enormous amounts of transcriptomic data that it produces. RNA-seq analysis enables genes and their corresponding transcripts to be probed for a variety of purposes, such as detecting novel exons or whole transcripts, assessing expression of genes and alternative transcripts, and studying alternative splicing structure. It can be a challenge, however, to obtain meaningful biological signals from raw RNA-seq data because of the enormous scale of the data as well as the inherent limitations of different sequencing technologies, such as amplification bias or biases of library preparation. The need to overcome these technical challenges has pushed the rapid development of novel computational tools, which have evolved and diversified in accordance with technological advancements, leading to the current myriad of RNA-seq tools. These tools, combined with the diverse computational skill sets of biomedical researchers, help to unlock the full potential of RNA-seq. The purpose of this review is to explain basic concepts in the computational analysis of RNA-seq data and define discipline-specific jargon. Frontiers Media S.A. 2023-03-13 /pmc/articles/PMC10043755/ /pubmed/36999049 http://dx.doi.org/10.3389/fgene.2023.997383 Text en Copyright © 2023 Deshpande, Chhugani, Chang, Karlsberg, Loeffler, Zhang, Muszyńska, Munteanu, Yang, Rotman, Tao, Balliu, Tseng, Eskin, Zhao, Mohammadi, P. Łabaj and Mangul. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Deshpande, Dhrithi
Chhugani, Karishma
Chang, Yutong
Karlsberg, Aaron
Loeffler, Caitlin
Zhang, Jinyang
Muszyńska, Agata
Munteanu, Viorel
Yang, Harry
Rotman, Jeremy
Tao, Laura
Balliu, Brunilda
Tseng, Elizabeth
Eskin, Eleazar
Zhao, Fangqing
Mohammadi, Pejman
P. Łabaj, Paweł
Mangul, Serghei
RNA-seq data science: From raw data to effective interpretation
title RNA-seq data science: From raw data to effective interpretation
title_full RNA-seq data science: From raw data to effective interpretation
title_fullStr RNA-seq data science: From raw data to effective interpretation
title_full_unstemmed RNA-seq data science: From raw data to effective interpretation
title_short RNA-seq data science: From raw data to effective interpretation
title_sort rna-seq data science: from raw data to effective interpretation
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10043755/
https://www.ncbi.nlm.nih.gov/pubmed/36999049
http://dx.doi.org/10.3389/fgene.2023.997383
work_keys_str_mv AT deshpandedhrithi rnaseqdatasciencefromrawdatatoeffectiveinterpretation
AT chhuganikarishma rnaseqdatasciencefromrawdatatoeffectiveinterpretation
AT changyutong rnaseqdatasciencefromrawdatatoeffectiveinterpretation
AT karlsbergaaron rnaseqdatasciencefromrawdatatoeffectiveinterpretation
AT loefflercaitlin rnaseqdatasciencefromrawdatatoeffectiveinterpretation
AT zhangjinyang rnaseqdatasciencefromrawdatatoeffectiveinterpretation
AT muszynskaagata rnaseqdatasciencefromrawdatatoeffectiveinterpretation
AT munteanuviorel rnaseqdatasciencefromrawdatatoeffectiveinterpretation
AT yangharry rnaseqdatasciencefromrawdatatoeffectiveinterpretation
AT rotmanjeremy rnaseqdatasciencefromrawdatatoeffectiveinterpretation
AT taolaura rnaseqdatasciencefromrawdatatoeffectiveinterpretation
AT balliubrunilda rnaseqdatasciencefromrawdatatoeffectiveinterpretation
AT tsengelizabeth rnaseqdatasciencefromrawdatatoeffectiveinterpretation
AT eskineleazar rnaseqdatasciencefromrawdatatoeffectiveinterpretation
AT zhaofangqing rnaseqdatasciencefromrawdatatoeffectiveinterpretation
AT mohammadipejman rnaseqdatasciencefromrawdatatoeffectiveinterpretation
AT płabajpaweł rnaseqdatasciencefromrawdatatoeffectiveinterpretation
AT mangulserghei rnaseqdatasciencefromrawdatatoeffectiveinterpretation