Cargando…

Principal Component Analysis of Alternative Splicing Profiles Revealed by Long-Read ONT Sequencing in Human Liver Tissue and Hepatocyte-Derived HepG2 and Huh7 Cell Lines

The long-read RNA sequencing developed by Oxford Nanopore Technology provides a direct quantification of transcript isoforms. That makes the number of transcript isoforms per gene an intrinsically suitable metric for alternative splicing (AS) profiling in the application to this particular type of R...

Descripción completa

Detalles Bibliográficos
Autores principales: Sarygina, Elizaveta, Kozlova, Anna, Deinichenko, Kseniia, Radko, Sergey, Ptitsyn, Konstantin, Khmeleva, Svetlana, Kurbatov, Leonid K., Spirin, Pavel, Prassolov, Vladimir S., Ilgisonis, Ekaterina, Lisitsa, Andrey, Ponomarenko, Elena
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10648607/
https://www.ncbi.nlm.nih.gov/pubmed/37958484
http://dx.doi.org/10.3390/ijms242115502
_version_ 1785135377562992640
author Sarygina, Elizaveta
Kozlova, Anna
Deinichenko, Kseniia
Radko, Sergey
Ptitsyn, Konstantin
Khmeleva, Svetlana
Kurbatov, Leonid K.
Spirin, Pavel
Prassolov, Vladimir S.
Ilgisonis, Ekaterina
Lisitsa, Andrey
Ponomarenko, Elena
author_facet Sarygina, Elizaveta
Kozlova, Anna
Deinichenko, Kseniia
Radko, Sergey
Ptitsyn, Konstantin
Khmeleva, Svetlana
Kurbatov, Leonid K.
Spirin, Pavel
Prassolov, Vladimir S.
Ilgisonis, Ekaterina
Lisitsa, Andrey
Ponomarenko, Elena
author_sort Sarygina, Elizaveta
collection PubMed
description The long-read RNA sequencing developed by Oxford Nanopore Technology provides a direct quantification of transcript isoforms. That makes the number of transcript isoforms per gene an intrinsically suitable metric for alternative splicing (AS) profiling in the application to this particular type of RNA sequencing. By using this simple metric and recruiting principal component analysis (PCA) as a tool to visualize the high-dimensional transcriptomic data, we were able to group biospecimens of normal human liver tissue and hepatocyte-derived malignant HepG2 and Huh7 cells into clear clusters in a 2D space. For the transcriptome-wide analysis, the clustering was observed regardless whether all genes were included in analysis or only those expressed in all biospecimens tested. However, in the application to a particular set of genes known as pharmacogenes, which are involved in drug metabolism, the clustering worsened dramatically in the latter case. Based on PCA data, the subsets of genes most contributing to biospecimens’ grouping into clusters were selected and subjected to gene ontology analysis that allowed us to determine the top 20 biological processes among which translation and processes related to its regulation dominate. The suggested metrics can be a useful addition to the existing metrics for describing AS profiles, especially in application to transcriptome studies with long-read sequencing.
format Online
Article
Text
id pubmed-10648607
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-106486072023-10-24 Principal Component Analysis of Alternative Splicing Profiles Revealed by Long-Read ONT Sequencing in Human Liver Tissue and Hepatocyte-Derived HepG2 and Huh7 Cell Lines Sarygina, Elizaveta Kozlova, Anna Deinichenko, Kseniia Radko, Sergey Ptitsyn, Konstantin Khmeleva, Svetlana Kurbatov, Leonid K. Spirin, Pavel Prassolov, Vladimir S. Ilgisonis, Ekaterina Lisitsa, Andrey Ponomarenko, Elena Int J Mol Sci Article The long-read RNA sequencing developed by Oxford Nanopore Technology provides a direct quantification of transcript isoforms. That makes the number of transcript isoforms per gene an intrinsically suitable metric for alternative splicing (AS) profiling in the application to this particular type of RNA sequencing. By using this simple metric and recruiting principal component analysis (PCA) as a tool to visualize the high-dimensional transcriptomic data, we were able to group biospecimens of normal human liver tissue and hepatocyte-derived malignant HepG2 and Huh7 cells into clear clusters in a 2D space. For the transcriptome-wide analysis, the clustering was observed regardless whether all genes were included in analysis or only those expressed in all biospecimens tested. However, in the application to a particular set of genes known as pharmacogenes, which are involved in drug metabolism, the clustering worsened dramatically in the latter case. Based on PCA data, the subsets of genes most contributing to biospecimens’ grouping into clusters were selected and subjected to gene ontology analysis that allowed us to determine the top 20 biological processes among which translation and processes related to its regulation dominate. The suggested metrics can be a useful addition to the existing metrics for describing AS profiles, especially in application to transcriptome studies with long-read sequencing. MDPI 2023-10-24 /pmc/articles/PMC10648607/ /pubmed/37958484 http://dx.doi.org/10.3390/ijms242115502 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Sarygina, Elizaveta
Kozlova, Anna
Deinichenko, Kseniia
Radko, Sergey
Ptitsyn, Konstantin
Khmeleva, Svetlana
Kurbatov, Leonid K.
Spirin, Pavel
Prassolov, Vladimir S.
Ilgisonis, Ekaterina
Lisitsa, Andrey
Ponomarenko, Elena
Principal Component Analysis of Alternative Splicing Profiles Revealed by Long-Read ONT Sequencing in Human Liver Tissue and Hepatocyte-Derived HepG2 and Huh7 Cell Lines
title Principal Component Analysis of Alternative Splicing Profiles Revealed by Long-Read ONT Sequencing in Human Liver Tissue and Hepatocyte-Derived HepG2 and Huh7 Cell Lines
title_full Principal Component Analysis of Alternative Splicing Profiles Revealed by Long-Read ONT Sequencing in Human Liver Tissue and Hepatocyte-Derived HepG2 and Huh7 Cell Lines
title_fullStr Principal Component Analysis of Alternative Splicing Profiles Revealed by Long-Read ONT Sequencing in Human Liver Tissue and Hepatocyte-Derived HepG2 and Huh7 Cell Lines
title_full_unstemmed Principal Component Analysis of Alternative Splicing Profiles Revealed by Long-Read ONT Sequencing in Human Liver Tissue and Hepatocyte-Derived HepG2 and Huh7 Cell Lines
title_short Principal Component Analysis of Alternative Splicing Profiles Revealed by Long-Read ONT Sequencing in Human Liver Tissue and Hepatocyte-Derived HepG2 and Huh7 Cell Lines
title_sort principal component analysis of alternative splicing profiles revealed by long-read ont sequencing in human liver tissue and hepatocyte-derived hepg2 and huh7 cell lines
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10648607/
https://www.ncbi.nlm.nih.gov/pubmed/37958484
http://dx.doi.org/10.3390/ijms242115502
work_keys_str_mv AT saryginaelizaveta principalcomponentanalysisofalternativesplicingprofilesrevealedbylongreadontsequencinginhumanlivertissueandhepatocytederivedhepg2andhuh7celllines
AT kozlovaanna principalcomponentanalysisofalternativesplicingprofilesrevealedbylongreadontsequencinginhumanlivertissueandhepatocytederivedhepg2andhuh7celllines
AT deinichenkokseniia principalcomponentanalysisofalternativesplicingprofilesrevealedbylongreadontsequencinginhumanlivertissueandhepatocytederivedhepg2andhuh7celllines
AT radkosergey principalcomponentanalysisofalternativesplicingprofilesrevealedbylongreadontsequencinginhumanlivertissueandhepatocytederivedhepg2andhuh7celllines
AT ptitsynkonstantin principalcomponentanalysisofalternativesplicingprofilesrevealedbylongreadontsequencinginhumanlivertissueandhepatocytederivedhepg2andhuh7celllines
AT khmelevasvetlana principalcomponentanalysisofalternativesplicingprofilesrevealedbylongreadontsequencinginhumanlivertissueandhepatocytederivedhepg2andhuh7celllines
AT kurbatovleonidk principalcomponentanalysisofalternativesplicingprofilesrevealedbylongreadontsequencinginhumanlivertissueandhepatocytederivedhepg2andhuh7celllines
AT spirinpavel principalcomponentanalysisofalternativesplicingprofilesrevealedbylongreadontsequencinginhumanlivertissueandhepatocytederivedhepg2andhuh7celllines
AT prassolovvladimirs principalcomponentanalysisofalternativesplicingprofilesrevealedbylongreadontsequencinginhumanlivertissueandhepatocytederivedhepg2andhuh7celllines
AT ilgisonisekaterina principalcomponentanalysisofalternativesplicingprofilesrevealedbylongreadontsequencinginhumanlivertissueandhepatocytederivedhepg2andhuh7celllines
AT lisitsaandrey principalcomponentanalysisofalternativesplicingprofilesrevealedbylongreadontsequencinginhumanlivertissueandhepatocytederivedhepg2andhuh7celllines
AT ponomarenkoelena principalcomponentanalysisofalternativesplicingprofilesrevealedbylongreadontsequencinginhumanlivertissueandhepatocytederivedhepg2andhuh7celllines