Cargando…

Robust and rigorous identification of tissue-specific genes by statistically extending tau score

OBJECTIVES: In this study, we aimed to identify tissue-specific genes for various human tissues/organs more robustly and rigorously by extending the tau score algorithm. INTRODUCTION: Tissue-specific genes are a class of genes whose functions and expressions are preferred in one or several tissues r...

Descripción completa

Detalles Bibliográficos
Autores principales: Lüleci, Hatice Büşra, Yılmaz, Alper
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9733102/
https://www.ncbi.nlm.nih.gov/pubmed/36494766
http://dx.doi.org/10.1186/s13040-022-00315-9
_version_ 1784846285475414016
author Lüleci, Hatice Büşra
Yılmaz, Alper
author_facet Lüleci, Hatice Büşra
Yılmaz, Alper
author_sort Lüleci, Hatice Büşra
collection PubMed
description OBJECTIVES: In this study, we aimed to identify tissue-specific genes for various human tissues/organs more robustly and rigorously by extending the tau score algorithm. INTRODUCTION: Tissue-specific genes are a class of genes whose functions and expressions are preferred in one or several tissues restrictedly. Identification of tissue-specific genes is essential for discovering multi-cellular biological processes such as tissue-specific molecular regulations, tissue development, physiology, and the pathogenesis of tissue-associated diseases. MATERIALS AND METHODS: Gene expression data derived from five large RNA sequencing (RNA-seq) projects, spanning 96 different human tissues, were retrieved from ArrayExpress and ExpressionAtlas. The first step is categorizing genes using significant filters and tau score as a specificity index. After calculating tau for each gene in all datasets separately, statistical distance from the maximum expression level was estimated using a new meaningful procedure. Specific expression of a gene in one or several tissues was calculated after the integration of tau and statistical distance estimation, which is called as extended tau approach. Obtained tissue-specific genes for 96 different human tissues were functionally annotated, and some comparisons were carried out to show the effectiveness of the extended tau method. RESULTS AND DISCUSSION: Categorization of genes based on expression level and identification of tissue-specific genes for a large number of tissues/organs were executed. Genes were successfully assigned to multiple tissues by generating the extended tau approach as opposed to the original tau score, which can assign tissue specificity to single tissue only. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13040-022-00315-9.
format Online
Article
Text
id pubmed-9733102
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-97331022022-12-10 Robust and rigorous identification of tissue-specific genes by statistically extending tau score Lüleci, Hatice Büşra Yılmaz, Alper BioData Min Methodology OBJECTIVES: In this study, we aimed to identify tissue-specific genes for various human tissues/organs more robustly and rigorously by extending the tau score algorithm. INTRODUCTION: Tissue-specific genes are a class of genes whose functions and expressions are preferred in one or several tissues restrictedly. Identification of tissue-specific genes is essential for discovering multi-cellular biological processes such as tissue-specific molecular regulations, tissue development, physiology, and the pathogenesis of tissue-associated diseases. MATERIALS AND METHODS: Gene expression data derived from five large RNA sequencing (RNA-seq) projects, spanning 96 different human tissues, were retrieved from ArrayExpress and ExpressionAtlas. The first step is categorizing genes using significant filters and tau score as a specificity index. After calculating tau for each gene in all datasets separately, statistical distance from the maximum expression level was estimated using a new meaningful procedure. Specific expression of a gene in one or several tissues was calculated after the integration of tau and statistical distance estimation, which is called as extended tau approach. Obtained tissue-specific genes for 96 different human tissues were functionally annotated, and some comparisons were carried out to show the effectiveness of the extended tau method. RESULTS AND DISCUSSION: Categorization of genes based on expression level and identification of tissue-specific genes for a large number of tissues/organs were executed. Genes were successfully assigned to multiple tissues by generating the extended tau approach as opposed to the original tau score, which can assign tissue specificity to single tissue only. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13040-022-00315-9. BioMed Central 2022-12-09 /pmc/articles/PMC9733102/ /pubmed/36494766 http://dx.doi.org/10.1186/s13040-022-00315-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology
Lüleci, Hatice Büşra
Yılmaz, Alper
Robust and rigorous identification of tissue-specific genes by statistically extending tau score
title Robust and rigorous identification of tissue-specific genes by statistically extending tau score
title_full Robust and rigorous identification of tissue-specific genes by statistically extending tau score
title_fullStr Robust and rigorous identification of tissue-specific genes by statistically extending tau score
title_full_unstemmed Robust and rigorous identification of tissue-specific genes by statistically extending tau score
title_short Robust and rigorous identification of tissue-specific genes by statistically extending tau score
title_sort robust and rigorous identification of tissue-specific genes by statistically extending tau score
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9733102/
https://www.ncbi.nlm.nih.gov/pubmed/36494766
http://dx.doi.org/10.1186/s13040-022-00315-9
work_keys_str_mv AT lulecihaticebusra robustandrigorousidentificationoftissuespecificgenesbystatisticallyextendingtauscore
AT yılmazalper robustandrigorousidentificationoftissuespecificgenesbystatisticallyextendingtauscore