Cargando…
Robust and rigorous identification of tissue-specific genes by statistically extending tau score
OBJECTIVES: In this study, we aimed to identify tissue-specific genes for various human tissues/organs more robustly and rigorously by extending the tau score algorithm. INTRODUCTION: Tissue-specific genes are a class of genes whose functions and expressions are preferred in one or several tissues r...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9733102/ https://www.ncbi.nlm.nih.gov/pubmed/36494766 http://dx.doi.org/10.1186/s13040-022-00315-9 |
_version_ | 1784846285475414016 |
---|---|
author | Lüleci, Hatice Büşra Yılmaz, Alper |
author_facet | Lüleci, Hatice Büşra Yılmaz, Alper |
author_sort | Lüleci, Hatice Büşra |
collection | PubMed |
description | OBJECTIVES: In this study, we aimed to identify tissue-specific genes for various human tissues/organs more robustly and rigorously by extending the tau score algorithm. INTRODUCTION: Tissue-specific genes are a class of genes whose functions and expressions are preferred in one or several tissues restrictedly. Identification of tissue-specific genes is essential for discovering multi-cellular biological processes such as tissue-specific molecular regulations, tissue development, physiology, and the pathogenesis of tissue-associated diseases. MATERIALS AND METHODS: Gene expression data derived from five large RNA sequencing (RNA-seq) projects, spanning 96 different human tissues, were retrieved from ArrayExpress and ExpressionAtlas. The first step is categorizing genes using significant filters and tau score as a specificity index. After calculating tau for each gene in all datasets separately, statistical distance from the maximum expression level was estimated using a new meaningful procedure. Specific expression of a gene in one or several tissues was calculated after the integration of tau and statistical distance estimation, which is called as extended tau approach. Obtained tissue-specific genes for 96 different human tissues were functionally annotated, and some comparisons were carried out to show the effectiveness of the extended tau method. RESULTS AND DISCUSSION: Categorization of genes based on expression level and identification of tissue-specific genes for a large number of tissues/organs were executed. Genes were successfully assigned to multiple tissues by generating the extended tau approach as opposed to the original tau score, which can assign tissue specificity to single tissue only. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13040-022-00315-9. |
format | Online Article Text |
id | pubmed-9733102 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-97331022022-12-10 Robust and rigorous identification of tissue-specific genes by statistically extending tau score Lüleci, Hatice Büşra Yılmaz, Alper BioData Min Methodology OBJECTIVES: In this study, we aimed to identify tissue-specific genes for various human tissues/organs more robustly and rigorously by extending the tau score algorithm. INTRODUCTION: Tissue-specific genes are a class of genes whose functions and expressions are preferred in one or several tissues restrictedly. Identification of tissue-specific genes is essential for discovering multi-cellular biological processes such as tissue-specific molecular regulations, tissue development, physiology, and the pathogenesis of tissue-associated diseases. MATERIALS AND METHODS: Gene expression data derived from five large RNA sequencing (RNA-seq) projects, spanning 96 different human tissues, were retrieved from ArrayExpress and ExpressionAtlas. The first step is categorizing genes using significant filters and tau score as a specificity index. After calculating tau for each gene in all datasets separately, statistical distance from the maximum expression level was estimated using a new meaningful procedure. Specific expression of a gene in one or several tissues was calculated after the integration of tau and statistical distance estimation, which is called as extended tau approach. Obtained tissue-specific genes for 96 different human tissues were functionally annotated, and some comparisons were carried out to show the effectiveness of the extended tau method. RESULTS AND DISCUSSION: Categorization of genes based on expression level and identification of tissue-specific genes for a large number of tissues/organs were executed. Genes were successfully assigned to multiple tissues by generating the extended tau approach as opposed to the original tau score, which can assign tissue specificity to single tissue only. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13040-022-00315-9. BioMed Central 2022-12-09 /pmc/articles/PMC9733102/ /pubmed/36494766 http://dx.doi.org/10.1186/s13040-022-00315-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Lüleci, Hatice Büşra Yılmaz, Alper Robust and rigorous identification of tissue-specific genes by statistically extending tau score |
title | Robust and rigorous identification of tissue-specific genes by statistically extending tau score |
title_full | Robust and rigorous identification of tissue-specific genes by statistically extending tau score |
title_fullStr | Robust and rigorous identification of tissue-specific genes by statistically extending tau score |
title_full_unstemmed | Robust and rigorous identification of tissue-specific genes by statistically extending tau score |
title_short | Robust and rigorous identification of tissue-specific genes by statistically extending tau score |
title_sort | robust and rigorous identification of tissue-specific genes by statistically extending tau score |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9733102/ https://www.ncbi.nlm.nih.gov/pubmed/36494766 http://dx.doi.org/10.1186/s13040-022-00315-9 |
work_keys_str_mv | AT lulecihaticebusra robustandrigorousidentificationoftissuespecificgenesbystatisticallyextendingtauscore AT yılmazalper robustandrigorousidentificationoftissuespecificgenesbystatisticallyextendingtauscore |