Cargando…
COTAN: scRNA-seq data analysis based on gene co-expression
Estimating the co-expression of cell identity factors in single-cell is crucial. Due to the low efficiency of scRNA-seq methodologies, sensitive computational approaches are critical to accurately infer transcription profiles in a cell population. We introduce COTAN, a statistical and computational...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8356963/ https://www.ncbi.nlm.nih.gov/pubmed/34396096 http://dx.doi.org/10.1093/nargab/lqab072 |
_version_ | 1783737044703379456 |
---|---|
author | Galfrè, Silvia Giulia Morandin, Francesco Pietrosanto, Marco Cremisi, Federico Helmer-Citterich, Manuela |
author_facet | Galfrè, Silvia Giulia Morandin, Francesco Pietrosanto, Marco Cremisi, Federico Helmer-Citterich, Manuela |
author_sort | Galfrè, Silvia Giulia |
collection | PubMed |
description | Estimating the co-expression of cell identity factors in single-cell is crucial. Due to the low efficiency of scRNA-seq methodologies, sensitive computational approaches are critical to accurately infer transcription profiles in a cell population. We introduce COTAN, a statistical and computational method, to analyze the co-expression of gene pairs at single cell level, providing the foundation for single-cell gene interactome analysis. The basic idea is studying the zero UMI counts’ distribution instead of focusing on positive counts; this is done with a generalized contingency tables framework. COTAN can assess the correlated or anti-correlated expression of gene pairs, providing a new correlation index with an approximate p-value for the associated test of independence. COTAN can evaluate whether single genes are differentially expressed, scoring them with a newly defined global differentiation index. Similarly to correlation network analysis, it provides ways to plot and cluster genes according to their co-expression pattern with other genes, effectively helping the study of gene interactions, becoming a new tool to identify cell-identity markers. We assayed COTAN on two neural development datasets with very promising results. COTAN is an R package that complements the traditional single cell RNA-seq analysis and it is available at https://github.com/seriph78/COTAN. |
format | Online Article Text |
id | pubmed-8356963 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-83569632021-08-12 COTAN: scRNA-seq data analysis based on gene co-expression Galfrè, Silvia Giulia Morandin, Francesco Pietrosanto, Marco Cremisi, Federico Helmer-Citterich, Manuela NAR Genom Bioinform Methods Article Estimating the co-expression of cell identity factors in single-cell is crucial. Due to the low efficiency of scRNA-seq methodologies, sensitive computational approaches are critical to accurately infer transcription profiles in a cell population. We introduce COTAN, a statistical and computational method, to analyze the co-expression of gene pairs at single cell level, providing the foundation for single-cell gene interactome analysis. The basic idea is studying the zero UMI counts’ distribution instead of focusing on positive counts; this is done with a generalized contingency tables framework. COTAN can assess the correlated or anti-correlated expression of gene pairs, providing a new correlation index with an approximate p-value for the associated test of independence. COTAN can evaluate whether single genes are differentially expressed, scoring them with a newly defined global differentiation index. Similarly to correlation network analysis, it provides ways to plot and cluster genes according to their co-expression pattern with other genes, effectively helping the study of gene interactions, becoming a new tool to identify cell-identity markers. We assayed COTAN on two neural development datasets with very promising results. COTAN is an R package that complements the traditional single cell RNA-seq analysis and it is available at https://github.com/seriph78/COTAN. Oxford University Press 2021-08-11 /pmc/articles/PMC8356963/ /pubmed/34396096 http://dx.doi.org/10.1093/nargab/lqab072 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Methods Article Galfrè, Silvia Giulia Morandin, Francesco Pietrosanto, Marco Cremisi, Federico Helmer-Citterich, Manuela COTAN: scRNA-seq data analysis based on gene co-expression |
title | COTAN: scRNA-seq data analysis based on gene co-expression |
title_full | COTAN: scRNA-seq data analysis based on gene co-expression |
title_fullStr | COTAN: scRNA-seq data analysis based on gene co-expression |
title_full_unstemmed | COTAN: scRNA-seq data analysis based on gene co-expression |
title_short | COTAN: scRNA-seq data analysis based on gene co-expression |
title_sort | cotan: scrna-seq data analysis based on gene co-expression |
topic | Methods Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8356963/ https://www.ncbi.nlm.nih.gov/pubmed/34396096 http://dx.doi.org/10.1093/nargab/lqab072 |
work_keys_str_mv | AT galfresilviagiulia cotanscrnaseqdataanalysisbasedongenecoexpression AT morandinfrancesco cotanscrnaseqdataanalysisbasedongenecoexpression AT pietrosantomarco cotanscrnaseqdataanalysisbasedongenecoexpression AT cremisifederico cotanscrnaseqdataanalysisbasedongenecoexpression AT helmercitterichmanuela cotanscrnaseqdataanalysisbasedongenecoexpression |