Cargando…

COTAN: scRNA-seq data analysis based on gene co-expression

Estimating the co-expression of cell identity factors in single-cell is crucial. Due to the low efficiency of scRNA-seq methodologies, sensitive computational approaches are critical to accurately infer transcription profiles in a cell population. We introduce COTAN, a statistical and computational...

Descripción completa

Detalles Bibliográficos
Autores principales: Galfrè, Silvia Giulia, Morandin, Francesco, Pietrosanto, Marco, Cremisi, Federico, Helmer-Citterich, Manuela
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8356963/
https://www.ncbi.nlm.nih.gov/pubmed/34396096
http://dx.doi.org/10.1093/nargab/lqab072
_version_ 1783737044703379456
author Galfrè, Silvia Giulia
Morandin, Francesco
Pietrosanto, Marco
Cremisi, Federico
Helmer-Citterich, Manuela
author_facet Galfrè, Silvia Giulia
Morandin, Francesco
Pietrosanto, Marco
Cremisi, Federico
Helmer-Citterich, Manuela
author_sort Galfrè, Silvia Giulia
collection PubMed
description Estimating the co-expression of cell identity factors in single-cell is crucial. Due to the low efficiency of scRNA-seq methodologies, sensitive computational approaches are critical to accurately infer transcription profiles in a cell population. We introduce COTAN, a statistical and computational method, to analyze the co-expression of gene pairs at single cell level, providing the foundation for single-cell gene interactome analysis. The basic idea is studying the zero UMI counts’ distribution instead of focusing on positive counts; this is done with a generalized contingency tables framework. COTAN can assess the correlated or anti-correlated expression of gene pairs, providing a new correlation index with an approximate p-value for the associated test of independence. COTAN can evaluate whether single genes are differentially expressed, scoring them with a newly defined global differentiation index. Similarly to correlation network analysis, it provides ways to plot and cluster genes according to their co-expression pattern with other genes, effectively helping the study of gene interactions, becoming a new tool to identify cell-identity markers. We assayed COTAN on two neural development datasets with very promising results. COTAN is an R package that complements the traditional single cell RNA-seq analysis and it is available at https://github.com/seriph78/COTAN.
format Online
Article
Text
id pubmed-8356963
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-83569632021-08-12 COTAN: scRNA-seq data analysis based on gene co-expression Galfrè, Silvia Giulia Morandin, Francesco Pietrosanto, Marco Cremisi, Federico Helmer-Citterich, Manuela NAR Genom Bioinform Methods Article Estimating the co-expression of cell identity factors in single-cell is crucial. Due to the low efficiency of scRNA-seq methodologies, sensitive computational approaches are critical to accurately infer transcription profiles in a cell population. We introduce COTAN, a statistical and computational method, to analyze the co-expression of gene pairs at single cell level, providing the foundation for single-cell gene interactome analysis. The basic idea is studying the zero UMI counts’ distribution instead of focusing on positive counts; this is done with a generalized contingency tables framework. COTAN can assess the correlated or anti-correlated expression of gene pairs, providing a new correlation index with an approximate p-value for the associated test of independence. COTAN can evaluate whether single genes are differentially expressed, scoring them with a newly defined global differentiation index. Similarly to correlation network analysis, it provides ways to plot and cluster genes according to their co-expression pattern with other genes, effectively helping the study of gene interactions, becoming a new tool to identify cell-identity markers. We assayed COTAN on two neural development datasets with very promising results. COTAN is an R package that complements the traditional single cell RNA-seq analysis and it is available at https://github.com/seriph78/COTAN. Oxford University Press 2021-08-11 /pmc/articles/PMC8356963/ /pubmed/34396096 http://dx.doi.org/10.1093/nargab/lqab072 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Article
Galfrè, Silvia Giulia
Morandin, Francesco
Pietrosanto, Marco
Cremisi, Federico
Helmer-Citterich, Manuela
COTAN: scRNA-seq data analysis based on gene co-expression
title COTAN: scRNA-seq data analysis based on gene co-expression
title_full COTAN: scRNA-seq data analysis based on gene co-expression
title_fullStr COTAN: scRNA-seq data analysis based on gene co-expression
title_full_unstemmed COTAN: scRNA-seq data analysis based on gene co-expression
title_short COTAN: scRNA-seq data analysis based on gene co-expression
title_sort cotan: scrna-seq data analysis based on gene co-expression
topic Methods Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8356963/
https://www.ncbi.nlm.nih.gov/pubmed/34396096
http://dx.doi.org/10.1093/nargab/lqab072
work_keys_str_mv AT galfresilviagiulia cotanscrnaseqdataanalysisbasedongenecoexpression
AT morandinfrancesco cotanscrnaseqdataanalysisbasedongenecoexpression
AT pietrosantomarco cotanscrnaseqdataanalysisbasedongenecoexpression
AT cremisifederico cotanscrnaseqdataanalysisbasedongenecoexpression
AT helmercitterichmanuela cotanscrnaseqdataanalysisbasedongenecoexpression