Cargando…

GAGAM v1.2: An Improvement on Peak Labeling and Genomic Annotated Gene Activity Matrix Construction

Single-cell Assay for Transposase-Accessible Chromatin using sequencing (scATAC-seq) is rapidly becoming a powerful technology for assessing the epigenetic landscape of thousands of cells. However, the sparsity of the resulting data poses significant challenges to their interpretability and informat...

Descripción completa

Detalles Bibliográficos
Autores principales: Martini, Lorenzo, Bardini, Roberta, Savino, Alessandro, Di Carlo, Stefano
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9858924/
https://www.ncbi.nlm.nih.gov/pubmed/36672856
http://dx.doi.org/10.3390/genes14010115
_version_ 1784874226516230144
author Martini, Lorenzo
Bardini, Roberta
Savino, Alessandro
Di Carlo, Stefano
author_facet Martini, Lorenzo
Bardini, Roberta
Savino, Alessandro
Di Carlo, Stefano
author_sort Martini, Lorenzo
collection PubMed
description Single-cell Assay for Transposase-Accessible Chromatin using sequencing (scATAC-seq) is rapidly becoming a powerful technology for assessing the epigenetic landscape of thousands of cells. However, the sparsity of the resulting data poses significant challenges to their interpretability and informativeness. Different computational methods are available, proposing ways to generate significant features from accessibility data and process them to obtain meaningful results. Foremost among them is the peak calling, which interprets the raw scATAC-seq data generating the peaks as features. However, scATAC-seq data are not trivially comparable with single-cell RNA sequencing (scRNA-seq) data, an increasingly pressing challenge since the necessity of multimodal experiments integration. For this reason, this study wants to improve the concept of the Gene Activity Matrix (GAM), which links the accessibility data to the genes, by proposing an improved version of the Genomic-Annotated Gene Activity Matrix (GAGAM) concept. Specifically, this paper presents GAGAM v1.2, a new and better version of GAGAM v1.0. GAGAM aims to label the peaks and link them to the genes through functional annotation of the whole genome. Using genes as features in scATAC-seq datasets makes different datasets comparable and allows linking gene accessibility and expression. This link is crucial for gene regulation understanding and fundamental for the increasing impact of multi-omics data. Results confirm that our method performs better than the previous GAMs and shows a preliminary comparison with scRNA-seq data.
format Online
Article
Text
id pubmed-9858924
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-98589242023-01-21 GAGAM v1.2: An Improvement on Peak Labeling and Genomic Annotated Gene Activity Matrix Construction Martini, Lorenzo Bardini, Roberta Savino, Alessandro Di Carlo, Stefano Genes (Basel) Article Single-cell Assay for Transposase-Accessible Chromatin using sequencing (scATAC-seq) is rapidly becoming a powerful technology for assessing the epigenetic landscape of thousands of cells. However, the sparsity of the resulting data poses significant challenges to their interpretability and informativeness. Different computational methods are available, proposing ways to generate significant features from accessibility data and process them to obtain meaningful results. Foremost among them is the peak calling, which interprets the raw scATAC-seq data generating the peaks as features. However, scATAC-seq data are not trivially comparable with single-cell RNA sequencing (scRNA-seq) data, an increasingly pressing challenge since the necessity of multimodal experiments integration. For this reason, this study wants to improve the concept of the Gene Activity Matrix (GAM), which links the accessibility data to the genes, by proposing an improved version of the Genomic-Annotated Gene Activity Matrix (GAGAM) concept. Specifically, this paper presents GAGAM v1.2, a new and better version of GAGAM v1.0. GAGAM aims to label the peaks and link them to the genes through functional annotation of the whole genome. Using genes as features in scATAC-seq datasets makes different datasets comparable and allows linking gene accessibility and expression. This link is crucial for gene regulation understanding and fundamental for the increasing impact of multi-omics data. Results confirm that our method performs better than the previous GAMs and shows a preliminary comparison with scRNA-seq data. MDPI 2022-12-30 /pmc/articles/PMC9858924/ /pubmed/36672856 http://dx.doi.org/10.3390/genes14010115 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Martini, Lorenzo
Bardini, Roberta
Savino, Alessandro
Di Carlo, Stefano
GAGAM v1.2: An Improvement on Peak Labeling and Genomic Annotated Gene Activity Matrix Construction
title GAGAM v1.2: An Improvement on Peak Labeling and Genomic Annotated Gene Activity Matrix Construction
title_full GAGAM v1.2: An Improvement on Peak Labeling and Genomic Annotated Gene Activity Matrix Construction
title_fullStr GAGAM v1.2: An Improvement on Peak Labeling and Genomic Annotated Gene Activity Matrix Construction
title_full_unstemmed GAGAM v1.2: An Improvement on Peak Labeling and Genomic Annotated Gene Activity Matrix Construction
title_short GAGAM v1.2: An Improvement on Peak Labeling and Genomic Annotated Gene Activity Matrix Construction
title_sort gagam v1.2: an improvement on peak labeling and genomic annotated gene activity matrix construction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9858924/
https://www.ncbi.nlm.nih.gov/pubmed/36672856
http://dx.doi.org/10.3390/genes14010115
work_keys_str_mv AT martinilorenzo gagamv12animprovementonpeaklabelingandgenomicannotatedgeneactivitymatrixconstruction
AT bardiniroberta gagamv12animprovementonpeaklabelingandgenomicannotatedgeneactivitymatrixconstruction
AT savinoalessandro gagamv12animprovementonpeaklabelingandgenomicannotatedgeneactivitymatrixconstruction
AT dicarlostefano gagamv12animprovementonpeaklabelingandgenomicannotatedgeneactivitymatrixconstruction