Cargando…

LONGO: an R package for interactive gene length dependent analysis for neuronal identity

MOTIVATION: Reprogramming somatic cells into neurons holds great promise to model neuronal development and disease. The efficiency and success rate of neuronal reprogramming, however, may vary between different conversion platforms and cell types, thereby necessitating an unbiased, systematic approa...

Descripción completa

Detalles Bibliográficos
Autores principales: McCoy, Matthew J, Paul, Alexander J, Victor, Matheus B, Richner, Michelle, Gabel, Harrison W, Gong, Haijun, Yoo, Andrew S, Ahn, Tae-Hyuk
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6022641/
https://www.ncbi.nlm.nih.gov/pubmed/29950021
http://dx.doi.org/10.1093/bioinformatics/bty243
Descripción
Sumario:MOTIVATION: Reprogramming somatic cells into neurons holds great promise to model neuronal development and disease. The efficiency and success rate of neuronal reprogramming, however, may vary between different conversion platforms and cell types, thereby necessitating an unbiased, systematic approach to estimate neuronal identity of converted cells. Recent studies have demonstrated that long genes (>100 kb from transcription start to end) are highly enriched in neurons, which provides an opportunity to identify neurons based on the expression of these long genes. RESULTS: We have developed a versatile R package, LONGO, to analyze gene expression based on gene length. We propose a systematic analysis of long gene expression (LGE) with a metric termed the long gene quotient (LQ) that quantifies LGE in RNA-seq or microarray data to validate neuronal identity at the single-cell and population levels. This unique feature of neurons provides an opportunity to utilize measurements of LGE in transcriptome data to quickly and easily distinguish neurons from non-neuronal cells. By combining this conceptual advancement and statistical tool in a user-friendly and interactive software package, we intend to encourage and simplify further investigation into LGE, particularly as it applies to validating and improving neuronal differentiation and reprogramming methodologies. AVAILABILITY AND IMPLEMENTATION: LONGO is freely available for download at https://github.com/biohpc/longo. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.