Cargando…

LONGO: an R package for interactive gene length dependent analysis for neuronal identity

MOTIVATION: Reprogramming somatic cells into neurons holds great promise to model neuronal development and disease. The efficiency and success rate of neuronal reprogramming, however, may vary between different conversion platforms and cell types, thereby necessitating an unbiased, systematic approa...

Descripción completa

Detalles Bibliográficos
Autores principales: McCoy, Matthew J, Paul, Alexander J, Victor, Matheus B, Richner, Michelle, Gabel, Harrison W, Gong, Haijun, Yoo, Andrew S, Ahn, Tae-Hyuk
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6022641/
https://www.ncbi.nlm.nih.gov/pubmed/29950021
http://dx.doi.org/10.1093/bioinformatics/bty243
_version_ 1783335721525837824
author McCoy, Matthew J
Paul, Alexander J
Victor, Matheus B
Richner, Michelle
Gabel, Harrison W
Gong, Haijun
Yoo, Andrew S
Ahn, Tae-Hyuk
author_facet McCoy, Matthew J
Paul, Alexander J
Victor, Matheus B
Richner, Michelle
Gabel, Harrison W
Gong, Haijun
Yoo, Andrew S
Ahn, Tae-Hyuk
author_sort McCoy, Matthew J
collection PubMed
description MOTIVATION: Reprogramming somatic cells into neurons holds great promise to model neuronal development and disease. The efficiency and success rate of neuronal reprogramming, however, may vary between different conversion platforms and cell types, thereby necessitating an unbiased, systematic approach to estimate neuronal identity of converted cells. Recent studies have demonstrated that long genes (>100 kb from transcription start to end) are highly enriched in neurons, which provides an opportunity to identify neurons based on the expression of these long genes. RESULTS: We have developed a versatile R package, LONGO, to analyze gene expression based on gene length. We propose a systematic analysis of long gene expression (LGE) with a metric termed the long gene quotient (LQ) that quantifies LGE in RNA-seq or microarray data to validate neuronal identity at the single-cell and population levels. This unique feature of neurons provides an opportunity to utilize measurements of LGE in transcriptome data to quickly and easily distinguish neurons from non-neuronal cells. By combining this conceptual advancement and statistical tool in a user-friendly and interactive software package, we intend to encourage and simplify further investigation into LGE, particularly as it applies to validating and improving neuronal differentiation and reprogramming methodologies. AVAILABILITY AND IMPLEMENTATION: LONGO is freely available for download at https://github.com/biohpc/longo. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6022641
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-60226412018-07-10 LONGO: an R package for interactive gene length dependent analysis for neuronal identity McCoy, Matthew J Paul, Alexander J Victor, Matheus B Richner, Michelle Gabel, Harrison W Gong, Haijun Yoo, Andrew S Ahn, Tae-Hyuk Bioinformatics Ismb 2018–Intelligent Systems for Molecular Biology Proceedings MOTIVATION: Reprogramming somatic cells into neurons holds great promise to model neuronal development and disease. The efficiency and success rate of neuronal reprogramming, however, may vary between different conversion platforms and cell types, thereby necessitating an unbiased, systematic approach to estimate neuronal identity of converted cells. Recent studies have demonstrated that long genes (>100 kb from transcription start to end) are highly enriched in neurons, which provides an opportunity to identify neurons based on the expression of these long genes. RESULTS: We have developed a versatile R package, LONGO, to analyze gene expression based on gene length. We propose a systematic analysis of long gene expression (LGE) with a metric termed the long gene quotient (LQ) that quantifies LGE in RNA-seq or microarray data to validate neuronal identity at the single-cell and population levels. This unique feature of neurons provides an opportunity to utilize measurements of LGE in transcriptome data to quickly and easily distinguish neurons from non-neuronal cells. By combining this conceptual advancement and statistical tool in a user-friendly and interactive software package, we intend to encourage and simplify further investigation into LGE, particularly as it applies to validating and improving neuronal differentiation and reprogramming methodologies. AVAILABILITY AND IMPLEMENTATION: LONGO is freely available for download at https://github.com/biohpc/longo. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-07-01 2018-06-27 /pmc/articles/PMC6022641/ /pubmed/29950021 http://dx.doi.org/10.1093/bioinformatics/bty243 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb 2018–Intelligent Systems for Molecular Biology Proceedings
McCoy, Matthew J
Paul, Alexander J
Victor, Matheus B
Richner, Michelle
Gabel, Harrison W
Gong, Haijun
Yoo, Andrew S
Ahn, Tae-Hyuk
LONGO: an R package for interactive gene length dependent analysis for neuronal identity
title LONGO: an R package for interactive gene length dependent analysis for neuronal identity
title_full LONGO: an R package for interactive gene length dependent analysis for neuronal identity
title_fullStr LONGO: an R package for interactive gene length dependent analysis for neuronal identity
title_full_unstemmed LONGO: an R package for interactive gene length dependent analysis for neuronal identity
title_short LONGO: an R package for interactive gene length dependent analysis for neuronal identity
title_sort longo: an r package for interactive gene length dependent analysis for neuronal identity
topic Ismb 2018–Intelligent Systems for Molecular Biology Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6022641/
https://www.ncbi.nlm.nih.gov/pubmed/29950021
http://dx.doi.org/10.1093/bioinformatics/bty243
work_keys_str_mv AT mccoymatthewj longoanrpackageforinteractivegenelengthdependentanalysisforneuronalidentity
AT paulalexanderj longoanrpackageforinteractivegenelengthdependentanalysisforneuronalidentity
AT victormatheusb longoanrpackageforinteractivegenelengthdependentanalysisforneuronalidentity
AT richnermichelle longoanrpackageforinteractivegenelengthdependentanalysisforneuronalidentity
AT gabelharrisonw longoanrpackageforinteractivegenelengthdependentanalysisforneuronalidentity
AT gonghaijun longoanrpackageforinteractivegenelengthdependentanalysisforneuronalidentity
AT yooandrews longoanrpackageforinteractivegenelengthdependentanalysisforneuronalidentity
AT ahntaehyuk longoanrpackageforinteractivegenelengthdependentanalysisforneuronalidentity