Cargando…

M3Drop: dropout-based feature selection for scRNASeq

MOTIVATION: Most genomes contain thousands of genes, but for most functional responses, only a subset of those genes are relevant. To facilitate many single-cell RNASeq (scRNASeq) analyses the set of genes is often reduced through feature selection, i.e. by removing genes only subject to technical n...

Descripción completa

Detalles Bibliográficos
Autores principales: Andrews, Tallulah S, Hemberg, Martin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6691329/
https://www.ncbi.nlm.nih.gov/pubmed/30590489
http://dx.doi.org/10.1093/bioinformatics/bty1044
_version_ 1783443349045248000
author Andrews, Tallulah S
Hemberg, Martin
author_facet Andrews, Tallulah S
Hemberg, Martin
author_sort Andrews, Tallulah S
collection PubMed
description MOTIVATION: Most genomes contain thousands of genes, but for most functional responses, only a subset of those genes are relevant. To facilitate many single-cell RNASeq (scRNASeq) analyses the set of genes is often reduced through feature selection, i.e. by removing genes only subject to technical noise. RESULTS: We present M3Drop, an R package that implements popular existing feature selection methods and two novel methods which take advantage of the prevalence of zeros (dropouts) in scRNASeq data to identify features. We show these new methods outperform existing methods on simulated and real datasets. AVAILABILITY AND IMPLEMENTATION: M3Drop is freely available on github as an R package and is compatible with other popular scRNASeq tools: https://github.com/tallulandrews/M3Drop. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6691329
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-66913292019-08-16 M3Drop: dropout-based feature selection for scRNASeq Andrews, Tallulah S Hemberg, Martin Bioinformatics Applications Notes MOTIVATION: Most genomes contain thousands of genes, but for most functional responses, only a subset of those genes are relevant. To facilitate many single-cell RNASeq (scRNASeq) analyses the set of genes is often reduced through feature selection, i.e. by removing genes only subject to technical noise. RESULTS: We present M3Drop, an R package that implements popular existing feature selection methods and two novel methods which take advantage of the prevalence of zeros (dropouts) in scRNASeq data to identify features. We show these new methods outperform existing methods on simulated and real datasets. AVAILABILITY AND IMPLEMENTATION: M3Drop is freely available on github as an R package and is compatible with other popular scRNASeq tools: https://github.com/tallulandrews/M3Drop. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-08-15 2018-12-24 /pmc/articles/PMC6691329/ /pubmed/30590489 http://dx.doi.org/10.1093/bioinformatics/bty1044 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Notes
Andrews, Tallulah S
Hemberg, Martin
M3Drop: dropout-based feature selection for scRNASeq
title M3Drop: dropout-based feature selection for scRNASeq
title_full M3Drop: dropout-based feature selection for scRNASeq
title_fullStr M3Drop: dropout-based feature selection for scRNASeq
title_full_unstemmed M3Drop: dropout-based feature selection for scRNASeq
title_short M3Drop: dropout-based feature selection for scRNASeq
title_sort m3drop: dropout-based feature selection for scrnaseq
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6691329/
https://www.ncbi.nlm.nih.gov/pubmed/30590489
http://dx.doi.org/10.1093/bioinformatics/bty1044
work_keys_str_mv AT andrewstallulahs m3dropdropoutbasedfeatureselectionforscrnaseq
AT hembergmartin m3dropdropoutbasedfeatureselectionforscrnaseq