Cargando…

SPECS: a non-parametric method to identify tissue-specific molecular features for unbalanced sample groups

BACKGROUND: To understand biology and differences among various tissues or cell types, one typically searches for molecular features that display characteristic abundance patterns. Several specificity metrics have been introduced to identify tissue-specific molecular features, but these either requi...

Descripción completa

Detalles Bibliográficos
Autores principales: Everaert, Celine, Volders, Pieter-Jan, Morlion, Annelien, Thas, Olivier, Mestdagh, Pieter
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7026976/
https://www.ncbi.nlm.nih.gov/pubmed/32066370
http://dx.doi.org/10.1186/s12859-020-3407-z
_version_ 1783498771734200320
author Everaert, Celine
Volders, Pieter-Jan
Morlion, Annelien
Thas, Olivier
Mestdagh, Pieter
author_facet Everaert, Celine
Volders, Pieter-Jan
Morlion, Annelien
Thas, Olivier
Mestdagh, Pieter
author_sort Everaert, Celine
collection PubMed
description BACKGROUND: To understand biology and differences among various tissues or cell types, one typically searches for molecular features that display characteristic abundance patterns. Several specificity metrics have been introduced to identify tissue-specific molecular features, but these either require an equal number of replicates per tissue or they can’t handle replicates at all. RESULTS: We describe a non-parametric specificity score that is compatible with unequal sample group sizes. To demonstrate its usefulness, the specificity score was calculated on all GTEx samples, detecting known and novel tissue-specific genes. A webtool was developed to browse these results for genes or tissues of interest. An example python implementation of SPECS is available at https://github.com/celineeveraert/SPECS. The precalculated SPECS results on the GTEx data are available through a user-friendly browser at specs.cmgg.be. CONCLUSIONS: SPECS is a non-parametric method that identifies known and novel specific-expressed genes. In addition, SPECS could be adopted for other features and applications.
format Online
Article
Text
id pubmed-7026976
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-70269762020-02-24 SPECS: a non-parametric method to identify tissue-specific molecular features for unbalanced sample groups Everaert, Celine Volders, Pieter-Jan Morlion, Annelien Thas, Olivier Mestdagh, Pieter BMC Bioinformatics Methodology Article BACKGROUND: To understand biology and differences among various tissues or cell types, one typically searches for molecular features that display characteristic abundance patterns. Several specificity metrics have been introduced to identify tissue-specific molecular features, but these either require an equal number of replicates per tissue or they can’t handle replicates at all. RESULTS: We describe a non-parametric specificity score that is compatible with unequal sample group sizes. To demonstrate its usefulness, the specificity score was calculated on all GTEx samples, detecting known and novel tissue-specific genes. A webtool was developed to browse these results for genes or tissues of interest. An example python implementation of SPECS is available at https://github.com/celineeveraert/SPECS. The precalculated SPECS results on the GTEx data are available through a user-friendly browser at specs.cmgg.be. CONCLUSIONS: SPECS is a non-parametric method that identifies known and novel specific-expressed genes. In addition, SPECS could be adopted for other features and applications. BioMed Central 2020-02-17 /pmc/articles/PMC7026976/ /pubmed/32066370 http://dx.doi.org/10.1186/s12859-020-3407-z Text en © The Author(s). 2020 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Everaert, Celine
Volders, Pieter-Jan
Morlion, Annelien
Thas, Olivier
Mestdagh, Pieter
SPECS: a non-parametric method to identify tissue-specific molecular features for unbalanced sample groups
title SPECS: a non-parametric method to identify tissue-specific molecular features for unbalanced sample groups
title_full SPECS: a non-parametric method to identify tissue-specific molecular features for unbalanced sample groups
title_fullStr SPECS: a non-parametric method to identify tissue-specific molecular features for unbalanced sample groups
title_full_unstemmed SPECS: a non-parametric method to identify tissue-specific molecular features for unbalanced sample groups
title_short SPECS: a non-parametric method to identify tissue-specific molecular features for unbalanced sample groups
title_sort specs: a non-parametric method to identify tissue-specific molecular features for unbalanced sample groups
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7026976/
https://www.ncbi.nlm.nih.gov/pubmed/32066370
http://dx.doi.org/10.1186/s12859-020-3407-z
work_keys_str_mv AT everaertceline specsanonparametricmethodtoidentifytissuespecificmolecularfeaturesforunbalancedsamplegroups
AT volderspieterjan specsanonparametricmethodtoidentifytissuespecificmolecularfeaturesforunbalancedsamplegroups
AT morlionannelien specsanonparametricmethodtoidentifytissuespecificmolecularfeaturesforunbalancedsamplegroups
AT thasolivier specsanonparametricmethodtoidentifytissuespecificmolecularfeaturesforunbalancedsamplegroups
AT mestdaghpieter specsanonparametricmethodtoidentifytissuespecificmolecularfeaturesforunbalancedsamplegroups