Cargando…

l1kdeconv: an R package for peak calling analysis with LINCS L1000 data

BACKGROUND: LINCS L1000 is a high-throughput technology that allows gene expression measurement in a large number of assays. However, to fit the measurements of ~1000 genes in the ~500 color channels of LINCS L1000, every two landmark genes are designed to share a single channel. Thus, a deconvoluti...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Zhao, Li, Jin, Yu, Peng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5532784/
https://www.ncbi.nlm.nih.gov/pubmed/28750623
http://dx.doi.org/10.1186/s12859-017-1767-9
_version_ 1783253522986303488
author Li, Zhao
Li, Jin
Yu, Peng
author_facet Li, Zhao
Li, Jin
Yu, Peng
author_sort Li, Zhao
collection PubMed
description BACKGROUND: LINCS L1000 is a high-throughput technology that allows gene expression measurement in a large number of assays. However, to fit the measurements of ~1000 genes in the ~500 color channels of LINCS L1000, every two landmark genes are designed to share a single channel. Thus, a deconvolution step is required to infer the expression values of each gene. Any errors in this step can be propagated adversely to the downstream analyses. RESULTS: We presented a LINCS L1000 data peak calling R package l1kdeconv based on a new outlier detection method and an aggregate Gaussian mixture model (AGMM). Upon the remove of outliers and the borrowing information among similar samples, l1kdeconv showed more stable and better performance than methods commonly used in LINCS L1000 data deconvolution. CONCLUSIONS: Based on the benchmark using both simulated data and real data, the l1kdeconv package achieved more stable results than the commonly used LINCS L1000 data deconvolution methods.
format Online
Article
Text
id pubmed-5532784
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-55327842017-08-02 l1kdeconv: an R package for peak calling analysis with LINCS L1000 data Li, Zhao Li, Jin Yu, Peng BMC Bioinformatics Software BACKGROUND: LINCS L1000 is a high-throughput technology that allows gene expression measurement in a large number of assays. However, to fit the measurements of ~1000 genes in the ~500 color channels of LINCS L1000, every two landmark genes are designed to share a single channel. Thus, a deconvolution step is required to infer the expression values of each gene. Any errors in this step can be propagated adversely to the downstream analyses. RESULTS: We presented a LINCS L1000 data peak calling R package l1kdeconv based on a new outlier detection method and an aggregate Gaussian mixture model (AGMM). Upon the remove of outliers and the borrowing information among similar samples, l1kdeconv showed more stable and better performance than methods commonly used in LINCS L1000 data deconvolution. CONCLUSIONS: Based on the benchmark using both simulated data and real data, the l1kdeconv package achieved more stable results than the commonly used LINCS L1000 data deconvolution methods. BioMed Central 2017-07-27 /pmc/articles/PMC5532784/ /pubmed/28750623 http://dx.doi.org/10.1186/s12859-017-1767-9 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Li, Zhao
Li, Jin
Yu, Peng
l1kdeconv: an R package for peak calling analysis with LINCS L1000 data
title l1kdeconv: an R package for peak calling analysis with LINCS L1000 data
title_full l1kdeconv: an R package for peak calling analysis with LINCS L1000 data
title_fullStr l1kdeconv: an R package for peak calling analysis with LINCS L1000 data
title_full_unstemmed l1kdeconv: an R package for peak calling analysis with LINCS L1000 data
title_short l1kdeconv: an R package for peak calling analysis with LINCS L1000 data
title_sort l1kdeconv: an r package for peak calling analysis with lincs l1000 data
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5532784/
https://www.ncbi.nlm.nih.gov/pubmed/28750623
http://dx.doi.org/10.1186/s12859-017-1767-9
work_keys_str_mv AT lizhao l1kdeconvanrpackageforpeakcallinganalysiswithlincsl1000data
AT lijin l1kdeconvanrpackageforpeakcallinganalysiswithlincsl1000data
AT yupeng l1kdeconvanrpackageforpeakcallinganalysiswithlincsl1000data