Cargando…

Selecting Reliable mRNA Expression Measurements Across Platforms Improves Downstream Analysis

With increasing use of publicly available gene expression data sets, the quality of the expression data is a critical issue for downstream analysis, gene signature development, and cross-validation of data sets. Thus, identifying reliable expression measurements by leveraging multiple mRNA expressio...

Descripción completa

Detalles Bibliográficos
Autores principales: Tong, Pan, Diao, Lixia, Shen, Li, Li, Lerong, Heymach, John Victor, Girard, Luc, Minna, John D., Coombes, Kevin R., Byers, Lauren Averett, Wang, Jing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Libertas Academica 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4863871/
https://www.ncbi.nlm.nih.gov/pubmed/27199546
http://dx.doi.org/10.4137/CIN.S38590
_version_ 1782431553334280192
author Tong, Pan
Diao, Lixia
Shen, Li
Li, Lerong
Heymach, John Victor
Girard, Luc
Minna, John D.
Coombes, Kevin R.
Byers, Lauren Averett
Wang, Jing
author_facet Tong, Pan
Diao, Lixia
Shen, Li
Li, Lerong
Heymach, John Victor
Girard, Luc
Minna, John D.
Coombes, Kevin R.
Byers, Lauren Averett
Wang, Jing
author_sort Tong, Pan
collection PubMed
description With increasing use of publicly available gene expression data sets, the quality of the expression data is a critical issue for downstream analysis, gene signature development, and cross-validation of data sets. Thus, identifying reliable expression measurements by leveraging multiple mRNA expression platforms is an important analytical task. In this study, we propose a statistical framework for selecting reliable measurements between platforms by modeling the correlations of mRNA expression levels using a beta-mixture model. The model-based selection provides an effective and objective way to separate good probes from probes with low quality, thereby improving the efficiency and accuracy of the analysis. The proposed method can be used to compare two microarray technologies or microarray and RNA sequencing measurements. We tested the approach in two matched profiling data sets, using microarray gene expression measurements from the same samples profiled on both Affymetrix and Illumina platforms. We also applied the algorithm to mRNA expression data to compare Affymetrix microarray data with RNA sequencing measurements. The algorithm successfully identified probes/genes with reliable measurements. Removing the unreliable measurements resulted in significant improvements for gene signature development and functional annotations.
format Online
Article
Text
id pubmed-4863871
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-48638712016-05-19 Selecting Reliable mRNA Expression Measurements Across Platforms Improves Downstream Analysis Tong, Pan Diao, Lixia Shen, Li Li, Lerong Heymach, John Victor Girard, Luc Minna, John D. Coombes, Kevin R. Byers, Lauren Averett Wang, Jing Cancer Inform Original Research With increasing use of publicly available gene expression data sets, the quality of the expression data is a critical issue for downstream analysis, gene signature development, and cross-validation of data sets. Thus, identifying reliable expression measurements by leveraging multiple mRNA expression platforms is an important analytical task. In this study, we propose a statistical framework for selecting reliable measurements between platforms by modeling the correlations of mRNA expression levels using a beta-mixture model. The model-based selection provides an effective and objective way to separate good probes from probes with low quality, thereby improving the efficiency and accuracy of the analysis. The proposed method can be used to compare two microarray technologies or microarray and RNA sequencing measurements. We tested the approach in two matched profiling data sets, using microarray gene expression measurements from the same samples profiled on both Affymetrix and Illumina platforms. We also applied the algorithm to mRNA expression data to compare Affymetrix microarray data with RNA sequencing measurements. The algorithm successfully identified probes/genes with reliable measurements. Removing the unreliable measurements resulted in significant improvements for gene signature development and functional annotations. Libertas Academica 2016-05-10 /pmc/articles/PMC4863871/ /pubmed/27199546 http://dx.doi.org/10.4137/CIN.S38590 Text en © 2016 the author(s), publisher and licensee Libertas Academica Ltd. This is an open-access article distributed under the terms of the Creative Commons CC-BY-NC 3.0 license.
spellingShingle Original Research
Tong, Pan
Diao, Lixia
Shen, Li
Li, Lerong
Heymach, John Victor
Girard, Luc
Minna, John D.
Coombes, Kevin R.
Byers, Lauren Averett
Wang, Jing
Selecting Reliable mRNA Expression Measurements Across Platforms Improves Downstream Analysis
title Selecting Reliable mRNA Expression Measurements Across Platforms Improves Downstream Analysis
title_full Selecting Reliable mRNA Expression Measurements Across Platforms Improves Downstream Analysis
title_fullStr Selecting Reliable mRNA Expression Measurements Across Platforms Improves Downstream Analysis
title_full_unstemmed Selecting Reliable mRNA Expression Measurements Across Platforms Improves Downstream Analysis
title_short Selecting Reliable mRNA Expression Measurements Across Platforms Improves Downstream Analysis
title_sort selecting reliable mrna expression measurements across platforms improves downstream analysis
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4863871/
https://www.ncbi.nlm.nih.gov/pubmed/27199546
http://dx.doi.org/10.4137/CIN.S38590
work_keys_str_mv AT tongpan selectingreliablemrnaexpressionmeasurementsacrossplatformsimprovesdownstreamanalysis
AT diaolixia selectingreliablemrnaexpressionmeasurementsacrossplatformsimprovesdownstreamanalysis
AT shenli selectingreliablemrnaexpressionmeasurementsacrossplatformsimprovesdownstreamanalysis
AT lilerong selectingreliablemrnaexpressionmeasurementsacrossplatformsimprovesdownstreamanalysis
AT heymachjohnvictor selectingreliablemrnaexpressionmeasurementsacrossplatformsimprovesdownstreamanalysis
AT girardluc selectingreliablemrnaexpressionmeasurementsacrossplatformsimprovesdownstreamanalysis
AT minnajohnd selectingreliablemrnaexpressionmeasurementsacrossplatformsimprovesdownstreamanalysis
AT coombeskevinr selectingreliablemrnaexpressionmeasurementsacrossplatformsimprovesdownstreamanalysis
AT byerslaurenaverett selectingreliablemrnaexpressionmeasurementsacrossplatformsimprovesdownstreamanalysis
AT wangjing selectingreliablemrnaexpressionmeasurementsacrossplatformsimprovesdownstreamanalysis