Cargando…

CDS: A Fold-change Based Statistical Test for Concomitant Identification of Distinctness and Similarity in Gene Expression Analysis

The problem of identifying differential activity such as in gene expression is a major defeat in biostatistics and bioinformatics. Equally important, however much less frequently studied, is the question of similar activity from one biological condition to another. The fold-change, or ratio, is usua...

Descripción completa

Detalles Bibliográficos
Autores principales: Tchitchek, Nicolas, Golib Dzib, José Felipe, Targat, Brice, Noth, Sebastian, Benecke, Arndt, Lesne, Annick
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5054499/
https://www.ncbi.nlm.nih.gov/pubmed/22917185
http://dx.doi.org/10.1016/j.gpb.2012.06.002
_version_ 1782458612862418944
author Tchitchek, Nicolas
Golib Dzib, José Felipe
Targat, Brice
Noth, Sebastian
Benecke, Arndt
Lesne, Annick
author_facet Tchitchek, Nicolas
Golib Dzib, José Felipe
Targat, Brice
Noth, Sebastian
Benecke, Arndt
Lesne, Annick
author_sort Tchitchek, Nicolas
collection PubMed
description The problem of identifying differential activity such as in gene expression is a major defeat in biostatistics and bioinformatics. Equally important, however much less frequently studied, is the question of similar activity from one biological condition to another. The fold-change, or ratio, is usually considered a relevant criterion for stating difference and similarity between measurements. Importantly, no statistical method for concomitant evaluation of similarity and distinctness currently exists for biological applications. Modern microarray, digital PCR (dPCR), and Next-Generation Sequencing (NGS) technologies frequently provide a means of coefficient of variation estimation for individual measurements. Using fold-change, and by making the assumption that measurements are normally distributed with known variances, we designed a novel statistical test that allows us to detect concomitantly, thus using the same formalism, differentially and similarly expressed genes (http://cds.ihes.fr). Given two sets of gene measurements in different biological conditions, the probabilities of making type I and type II errors in stating that a gene is differentially or similarly expressed from one condition to the other can be calculated. Furthermore, a confidence interval for the fold-change can be delineated. Finally, we demonstrate that the assumption of normality can be relaxed to consider arbitrary distributions numerically. The Concomitant evaluation of Distinctness and Similarity (CDS) statistical test correctly estimates similarities and differences between measurements of gene expression. The implementation, being time and memory efficient, allows the use of the CDS test in high-throughput data analysis such as microarray, dPCR, and NGS experiments. Importantly, the CDS test can be applied to the comparison of single measurements (N = 1) provided the variance (or coefficient of variation) of the signals is known, making CDS a valuable tool also in biomedical analysis where typically a single measurement per subject is available.
format Online
Article
Text
id pubmed-5054499
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-50544992016-10-14 CDS: A Fold-change Based Statistical Test for Concomitant Identification of Distinctness and Similarity in Gene Expression Analysis Tchitchek, Nicolas Golib Dzib, José Felipe Targat, Brice Noth, Sebastian Benecke, Arndt Lesne, Annick Genomics Proteomics Bioinformatics Original Research The problem of identifying differential activity such as in gene expression is a major defeat in biostatistics and bioinformatics. Equally important, however much less frequently studied, is the question of similar activity from one biological condition to another. The fold-change, or ratio, is usually considered a relevant criterion for stating difference and similarity between measurements. Importantly, no statistical method for concomitant evaluation of similarity and distinctness currently exists for biological applications. Modern microarray, digital PCR (dPCR), and Next-Generation Sequencing (NGS) technologies frequently provide a means of coefficient of variation estimation for individual measurements. Using fold-change, and by making the assumption that measurements are normally distributed with known variances, we designed a novel statistical test that allows us to detect concomitantly, thus using the same formalism, differentially and similarly expressed genes (http://cds.ihes.fr). Given two sets of gene measurements in different biological conditions, the probabilities of making type I and type II errors in stating that a gene is differentially or similarly expressed from one condition to the other can be calculated. Furthermore, a confidence interval for the fold-change can be delineated. Finally, we demonstrate that the assumption of normality can be relaxed to consider arbitrary distributions numerically. The Concomitant evaluation of Distinctness and Similarity (CDS) statistical test correctly estimates similarities and differences between measurements of gene expression. The implementation, being time and memory efficient, allows the use of the CDS test in high-throughput data analysis such as microarray, dPCR, and NGS experiments. Importantly, the CDS test can be applied to the comparison of single measurements (N = 1) provided the variance (or coefficient of variation) of the signals is known, making CDS a valuable tool also in biomedical analysis where typically a single measurement per subject is available. Elsevier 2012-06 2012-06-25 /pmc/articles/PMC5054499/ /pubmed/22917185 http://dx.doi.org/10.1016/j.gpb.2012.06.002 Text en © 2012 Beijing Institute of Genomics, Chinese Academy of Sciences and Genetics Society of China. Published by Elsevier Ltd and Science Press. All rights reserved. http://creativecommons.org/licenses/by-nc-sa/3.0/ This is an open access article under the CC BY-NC-SA license (http://creativecommons.org/licenses/by-nc-sa/3.0/).
spellingShingle Original Research
Tchitchek, Nicolas
Golib Dzib, José Felipe
Targat, Brice
Noth, Sebastian
Benecke, Arndt
Lesne, Annick
CDS: A Fold-change Based Statistical Test for Concomitant Identification of Distinctness and Similarity in Gene Expression Analysis
title CDS: A Fold-change Based Statistical Test for Concomitant Identification of Distinctness and Similarity in Gene Expression Analysis
title_full CDS: A Fold-change Based Statistical Test for Concomitant Identification of Distinctness and Similarity in Gene Expression Analysis
title_fullStr CDS: A Fold-change Based Statistical Test for Concomitant Identification of Distinctness and Similarity in Gene Expression Analysis
title_full_unstemmed CDS: A Fold-change Based Statistical Test for Concomitant Identification of Distinctness and Similarity in Gene Expression Analysis
title_short CDS: A Fold-change Based Statistical Test for Concomitant Identification of Distinctness and Similarity in Gene Expression Analysis
title_sort cds: a fold-change based statistical test for concomitant identification of distinctness and similarity in gene expression analysis
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5054499/
https://www.ncbi.nlm.nih.gov/pubmed/22917185
http://dx.doi.org/10.1016/j.gpb.2012.06.002
work_keys_str_mv AT tchitcheknicolas cdsafoldchangebasedstatisticaltestforconcomitantidentificationofdistinctnessandsimilarityingeneexpressionanalysis
AT golibdzibjosefelipe cdsafoldchangebasedstatisticaltestforconcomitantidentificationofdistinctnessandsimilarityingeneexpressionanalysis
AT targatbrice cdsafoldchangebasedstatisticaltestforconcomitantidentificationofdistinctnessandsimilarityingeneexpressionanalysis
AT nothsebastian cdsafoldchangebasedstatisticaltestforconcomitantidentificationofdistinctnessandsimilarityingeneexpressionanalysis
AT beneckearndt cdsafoldchangebasedstatisticaltestforconcomitantidentificationofdistinctnessandsimilarityingeneexpressionanalysis
AT lesneannick cdsafoldchangebasedstatisticaltestforconcomitantidentificationofdistinctnessandsimilarityingeneexpressionanalysis