CDS: A Fold-change Based Statistical Test for Concomitant Identification of Distinctness and Similarity in Gene Expression Analysis
The problem of identifying differential activity such as in gene expression is a major challenge in biostatistics and bioinformatics. Equally important, though much less frequently studied, is the question of similar activity from one biological condition to another. The fold-change, or ratio, is usua...
Main authors: | Tchitchek, Nicolas; Golib Dzib, José Felipe; Targat, Brice; Noth, Sebastian; Benecke, Arndt; Lesne, Annick |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Elsevier 2012 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5054499/ https://www.ncbi.nlm.nih.gov/pubmed/22917185 http://dx.doi.org/10.1016/j.gpb.2012.06.002 |
_version_ | 1782458612862418944 |
---|---|
author | Tchitchek, Nicolas; Golib Dzib, José Felipe; Targat, Brice; Noth, Sebastian; Benecke, Arndt; Lesne, Annick |
author_facet | Tchitchek, Nicolas; Golib Dzib, José Felipe; Targat, Brice; Noth, Sebastian; Benecke, Arndt; Lesne, Annick |
author_sort | Tchitchek, Nicolas |
collection | PubMed |
description | The problem of identifying differential activity such as in gene expression is a major challenge in biostatistics and bioinformatics. Equally important, though much less frequently studied, is the question of similar activity from one biological condition to another. The fold-change, or ratio, is usually considered a relevant criterion for stating difference and similarity between measurements. Importantly, no statistical method for concomitant evaluation of similarity and distinctness currently exists for biological applications. Modern microarray, digital PCR (dPCR), and Next-Generation Sequencing (NGS) technologies frequently provide a means of estimating the coefficient of variation for individual measurements. Using fold-change, and by making the assumption that measurements are normally distributed with known variances, we designed a novel statistical test that allows us to detect concomitantly, thus using the same formalism, differentially and similarly expressed genes (http://cds.ihes.fr). Given two sets of gene measurements in different biological conditions, the probabilities of making type I and type II errors in stating that a gene is differentially or similarly expressed from one condition to the other can be calculated. Furthermore, a confidence interval for the fold-change can be delineated. Finally, we demonstrate that the assumption of normality can be relaxed to consider arbitrary distributions numerically. The Concomitant evaluation of Distinctness and Similarity (CDS) statistical test correctly estimates similarities and differences between measurements of gene expression. The implementation, being time and memory efficient, allows the use of the CDS test in high-throughput data analysis such as microarray, dPCR, and NGS experiments. Importantly, the CDS test can be applied to the comparison of single measurements (N = 1) provided the variance (or coefficient of variation) of the signals is known, making CDS a valuable tool also in biomedical analysis where typically a single measurement per subject is available. |
format | Online Article Text |
id | pubmed-5054499 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-5054499 2016-10-14 CDS: A Fold-change Based Statistical Test for Concomitant Identification of Distinctness and Similarity in Gene Expression Analysis. Tchitchek, Nicolas; Golib Dzib, José Felipe; Targat, Brice; Noth, Sebastian; Benecke, Arndt; Lesne, Annick. Genomics Proteomics Bioinformatics. Original Research. Elsevier 2012-06 2012-06-25 /pmc/articles/PMC5054499/ /pubmed/22917185 http://dx.doi.org/10.1016/j.gpb.2012.06.002 Text en © 2012 Beijing Institute of Genomics, Chinese Academy of Sciences and Genetics Society of China. Published by Elsevier Ltd and Science Press. All rights reserved. This is an open access article under the CC BY-NC-SA license (http://creativecommons.org/licenses/by-nc-sa/3.0/). |
title | CDS: A Fold-change Based Statistical Test for Concomitant Identification of Distinctness and Similarity in Gene Expression Analysis |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5054499/ https://www.ncbi.nlm.nih.gov/pubmed/22917185 http://dx.doi.org/10.1016/j.gpb.2012.06.002 |
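
The abstract describes judging similarity and difference from the fold-change of two measurements assumed to be normally distributed with known variances. The following is a minimal sketch of that general idea, not the CDS test itself: the function names, the single fold threshold, and the plug-in use of the observed values as the means are all assumptions made for illustration. It also assumes both signals are positive and far enough from zero that the denominator is effectively never negative, so that the event X/Y < r can be rewritten as X - r·Y < 0.

```python
from math import sqrt
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal, used for its cdf


def prob_ratio_below(x, y, cv_x, cv_y, r):
    """Approximate P(X/Y < r) for independent normal X and Y.

    Hypothetical helper: X and Y are centred on the observed values x and y,
    with standard deviations cv_x * x and cv_y * y (known coefficients of
    variation). Assumes Y > 0 almost surely, so X/Y < r iff X - r*Y < 0.
    """
    if x <= 0 or y <= 0 or r <= 0:
        raise ValueError("x, y and r must be positive")
    sd_x = cv_x * x
    sd_y = cv_y * y
    # X - r*Y is normal with this mean and standard deviation.
    mean = x - r * y
    sd = sqrt(sd_x ** 2 + (r * sd_y) ** 2)
    return _STD_NORMAL.cdf(-mean / sd)


def similarity_and_difference(x, y, cv_x, cv_y, fold=2.0):
    """Return (p_similar, p_different) for a symmetric fold threshold.

    p_similar   ~ P(1/fold <= X/Y <= fold)
    p_different ~ P(X/Y < 1/fold) + P(X/Y > fold)
    """
    p_low = prob_ratio_below(x, y, cv_x, cv_y, 1.0 / fold)
    p_high = 1.0 - prob_ratio_below(x, y, cv_x, cv_y, fold)
    p_different = p_low + p_high
    return 1.0 - p_different, p_different


if __name__ == "__main__":
    # Two single measurements (N = 1) with an assumed 10% coefficient of variation.
    print(similarity_and_difference(100.0, 100.0, 0.1, 0.1))
    print(similarity_and_difference(400.0, 100.0, 0.1, 0.1))
```

Because both probabilities come from the same distribution of the ratio, one calculation yields a statement about similarity and a statement about difference at once, which is the sense in which the evaluation is concomitant. The published method at http://cds.ihes.fr should be consulted for the actual test, its error probabilities, and its confidence intervals.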