Cargando…
ISoLDE: a data-driven statistical method for the inference of allelic imbalance in datasets with reciprocal crosses
MOTIVATION: Allelic imbalance (AI), i.e. the unequal expression of the alleles of the same gene in a single cell, affects a subset of genes in diploid organisms. One prominent example of AI is parental genomic imprinting, which results in parent-of-origin-dependent, mono-allelic expression of a limi...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9883709/ https://www.ncbi.nlm.nih.gov/pubmed/31350542 http://dx.doi.org/10.1093/bioinformatics/btz564 |
_version_ | 1784879562986881024 |
---|---|
author | Reynès, Christelle Kister, Guilhem Rohmer, Marine Bouschet, Tristan Varrault, Annie Dubois, Emeric Rialle, Stéphanie Journot, Laurent Sabatier, Robert |
author_facet | Reynès, Christelle Kister, Guilhem Rohmer, Marine Bouschet, Tristan Varrault, Annie Dubois, Emeric Rialle, Stéphanie Journot, Laurent Sabatier, Robert |
author_sort | Reynès, Christelle |
collection | PubMed |
description | MOTIVATION: Allelic imbalance (AI), i.e. the unequal expression of the alleles of the same gene in a single cell, affects a subset of genes in diploid organisms. One prominent example of AI is parental genomic imprinting, which results in parent-of-origin-dependent, mono-allelic expression of a limited number of genes in metatherian and eutherian mammals and in angiosperms. Currently available methods for identifying AI rely on data modeling and come with the associated limitations. RESULTS: We have designed ISoLDE (Integrative Statistics of alleLe Dependent Expression), a novel nonparametric statistical method that takes into account both AI and the characteristics of RNA-seq data to infer allelic expression bias when at least two biological replicates are available for reciprocal crosses. ISoLDE learns the distribution of a specific test statistic from the data and calls genes ‘allelically imbalanced’, ‘bi-allelically expressed’ or ‘undetermined’. Depending on the number of replicates, predefined thresholds or permutations are used to make calls. We benchmarked ISoLDE against published methods, and showed that ISoLDE compared favorably with respect to sensitivity, specificity and robustness to the number of replicates. Using ISoLDE on different RNA-seq datasets generated from hybrid mouse tissues, we did not discover novel imprinted genes (IGs), confirming the most conservative estimations of IG number. AVAILABILITY AND IMPLEMENTATION: ISoLDE has been implemented as a Bioconductor package available at http://bioconductor.org/packages/ISoLDE/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-9883709 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-98837092023-02-01 ISoLDE: a data-driven statistical method for the inference of allelic imbalance in datasets with reciprocal crosses Reynès, Christelle Kister, Guilhem Rohmer, Marine Bouschet, Tristan Varrault, Annie Dubois, Emeric Rialle, Stéphanie Journot, Laurent Sabatier, Robert Bioinformatics Original Papers MOTIVATION: Allelic imbalance (AI), i.e. the unequal expression of the alleles of the same gene in a single cell, affects a subset of genes in diploid organisms. One prominent example of AI is parental genomic imprinting, which results in parent-of-origin-dependent, mono-allelic expression of a limited number of genes in metatherian and eutherian mammals and in angiosperms. Currently available methods for identifying AI rely on data modeling and come with the associated limitations. RESULTS: We have designed ISoLDE (Integrative Statistics of alleLe Dependent Expression), a novel nonparametric statistical method that takes into account both AI and the characteristics of RNA-seq data to infer allelic expression bias when at least two biological replicates are available for reciprocal crosses. ISoLDE learns the distribution of a specific test statistic from the data and calls genes ‘allelically imbalanced’, ‘bi-allelically expressed’ or ‘undetermined’. Depending on the number of replicates, predefined thresholds or permutations are used to make calls. We benchmarked ISoLDE against published methods, and showed that ISoLDE compared favorably with respect to sensitivity, specificity and robustness to the number of replicates. Using ISoLDE on different RNA-seq datasets generated from hybrid mouse tissues, we did not discover novel imprinted genes (IGs), confirming the most conservative estimations of IG number. AVAILABILITY AND IMPLEMENTATION: ISoLDE has been implemented as a Bioconductor package available at http://bioconductor.org/packages/ISoLDE/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-07-26 /pmc/articles/PMC9883709/ /pubmed/31350542 http://dx.doi.org/10.1093/bioinformatics/btz564 Text en © The Author(s) 2019. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Papers Reynès, Christelle Kister, Guilhem Rohmer, Marine Bouschet, Tristan Varrault, Annie Dubois, Emeric Rialle, Stéphanie Journot, Laurent Sabatier, Robert ISoLDE: a data-driven statistical method for the inference of allelic imbalance in datasets with reciprocal crosses |
title | ISoLDE: a data-driven statistical method for the inference of allelic imbalance in datasets with reciprocal crosses |
title_full | ISoLDE: a data-driven statistical method for the inference of allelic imbalance in datasets with reciprocal crosses |
title_fullStr | ISoLDE: a data-driven statistical method for the inference of allelic imbalance in datasets with reciprocal crosses |
title_full_unstemmed | ISoLDE: a data-driven statistical method for the inference of allelic imbalance in datasets with reciprocal crosses |
title_short | ISoLDE: a data-driven statistical method for the inference of allelic imbalance in datasets with reciprocal crosses |
title_sort | isolde: a data-driven statistical method for the inference of allelic imbalance in datasets with reciprocal crosses |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9883709/ https://www.ncbi.nlm.nih.gov/pubmed/31350542 http://dx.doi.org/10.1093/bioinformatics/btz564 |
work_keys_str_mv | AT reyneschristelle isoldeadatadrivenstatisticalmethodfortheinferenceofallelicimbalanceindatasetswithreciprocalcrosses AT kisterguilhem isoldeadatadrivenstatisticalmethodfortheinferenceofallelicimbalanceindatasetswithreciprocalcrosses AT rohmermarine isoldeadatadrivenstatisticalmethodfortheinferenceofallelicimbalanceindatasetswithreciprocalcrosses AT bouschettristan isoldeadatadrivenstatisticalmethodfortheinferenceofallelicimbalanceindatasetswithreciprocalcrosses AT varraultannie isoldeadatadrivenstatisticalmethodfortheinferenceofallelicimbalanceindatasetswithreciprocalcrosses AT duboisemeric isoldeadatadrivenstatisticalmethodfortheinferenceofallelicimbalanceindatasetswithreciprocalcrosses AT riallestephanie isoldeadatadrivenstatisticalmethodfortheinferenceofallelicimbalanceindatasetswithreciprocalcrosses AT journotlaurent isoldeadatadrivenstatisticalmethodfortheinferenceofallelicimbalanceindatasetswithreciprocalcrosses AT sabatierrobert isoldeadatadrivenstatisticalmethodfortheinferenceofallelicimbalanceindatasetswithreciprocalcrosses |