Cargando…

ISoLDE: a data-driven statistical method for the inference of allelic imbalance in datasets with reciprocal crosses

MOTIVATION: Allelic imbalance (AI), i.e. the unequal expression of the alleles of the same gene in a single cell, affects a subset of genes in diploid organisms. One prominent example of AI is parental genomic imprinting, which results in parent-of-origin-dependent, mono-allelic expression of a limi...

Descripción completa

Detalles Bibliográficos
Autores principales: Reynès, Christelle, Kister, Guilhem, Rohmer, Marine, Bouschet, Tristan, Varrault, Annie, Dubois, Emeric, Rialle, Stéphanie, Journot, Laurent, Sabatier, Robert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9883709/
https://www.ncbi.nlm.nih.gov/pubmed/31350542
http://dx.doi.org/10.1093/bioinformatics/btz564
_version_ 1784879562986881024
author Reynès, Christelle
Kister, Guilhem
Rohmer, Marine
Bouschet, Tristan
Varrault, Annie
Dubois, Emeric
Rialle, Stéphanie
Journot, Laurent
Sabatier, Robert
author_facet Reynès, Christelle
Kister, Guilhem
Rohmer, Marine
Bouschet, Tristan
Varrault, Annie
Dubois, Emeric
Rialle, Stéphanie
Journot, Laurent
Sabatier, Robert
author_sort Reynès, Christelle
collection PubMed
description MOTIVATION: Allelic imbalance (AI), i.e. the unequal expression of the alleles of the same gene in a single cell, affects a subset of genes in diploid organisms. One prominent example of AI is parental genomic imprinting, which results in parent-of-origin-dependent, mono-allelic expression of a limited number of genes in metatherian and eutherian mammals and in angiosperms. Currently available methods for identifying AI rely on data modeling and come with the associated limitations. RESULTS: We have designed ISoLDE (Integrative Statistics of alleLe Dependent Expression), a novel nonparametric statistical method that takes into account both AI and the characteristics of RNA-seq data to infer allelic expression bias when at least two biological replicates are available for reciprocal crosses. ISoLDE learns the distribution of a specific test statistic from the data and calls genes ‘allelically imbalanced’, ‘bi-allelically expressed’ or ‘undetermined’. Depending on the number of replicates, predefined thresholds or permutations are used to make calls. We benchmarked ISoLDE against published methods, and showed that ISoLDE compared favorably with respect to sensitivity, specificity and robustness to the number of replicates. Using ISoLDE on different RNA-seq datasets generated from hybrid mouse tissues, we did not discover novel imprinted genes (IGs), confirming the most conservative estimations of IG number. AVAILABILITY AND IMPLEMENTATION: ISoLDE has been implemented as a Bioconductor package available at http://bioconductor.org/packages/ISoLDE/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9883709
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98837092023-02-01 ISoLDE: a data-driven statistical method for the inference of allelic imbalance in datasets with reciprocal crosses Reynès, Christelle Kister, Guilhem Rohmer, Marine Bouschet, Tristan Varrault, Annie Dubois, Emeric Rialle, Stéphanie Journot, Laurent Sabatier, Robert Bioinformatics Original Papers MOTIVATION: Allelic imbalance (AI), i.e. the unequal expression of the alleles of the same gene in a single cell, affects a subset of genes in diploid organisms. One prominent example of AI is parental genomic imprinting, which results in parent-of-origin-dependent, mono-allelic expression of a limited number of genes in metatherian and eutherian mammals and in angiosperms. Currently available methods for identifying AI rely on data modeling and come with the associated limitations. RESULTS: We have designed ISoLDE (Integrative Statistics of alleLe Dependent Expression), a novel nonparametric statistical method that takes into account both AI and the characteristics of RNA-seq data to infer allelic expression bias when at least two biological replicates are available for reciprocal crosses. ISoLDE learns the distribution of a specific test statistic from the data and calls genes ‘allelically imbalanced’, ‘bi-allelically expressed’ or ‘undetermined’. Depending on the number of replicates, predefined thresholds or permutations are used to make calls. We benchmarked ISoLDE against published methods, and showed that ISoLDE compared favorably with respect to sensitivity, specificity and robustness to the number of replicates. Using ISoLDE on different RNA-seq datasets generated from hybrid mouse tissues, we did not discover novel imprinted genes (IGs), confirming the most conservative estimations of IG number. AVAILABILITY AND IMPLEMENTATION: ISoLDE has been implemented as a Bioconductor package available at http://bioconductor.org/packages/ISoLDE/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-07-26 /pmc/articles/PMC9883709/ /pubmed/31350542 http://dx.doi.org/10.1093/bioinformatics/btz564 Text en © The Author(s) 2019. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Reynès, Christelle
Kister, Guilhem
Rohmer, Marine
Bouschet, Tristan
Varrault, Annie
Dubois, Emeric
Rialle, Stéphanie
Journot, Laurent
Sabatier, Robert
ISoLDE: a data-driven statistical method for the inference of allelic imbalance in datasets with reciprocal crosses
title ISoLDE: a data-driven statistical method for the inference of allelic imbalance in datasets with reciprocal crosses
title_full ISoLDE: a data-driven statistical method for the inference of allelic imbalance in datasets with reciprocal crosses
title_fullStr ISoLDE: a data-driven statistical method for the inference of allelic imbalance in datasets with reciprocal crosses
title_full_unstemmed ISoLDE: a data-driven statistical method for the inference of allelic imbalance in datasets with reciprocal crosses
title_short ISoLDE: a data-driven statistical method for the inference of allelic imbalance in datasets with reciprocal crosses
title_sort isolde: a data-driven statistical method for the inference of allelic imbalance in datasets with reciprocal crosses
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9883709/
https://www.ncbi.nlm.nih.gov/pubmed/31350542
http://dx.doi.org/10.1093/bioinformatics/btz564
work_keys_str_mv AT reyneschristelle isoldeadatadrivenstatisticalmethodfortheinferenceofallelicimbalanceindatasetswithreciprocalcrosses
AT kisterguilhem isoldeadatadrivenstatisticalmethodfortheinferenceofallelicimbalanceindatasetswithreciprocalcrosses
AT rohmermarine isoldeadatadrivenstatisticalmethodfortheinferenceofallelicimbalanceindatasetswithreciprocalcrosses
AT bouschettristan isoldeadatadrivenstatisticalmethodfortheinferenceofallelicimbalanceindatasetswithreciprocalcrosses
AT varraultannie isoldeadatadrivenstatisticalmethodfortheinferenceofallelicimbalanceindatasetswithreciprocalcrosses
AT duboisemeric isoldeadatadrivenstatisticalmethodfortheinferenceofallelicimbalanceindatasetswithreciprocalcrosses
AT riallestephanie isoldeadatadrivenstatisticalmethodfortheinferenceofallelicimbalanceindatasetswithreciprocalcrosses
AT journotlaurent isoldeadatadrivenstatisticalmethodfortheinferenceofallelicimbalanceindatasetswithreciprocalcrosses
AT sabatierrobert isoldeadatadrivenstatisticalmethodfortheinferenceofallelicimbalanceindatasetswithreciprocalcrosses