Cargando…

Improved Discovery of Molecular Interactions in Genome-Scale Data with Adaptive Model-Based Normalization

BACKGROUND: High throughput molecular-interaction studies using immunoprecipitations (IP) or affinity purifications are powerful and widely used in biology research. One of many important applications of this method is to identify the set of RNAs that interact with a particular RNA-binding protein (...

Descripción completa

Detalles Bibliográficos
Autores principales: Salzman, Julia, Klass, Daniel M., Brown, Patrick O.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3551948/
https://www.ncbi.nlm.nih.gov/pubmed/23349766
http://dx.doi.org/10.1371/journal.pone.0053930
_version_ 1782256648992063488
author Salzman, Julia
Klass, Daniel M.
Brown, Patrick O.
author_facet Salzman, Julia
Klass, Daniel M.
Brown, Patrick O.
author_sort Salzman, Julia
collection PubMed
description BACKGROUND: High throughput molecular-interaction studies using immunoprecipitations (IP) or affinity purifications are powerful and widely used in biology research. One of many important applications of this method is to identify the set of RNAs that interact with a particular RNA-binding protein (RBP). Here, the unique statistical challenge presented is to delineate a specific set of RNAs that are enriched in one sample relative to another, typically a specific IP compared to a non-specific control to model background. The choice of normalization procedure critically impacts the number of RNAs that will be identified as interacting with an RBP at a given significance threshold – yet existing normalization methods make assumptions that are often fundamentally inaccurate when applied to IP enrichment data. METHODS: In this paper, we present a new normalization methodology that is specifically designed for identifying enriched RNA or DNA sequences in an IP. The normalization (called adaptive or AD normalization) uses a basic model of the IP experiment and is not a variant of mean, quantile, or other methodology previously proposed. The approach is evaluated statistically and tested with simulated and empirical data. RESULTS AND CONCLUSIONS: The adaptive (AD) normalization method results in a greatly increased range in the number of enriched RNAs identified, fewer false positives, and overall better concordance with independent biological evidence, for the RBPs we analyzed, compared to median normalization. The approach is also applicable to the study of pairwise RNA, DNA and protein interactions such as the analysis of transcription factors via chromatin immunoprecipitation (ChIP) or any other experiments where samples from two conditions, one of which contains an enriched subset of the other, are studied.
format Online
Article
Text
id pubmed-3551948
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-35519482013-01-24 Improved Discovery of Molecular Interactions in Genome-Scale Data with Adaptive Model-Based Normalization Salzman, Julia Klass, Daniel M. Brown, Patrick O. PLoS One Research Article BACKGROUND: High throughput molecular-interaction studies using immunoprecipitations (IP) or affinity purifications are powerful and widely used in biology research. One of many important applications of this method is to identify the set of RNAs that interact with a particular RNA-binding protein (RBP). Here, the unique statistical challenge presented is to delineate a specific set of RNAs that are enriched in one sample relative to another, typically a specific IP compared to a non-specific control to model background. The choice of normalization procedure critically impacts the number of RNAs that will be identified as interacting with an RBP at a given significance threshold – yet existing normalization methods make assumptions that are often fundamentally inaccurate when applied to IP enrichment data. METHODS: In this paper, we present a new normalization methodology that is specifically designed for identifying enriched RNA or DNA sequences in an IP. The normalization (called adaptive or AD normalization) uses a basic model of the IP experiment and is not a variant of mean, quantile, or other methodology previously proposed. The approach is evaluated statistically and tested with simulated and empirical data. RESULTS AND CONCLUSIONS: The adaptive (AD) normalization method results in a greatly increased range in the number of enriched RNAs identified, fewer false positives, and overall better concordance with independent biological evidence, for the RBPs we analyzed, compared to median normalization. The approach is also applicable to the study of pairwise RNA, DNA and protein interactions such as the analysis of transcription factors via chromatin immunoprecipitation (ChIP) or any other experiments where samples from two conditions, one of which contains an enriched subset of the other, are studied. Public Library of Science 2013-01-22 /pmc/articles/PMC3551948/ /pubmed/23349766 http://dx.doi.org/10.1371/journal.pone.0053930 Text en © 2013 Salzman et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Salzman, Julia
Klass, Daniel M.
Brown, Patrick O.
Improved Discovery of Molecular Interactions in Genome-Scale Data with Adaptive Model-Based Normalization
title Improved Discovery of Molecular Interactions in Genome-Scale Data with Adaptive Model-Based Normalization
title_full Improved Discovery of Molecular Interactions in Genome-Scale Data with Adaptive Model-Based Normalization
title_fullStr Improved Discovery of Molecular Interactions in Genome-Scale Data with Adaptive Model-Based Normalization
title_full_unstemmed Improved Discovery of Molecular Interactions in Genome-Scale Data with Adaptive Model-Based Normalization
title_short Improved Discovery of Molecular Interactions in Genome-Scale Data with Adaptive Model-Based Normalization
title_sort improved discovery of molecular interactions in genome-scale data with adaptive model-based normalization
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3551948/
https://www.ncbi.nlm.nih.gov/pubmed/23349766
http://dx.doi.org/10.1371/journal.pone.0053930
work_keys_str_mv AT salzmanjulia improveddiscoveryofmolecularinteractionsingenomescaledatawithadaptivemodelbasednormalization
AT klassdanielm improveddiscoveryofmolecularinteractionsingenomescaledatawithadaptivemodelbasednormalization
AT brownpatricko improveddiscoveryofmolecularinteractionsingenomescaledatawithadaptivemodelbasednormalization