Cargando…

Overcoming the matched-sample bottleneck: an orthogonal approach to integrate omic data

MicroRNAs (miRNAs) are small non-coding RNA molecules whose primary function is to regulate the expression of gene products via hybridization to mRNA transcripts, resulting in suppression of translation or mRNA degradation. Although miRNAs have been implicated in complex diseases, including cancer,...

Descripción completa

Detalles Bibliográficos
Autores principales: Nguyen, Tin, Diaz, Diana, Tagett, Rebecca, Draghici, Sorin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4941544/
https://www.ncbi.nlm.nih.gov/pubmed/27403564
http://dx.doi.org/10.1038/srep29251
_version_ 1782442315101503488
author Nguyen, Tin
Diaz, Diana
Tagett, Rebecca
Draghici, Sorin
author_facet Nguyen, Tin
Diaz, Diana
Tagett, Rebecca
Draghici, Sorin
author_sort Nguyen, Tin
collection PubMed
description MicroRNAs (miRNAs) are small non-coding RNA molecules whose primary function is to regulate the expression of gene products via hybridization to mRNA transcripts, resulting in suppression of translation or mRNA degradation. Although miRNAs have been implicated in complex diseases, including cancer, their impact on distinct biological pathways and phenotypes is largely unknown. Current integration approaches require sample-matched miRNA/mRNA datasets, resulting in limited applicability in practice. Since these approaches cannot integrate heterogeneous information available across independent experiments, they neither account for bias inherent in individual studies, nor do they benefit from increased sample size. Here we present a novel framework able to integrate miRNA and mRNA data (vertical data integration) available in independent studies (horizontal meta-analysis) allowing for a comprehensive analysis of the given phenotypes. To demonstrate the utility of our method, we conducted a meta-analysis of pancreatic and colorectal cancer, using 1,471 samples from 15 mRNA and 14 miRNA expression datasets. Our two-dimensional data integration approach greatly increases the power of statistical analysis and correctly identifies pathways known to be implicated in the phenotypes. The proposed framework is sufficiently general to integrate other types of data obtained from high-throughput assays.
format Online
Article
Text
id pubmed-4941544
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-49415442016-07-20 Overcoming the matched-sample bottleneck: an orthogonal approach to integrate omic data Nguyen, Tin Diaz, Diana Tagett, Rebecca Draghici, Sorin Sci Rep Article MicroRNAs (miRNAs) are small non-coding RNA molecules whose primary function is to regulate the expression of gene products via hybridization to mRNA transcripts, resulting in suppression of translation or mRNA degradation. Although miRNAs have been implicated in complex diseases, including cancer, their impact on distinct biological pathways and phenotypes is largely unknown. Current integration approaches require sample-matched miRNA/mRNA datasets, resulting in limited applicability in practice. Since these approaches cannot integrate heterogeneous information available across independent experiments, they neither account for bias inherent in individual studies, nor do they benefit from increased sample size. Here we present a novel framework able to integrate miRNA and mRNA data (vertical data integration) available in independent studies (horizontal meta-analysis) allowing for a comprehensive analysis of the given phenotypes. To demonstrate the utility of our method, we conducted a meta-analysis of pancreatic and colorectal cancer, using 1,471 samples from 15 mRNA and 14 miRNA expression datasets. Our two-dimensional data integration approach greatly increases the power of statistical analysis and correctly identifies pathways known to be implicated in the phenotypes. The proposed framework is sufficiently general to integrate other types of data obtained from high-throughput assays. Nature Publishing Group 2016-07-12 /pmc/articles/PMC4941544/ /pubmed/27403564 http://dx.doi.org/10.1038/srep29251 Text en Copyright © 2016, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Nguyen, Tin
Diaz, Diana
Tagett, Rebecca
Draghici, Sorin
Overcoming the matched-sample bottleneck: an orthogonal approach to integrate omic data
title Overcoming the matched-sample bottleneck: an orthogonal approach to integrate omic data
title_full Overcoming the matched-sample bottleneck: an orthogonal approach to integrate omic data
title_fullStr Overcoming the matched-sample bottleneck: an orthogonal approach to integrate omic data
title_full_unstemmed Overcoming the matched-sample bottleneck: an orthogonal approach to integrate omic data
title_short Overcoming the matched-sample bottleneck: an orthogonal approach to integrate omic data
title_sort overcoming the matched-sample bottleneck: an orthogonal approach to integrate omic data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4941544/
https://www.ncbi.nlm.nih.gov/pubmed/27403564
http://dx.doi.org/10.1038/srep29251
work_keys_str_mv AT nguyentin overcomingthematchedsamplebottleneckanorthogonalapproachtointegrateomicdata
AT diazdiana overcomingthematchedsamplebottleneckanorthogonalapproachtointegrateomicdata
AT tagettrebecca overcomingthematchedsamplebottleneckanorthogonalapproachtointegrateomicdata
AT draghicisorin overcomingthematchedsamplebottleneckanorthogonalapproachtointegrateomicdata