Cargando…

Integrative regression network for genomic association study

BACKGROUND: The increasing availability of multiple types of genomic profiles measured from the same cancer patients has provided numerous opportunities for investigating genomic mechanisms underlying cancer. In particular, association studies of gene expression traits with respect to multi-layered...

Descripción completa

Detalles Bibliográficos
Autores principales: Vangimalla, Reddy Rani, Jeong, Hyun-hwan, Sohn, Kyung-Ah
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4989890/
https://www.ncbi.nlm.nih.gov/pubmed/27535739
http://dx.doi.org/10.1186/s12920-016-0192-7
_version_ 1782448622281949184
author Vangimalla, Reddy Rani
Jeong, Hyun-hwan
Sohn, Kyung-Ah
author_facet Vangimalla, Reddy Rani
Jeong, Hyun-hwan
Sohn, Kyung-Ah
author_sort Vangimalla, Reddy Rani
collection PubMed
description BACKGROUND: The increasing availability of multiple types of genomic profiles measured from the same cancer patients has provided numerous opportunities for investigating genomic mechanisms underlying cancer. In particular, association studies of gene expression traits with respect to multi-layered genomic features are highly useful for uncovering the underlying mechanism. Conventional correlation-based association tests are limited because they are prone to revealing indirect associations. Moreover, integration of multiple types of genomic features raises another challenge. METHODS: In this study, we propose a new framework for association studies called integrative regression network that identifies genomic associations on multiple high-dimensional genomic profiles by taking into account the associations between as well as within profiles. We employed high-dimensional regression techniques to first identify the associations between different genomic profiles. Based on the resulting regression coefficients, a regression network was constructed within each profile. For example, two methylation features having similar regression coefficients with respect to a number of gene expression traits are likely to be involved in the same biological process and therefore we define an edge between two methylation features in the regression network. To extract more reliable associations, multiple sparse structured regression techniques were applied and the resulting multiple networks were merged as the integrative regression network using a similarity network fusion technique. RESULTS: Experiments were carried out using four different sparse structured regression methods on five cancer types from TCGA. The advantages and disadvantages of each regression method were also explored. We find there was large inconsistency in the results from different regression methods, which supports the need to extract the proposed integrative regression network from multiple complimentary regression techniques. Fusing multiple regression networks by using similarity measurements led to the identification of significant gene pairs and a resulting network with better topological properties. CONCLUSIONS: We developed and validated the integrative regression network scheme on multi-layered genomic profiles from TCGA. Our method facilitates identification of the strong signals as well as weaker signals by fusing information from different regression techniques. It could be extended to integrate results obtained from different cancer types as well. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12920-016-0192-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4989890
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49898902016-08-30 Integrative regression network for genomic association study Vangimalla, Reddy Rani Jeong, Hyun-hwan Sohn, Kyung-Ah BMC Med Genomics Research BACKGROUND: The increasing availability of multiple types of genomic profiles measured from the same cancer patients has provided numerous opportunities for investigating genomic mechanisms underlying cancer. In particular, association studies of gene expression traits with respect to multi-layered genomic features are highly useful for uncovering the underlying mechanism. Conventional correlation-based association tests are limited because they are prone to revealing indirect associations. Moreover, integration of multiple types of genomic features raises another challenge. METHODS: In this study, we propose a new framework for association studies called integrative regression network that identifies genomic associations on multiple high-dimensional genomic profiles by taking into account the associations between as well as within profiles. We employed high-dimensional regression techniques to first identify the associations between different genomic profiles. Based on the resulting regression coefficients, a regression network was constructed within each profile. For example, two methylation features having similar regression coefficients with respect to a number of gene expression traits are likely to be involved in the same biological process and therefore we define an edge between two methylation features in the regression network. To extract more reliable associations, multiple sparse structured regression techniques were applied and the resulting multiple networks were merged as the integrative regression network using a similarity network fusion technique. RESULTS: Experiments were carried out using four different sparse structured regression methods on five cancer types from TCGA. The advantages and disadvantages of each regression method were also explored. We find there was large inconsistency in the results from different regression methods, which supports the need to extract the proposed integrative regression network from multiple complimentary regression techniques. Fusing multiple regression networks by using similarity measurements led to the identification of significant gene pairs and a resulting network with better topological properties. CONCLUSIONS: We developed and validated the integrative regression network scheme on multi-layered genomic profiles from TCGA. Our method facilitates identification of the strong signals as well as weaker signals by fusing information from different regression techniques. It could be extended to integrate results obtained from different cancer types as well. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12920-016-0192-7) contains supplementary material, which is available to authorized users. BioMed Central 2016-08-12 /pmc/articles/PMC4989890/ /pubmed/27535739 http://dx.doi.org/10.1186/s12920-016-0192-7 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Vangimalla, Reddy Rani
Jeong, Hyun-hwan
Sohn, Kyung-Ah
Integrative regression network for genomic association study
title Integrative regression network for genomic association study
title_full Integrative regression network for genomic association study
title_fullStr Integrative regression network for genomic association study
title_full_unstemmed Integrative regression network for genomic association study
title_short Integrative regression network for genomic association study
title_sort integrative regression network for genomic association study
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4989890/
https://www.ncbi.nlm.nih.gov/pubmed/27535739
http://dx.doi.org/10.1186/s12920-016-0192-7
work_keys_str_mv AT vangimallareddyrani integrativeregressionnetworkforgenomicassociationstudy
AT jeonghyunhwan integrativeregressionnetworkforgenomicassociationstudy
AT sohnkyungah integrativeregressionnetworkforgenomicassociationstudy