Cargando…

NET-GE: a novel NETwork-based Gene Enrichment for detecting biological processes associated to Mendelian diseases

BACKGROUND: Enrichment analysis is a widely applied procedure for shedding light on the molecular mechanisms and functions at the basis of phenotypes, for enlarging the dataset of possibly related genes/proteins and for helping interpretation and prioritization of newly determined variations. Severa...

Descripción completa

Detalles Bibliográficos
Autores principales: Di Lena, Pietro, Martelli, Pier Luigi, Fariselli, Piero, Casadio, Rita
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4480278/
https://www.ncbi.nlm.nih.gov/pubmed/26110971
http://dx.doi.org/10.1186/1471-2164-16-S8-S6
_version_ 1782378132448215040
author Di Lena, Pietro
Martelli, Pier Luigi
Fariselli, Piero
Casadio, Rita
author_facet Di Lena, Pietro
Martelli, Pier Luigi
Fariselli, Piero
Casadio, Rita
author_sort Di Lena, Pietro
collection PubMed
description BACKGROUND: Enrichment analysis is a widely applied procedure for shedding light on the molecular mechanisms and functions at the basis of phenotypes, for enlarging the dataset of possibly related genes/proteins and for helping interpretation and prioritization of newly determined variations. Several standard and Network-based enrichment methods are available. Both approaches rely on the annotations that characterize the genes/proteins included in the input set; network based ones also include in different ways physical and functional relationships among different genes or proteins that can be extracted from the available biological networks of interactions. RESULTS: Here we describe a novel procedure based on the extraction from the STRING interactome of sub-networks connecting proteins that share the same Gene Ontology(GO) terms for Biological Process (BP). Enrichment analysis is performed by mapping the protein set to be analyzed on the sub-networks, and then by collecting the corresponding annotations. We test the ability of our enrichment method in finding annotation terms disregarded by other enrichment methods available. We benchmarked 244 sets of proteins associated to different Mendelian diseases, according to the OMIM web resource. In 143 cases (58%), the network-based procedure extracts GO terms neglected by the standard method, and in 86 cases (35%), some of the newly enriched GO terms are not included in the set of annotations characterizing the input proteins. We present in detail six cases where our network-based enrichment provides an insight into the biological basis of the diseases, outperforming other freely available network-based methods. CONCLUSIONS: Considering a set of proteins in the context of their interaction network can help in better defining their functions. Our novel method exploits the information contained in the STRING database for building the minimal connecting network containing all the proteins annotated with the same GO term. The enrichment procedure is performed considering the GO-specific network modules and, when tested on the OMIM-derived benchmark sets, it is able to extract enrichment terms neglected by other methods. Our procedure is effective even when the size of the input protein set is small, requiring at least two input proteins.
format Online
Article
Text
id pubmed-4480278
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44802782015-07-10 NET-GE: a novel NETwork-based Gene Enrichment for detecting biological processes associated to Mendelian diseases Di Lena, Pietro Martelli, Pier Luigi Fariselli, Piero Casadio, Rita BMC Genomics Research BACKGROUND: Enrichment analysis is a widely applied procedure for shedding light on the molecular mechanisms and functions at the basis of phenotypes, for enlarging the dataset of possibly related genes/proteins and for helping interpretation and prioritization of newly determined variations. Several standard and Network-based enrichment methods are available. Both approaches rely on the annotations that characterize the genes/proteins included in the input set; network based ones also include in different ways physical and functional relationships among different genes or proteins that can be extracted from the available biological networks of interactions. RESULTS: Here we describe a novel procedure based on the extraction from the STRING interactome of sub-networks connecting proteins that share the same Gene Ontology(GO) terms for Biological Process (BP). Enrichment analysis is performed by mapping the protein set to be analyzed on the sub-networks, and then by collecting the corresponding annotations. We test the ability of our enrichment method in finding annotation terms disregarded by other enrichment methods available. We benchmarked 244 sets of proteins associated to different Mendelian diseases, according to the OMIM web resource. In 143 cases (58%), the network-based procedure extracts GO terms neglected by the standard method, and in 86 cases (35%), some of the newly enriched GO terms are not included in the set of annotations characterizing the input proteins. We present in detail six cases where our network-based enrichment provides an insight into the biological basis of the diseases, outperforming other freely available network-based methods. CONCLUSIONS: Considering a set of proteins in the context of their interaction network can help in better defining their functions. Our novel method exploits the information contained in the STRING database for building the minimal connecting network containing all the proteins annotated with the same GO term. The enrichment procedure is performed considering the GO-specific network modules and, when tested on the OMIM-derived benchmark sets, it is able to extract enrichment terms neglected by other methods. Our procedure is effective even when the size of the input protein set is small, requiring at least two input proteins. BioMed Central 2015-06-18 /pmc/articles/PMC4480278/ /pubmed/26110971 http://dx.doi.org/10.1186/1471-2164-16-S8-S6 Text en Copyright © 2015 Di Lena et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Di Lena, Pietro
Martelli, Pier Luigi
Fariselli, Piero
Casadio, Rita
NET-GE: a novel NETwork-based Gene Enrichment for detecting biological processes associated to Mendelian diseases
title NET-GE: a novel NETwork-based Gene Enrichment for detecting biological processes associated to Mendelian diseases
title_full NET-GE: a novel NETwork-based Gene Enrichment for detecting biological processes associated to Mendelian diseases
title_fullStr NET-GE: a novel NETwork-based Gene Enrichment for detecting biological processes associated to Mendelian diseases
title_full_unstemmed NET-GE: a novel NETwork-based Gene Enrichment for detecting biological processes associated to Mendelian diseases
title_short NET-GE: a novel NETwork-based Gene Enrichment for detecting biological processes associated to Mendelian diseases
title_sort net-ge: a novel network-based gene enrichment for detecting biological processes associated to mendelian diseases
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4480278/
https://www.ncbi.nlm.nih.gov/pubmed/26110971
http://dx.doi.org/10.1186/1471-2164-16-S8-S6
work_keys_str_mv AT dilenapietro netgeanovelnetworkbasedgeneenrichmentfordetectingbiologicalprocessesassociatedtomendeliandiseases
AT martellipierluigi netgeanovelnetworkbasedgeneenrichmentfordetectingbiologicalprocessesassociatedtomendeliandiseases
AT farisellipiero netgeanovelnetworkbasedgeneenrichmentfordetectingbiologicalprocessesassociatedtomendeliandiseases
AT casadiorita netgeanovelnetworkbasedgeneenrichmentfordetectingbiologicalprocessesassociatedtomendeliandiseases