Cargando…

Patterns of diverse gene functions in genomic neighborhoods predict gene function and phenotype

Genes with similar roles in the cell cluster on chromosomes, thus benefiting from coordinated regulation. This allows gene function to be inferred by transferring annotations from genomic neighbors, following the guilt-by-association principle. We performed a systematic search for co-occurrence of &...

Descripción completa

Detalles Bibliográficos
Autores principales: Mihelčić, Matej, Šmuc, Tomislav, Supek, Fran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6925100/
https://www.ncbi.nlm.nih.gov/pubmed/31863070
http://dx.doi.org/10.1038/s41598-019-55984-0
_version_ 1783481844470120448
author Mihelčić, Matej
Šmuc, Tomislav
Supek, Fran
author_facet Mihelčić, Matej
Šmuc, Tomislav
Supek, Fran
author_sort Mihelčić, Matej
collection PubMed
description Genes with similar roles in the cell cluster on chromosomes, thus benefiting from coordinated regulation. This allows gene function to be inferred by transferring annotations from genomic neighbors, following the guilt-by-association principle. We performed a systematic search for co-occurrence of >1000 gene functions in genomic neighborhoods across 1669 prokaryotic, 49 fungal and 80 metazoan genomes, revealing prevalent patterns that cannot be explained by clustering of functionally similar genes. It is a very common occurrence that pairs of dissimilar gene functions – corresponding to semantically distant Gene Ontology terms – are significantly co-located on chromosomes. These neighborhood associations are often as conserved across genomes as the known associations between similar functions, suggesting selective benefits from clustering of certain diverse functions, which may conceivably play complementary roles in the cell. We propose a simple encoding of chromosomal gene order, the neighborhood function profiles (NFP), which draws on diverse gene clustering patterns to predict gene function and phenotype. NFPs yield a 26–46% increase in predictive power over state-of-the-art approaches that propagate function across neighborhoods, thus providing hundreds of novel, high-confidence gene function inferences per genome. Furthermore, we demonstrate that copy number-neutral structural variation that shapes gene function distribution across chromosomes can predict phenotype of individuals from their genome sequence.
format Online
Article
Text
id pubmed-6925100
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-69251002019-12-23 Patterns of diverse gene functions in genomic neighborhoods predict gene function and phenotype Mihelčić, Matej Šmuc, Tomislav Supek, Fran Sci Rep Article Genes with similar roles in the cell cluster on chromosomes, thus benefiting from coordinated regulation. This allows gene function to be inferred by transferring annotations from genomic neighbors, following the guilt-by-association principle. We performed a systematic search for co-occurrence of >1000 gene functions in genomic neighborhoods across 1669 prokaryotic, 49 fungal and 80 metazoan genomes, revealing prevalent patterns that cannot be explained by clustering of functionally similar genes. It is a very common occurrence that pairs of dissimilar gene functions – corresponding to semantically distant Gene Ontology terms – are significantly co-located on chromosomes. These neighborhood associations are often as conserved across genomes as the known associations between similar functions, suggesting selective benefits from clustering of certain diverse functions, which may conceivably play complementary roles in the cell. We propose a simple encoding of chromosomal gene order, the neighborhood function profiles (NFP), which draws on diverse gene clustering patterns to predict gene function and phenotype. NFPs yield a 26–46% increase in predictive power over state-of-the-art approaches that propagate function across neighborhoods, thus providing hundreds of novel, high-confidence gene function inferences per genome. Furthermore, we demonstrate that copy number-neutral structural variation that shapes gene function distribution across chromosomes can predict phenotype of individuals from their genome sequence. Nature Publishing Group UK 2019-12-20 /pmc/articles/PMC6925100/ /pubmed/31863070 http://dx.doi.org/10.1038/s41598-019-55984-0 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Mihelčić, Matej
Šmuc, Tomislav
Supek, Fran
Patterns of diverse gene functions in genomic neighborhoods predict gene function and phenotype
title Patterns of diverse gene functions in genomic neighborhoods predict gene function and phenotype
title_full Patterns of diverse gene functions in genomic neighborhoods predict gene function and phenotype
title_fullStr Patterns of diverse gene functions in genomic neighborhoods predict gene function and phenotype
title_full_unstemmed Patterns of diverse gene functions in genomic neighborhoods predict gene function and phenotype
title_short Patterns of diverse gene functions in genomic neighborhoods predict gene function and phenotype
title_sort patterns of diverse gene functions in genomic neighborhoods predict gene function and phenotype
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6925100/
https://www.ncbi.nlm.nih.gov/pubmed/31863070
http://dx.doi.org/10.1038/s41598-019-55984-0
work_keys_str_mv AT mihelcicmatej patternsofdiversegenefunctionsingenomicneighborhoodspredictgenefunctionandphenotype
AT smuctomislav patternsofdiversegenefunctionsingenomicneighborhoodspredictgenefunctionandphenotype
AT supekfran patternsofdiversegenefunctionsingenomicneighborhoodspredictgenefunctionandphenotype