Cargando…

Gene Size Matters: An Analysis of Gene Length in the Human Genome

While it is expected for gene length to be associated with factors such as intron number and evolutionary conservation, we are yet to understand the connections between gene length and function in the human genome. In this study, we show that, as expected, there is a strong positive correlation betw...

Descripción completa

Detalles Bibliográficos
Autores principales: Lopes, Inês, Altab, Gulam, Raina, Priyanka, de Magalhães, João Pedro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7905317/
https://www.ncbi.nlm.nih.gov/pubmed/33643374
http://dx.doi.org/10.3389/fgene.2021.559998
_version_ 1783655087573303296
author Lopes, Inês
Altab, Gulam
Raina, Priyanka
de Magalhães, João Pedro
author_facet Lopes, Inês
Altab, Gulam
Raina, Priyanka
de Magalhães, João Pedro
author_sort Lopes, Inês
collection PubMed
description While it is expected for gene length to be associated with factors such as intron number and evolutionary conservation, we are yet to understand the connections between gene length and function in the human genome. In this study, we show that, as expected, there is a strong positive correlation between gene length, transcript length, and protein size as well as a correlation with the number of genetic variants and introns. Among tissue-specific genes, we find that the longest transcripts tend to be expressed in the blood vessels, nerves, thyroid, cervix uteri, and the brain, while the smallest transcripts tend to be expressed in the pancreas, skin, stomach, vagina, and testis. We report, as shown previously, that natural selection suppresses changes for genes with longer transcripts and promotes changes for genes with smaller transcripts. We also observe that genes with longer transcripts tend to have a higher number of co-expressed genes and protein-protein interactions, as well as more associated publications. In the functional analysis, we show that bigger transcripts are often associated with neuronal development, while smaller transcripts tend to play roles in skin development and in the immune system. Furthermore, pathways related to cancer, neurons, and heart diseases tend to have genes with longer transcripts, with smaller transcripts being present in pathways related to immune responses and neurodegenerative diseases. Based on our results, we hypothesize that longer genes tend to be associated with functions that are important in the early development stages, while smaller genes tend to play a role in functions that are important throughout the whole life, like the immune system, which requires fast responses.
format Online
Article
Text
id pubmed-7905317
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-79053172021-02-26 Gene Size Matters: An Analysis of Gene Length in the Human Genome Lopes, Inês Altab, Gulam Raina, Priyanka de Magalhães, João Pedro Front Genet Genetics While it is expected for gene length to be associated with factors such as intron number and evolutionary conservation, we are yet to understand the connections between gene length and function in the human genome. In this study, we show that, as expected, there is a strong positive correlation between gene length, transcript length, and protein size as well as a correlation with the number of genetic variants and introns. Among tissue-specific genes, we find that the longest transcripts tend to be expressed in the blood vessels, nerves, thyroid, cervix uteri, and the brain, while the smallest transcripts tend to be expressed in the pancreas, skin, stomach, vagina, and testis. We report, as shown previously, that natural selection suppresses changes for genes with longer transcripts and promotes changes for genes with smaller transcripts. We also observe that genes with longer transcripts tend to have a higher number of co-expressed genes and protein-protein interactions, as well as more associated publications. In the functional analysis, we show that bigger transcripts are often associated with neuronal development, while smaller transcripts tend to play roles in skin development and in the immune system. Furthermore, pathways related to cancer, neurons, and heart diseases tend to have genes with longer transcripts, with smaller transcripts being present in pathways related to immune responses and neurodegenerative diseases. Based on our results, we hypothesize that longer genes tend to be associated with functions that are important in the early development stages, while smaller genes tend to play a role in functions that are important throughout the whole life, like the immune system, which requires fast responses. Frontiers Media S.A. 2021-02-11 /pmc/articles/PMC7905317/ /pubmed/33643374 http://dx.doi.org/10.3389/fgene.2021.559998 Text en Copyright © 2021 Lopes, Altab, Raina and de Magalhães. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Lopes, Inês
Altab, Gulam
Raina, Priyanka
de Magalhães, João Pedro
Gene Size Matters: An Analysis of Gene Length in the Human Genome
title Gene Size Matters: An Analysis of Gene Length in the Human Genome
title_full Gene Size Matters: An Analysis of Gene Length in the Human Genome
title_fullStr Gene Size Matters: An Analysis of Gene Length in the Human Genome
title_full_unstemmed Gene Size Matters: An Analysis of Gene Length in the Human Genome
title_short Gene Size Matters: An Analysis of Gene Length in the Human Genome
title_sort gene size matters: an analysis of gene length in the human genome
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7905317/
https://www.ncbi.nlm.nih.gov/pubmed/33643374
http://dx.doi.org/10.3389/fgene.2021.559998
work_keys_str_mv AT lopesines genesizemattersananalysisofgenelengthinthehumangenome
AT altabgulam genesizemattersananalysisofgenelengthinthehumangenome
AT rainapriyanka genesizemattersananalysisofgenelengthinthehumangenome
AT demagalhaesjoaopedro genesizemattersananalysisofgenelengthinthehumangenome