Cargando…
Transcriptome driven discovery of novel candidate genes for human neurological disorders in the telomer-to-telomer genome assembly era
BACKGROUND: With the first complete draft of a human genome, the Telomere-to-Telomere Consortium unlocked previously concealed genomic regions for genetic analyses. These regions harbour nearly 2000 potential novel genes with unknown function. In order to uncover candidate genes associated with huma...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10594789/ https://www.ncbi.nlm.nih.gov/pubmed/37872607 http://dx.doi.org/10.1186/s40246-023-00543-y |
_version_ | 1785124724769030144 |
---|---|
author | Falker-Gieske, Clemens |
author_facet | Falker-Gieske, Clemens |
author_sort | Falker-Gieske, Clemens |
collection | PubMed |
description | BACKGROUND: With the first complete draft of a human genome, the Telomere-to-Telomere Consortium unlocked previously concealed genomic regions for genetic analyses. These regions harbour nearly 2000 potential novel genes with unknown function. In order to uncover candidate genes associated with human neurological pathologies, a comparative transcriptome study using the T2T-CHM13 and the GRCh38 genome assemblies was conducted on previously published datasets for eight distinct human neurological disorders. RESULTS: The analysis of differential expression in RNA sequencing data led to the identification of 336 novel candidate genes linked to human neurological disorders. Additionally, it was revealed that, on average, 3.6% of the differentially expressed genes detected with the GRCh38 assembly may represent potential false positives. Among the noteworthy findings, two novel genes were discovered, one encoding a pore-structured protein and the other a highly ordered β-strand-rich protein. These genes exhibited upregulation in multiple epilepsy datasets and hold promise as candidate genes potentially modulating the progression of the disease. Furthermore, an analysis of RNA derived from white matter lesions in multiple sclerosis patients indicated significant upregulation of 26 rRNA encoding genes. Additionally, putative pathology related genes were identified for Alzheimer’s disease, amyotrophic lateral sclerosis, glioblastoma, glioma, and conditions resulting from the m.3242 A > G mtDNA mutation. CONCLUSION: The results presented here underline the potential of the T2T-CHM13 assembly in facilitating the discovery of candidate genes from transcriptome data in the context of human disorders. Moreover, the results demonstrate the value of remapping sequencing data to a superior genome assembly. Numerous potential pathology related genes, either as causative factors or related elements, have been unveiled, warranting further experimental validation. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40246-023-00543-y. |
format | Online Article Text |
id | pubmed-10594789 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-105947892023-10-25 Transcriptome driven discovery of novel candidate genes for human neurological disorders in the telomer-to-telomer genome assembly era Falker-Gieske, Clemens Hum Genomics Research BACKGROUND: With the first complete draft of a human genome, the Telomere-to-Telomere Consortium unlocked previously concealed genomic regions for genetic analyses. These regions harbour nearly 2000 potential novel genes with unknown function. In order to uncover candidate genes associated with human neurological pathologies, a comparative transcriptome study using the T2T-CHM13 and the GRCh38 genome assemblies was conducted on previously published datasets for eight distinct human neurological disorders. RESULTS: The analysis of differential expression in RNA sequencing data led to the identification of 336 novel candidate genes linked to human neurological disorders. Additionally, it was revealed that, on average, 3.6% of the differentially expressed genes detected with the GRCh38 assembly may represent potential false positives. Among the noteworthy findings, two novel genes were discovered, one encoding a pore-structured protein and the other a highly ordered β-strand-rich protein. These genes exhibited upregulation in multiple epilepsy datasets and hold promise as candidate genes potentially modulating the progression of the disease. Furthermore, an analysis of RNA derived from white matter lesions in multiple sclerosis patients indicated significant upregulation of 26 rRNA encoding genes. Additionally, putative pathology related genes were identified for Alzheimer’s disease, amyotrophic lateral sclerosis, glioblastoma, glioma, and conditions resulting from the m.3242 A > G mtDNA mutation. CONCLUSION: The results presented here underline the potential of the T2T-CHM13 assembly in facilitating the discovery of candidate genes from transcriptome data in the context of human disorders. Moreover, the results demonstrate the value of remapping sequencing data to a superior genome assembly. Numerous potential pathology related genes, either as causative factors or related elements, have been unveiled, warranting further experimental validation. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40246-023-00543-y. BioMed Central 2023-10-23 /pmc/articles/PMC10594789/ /pubmed/37872607 http://dx.doi.org/10.1186/s40246-023-00543-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Falker-Gieske, Clemens Transcriptome driven discovery of novel candidate genes for human neurological disorders in the telomer-to-telomer genome assembly era |
title | Transcriptome driven discovery of novel candidate genes for human neurological disorders in the telomer-to-telomer genome assembly era |
title_full | Transcriptome driven discovery of novel candidate genes for human neurological disorders in the telomer-to-telomer genome assembly era |
title_fullStr | Transcriptome driven discovery of novel candidate genes for human neurological disorders in the telomer-to-telomer genome assembly era |
title_full_unstemmed | Transcriptome driven discovery of novel candidate genes for human neurological disorders in the telomer-to-telomer genome assembly era |
title_short | Transcriptome driven discovery of novel candidate genes for human neurological disorders in the telomer-to-telomer genome assembly era |
title_sort | transcriptome driven discovery of novel candidate genes for human neurological disorders in the telomer-to-telomer genome assembly era |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10594789/ https://www.ncbi.nlm.nih.gov/pubmed/37872607 http://dx.doi.org/10.1186/s40246-023-00543-y |
work_keys_str_mv | AT falkergieskeclemens transcriptomedrivendiscoveryofnovelcandidategenesforhumanneurologicaldisordersinthetelomertotelomergenomeassemblyera |