Cargando…

Long read sequencing characterises a novel structural variant, revealing underactive AKR1C1 with overactive AKR1C2 as a possible cause of severe chronic fatigue

BACKGROUND: Causative genetic variants cannot yet be found for many disorders with a clear heritable component, including chronic fatigue disorders like myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS). These conditions may involve genes in difficult-to-align genomic regions that are refr...

Descripción completa

Detalles Bibliográficos
Autores principales: Oakley, Julia, Hill, Martin, Giess, Adam, Tanguy, Mélanie, Elgar, Greg
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10655400/
https://www.ncbi.nlm.nih.gov/pubmed/37978513
http://dx.doi.org/10.1186/s12967-023-04711-5
_version_ 1785147939184705536
author Oakley, Julia
Hill, Martin
Giess, Adam
Tanguy, Mélanie
Elgar, Greg
author_facet Oakley, Julia
Hill, Martin
Giess, Adam
Tanguy, Mélanie
Elgar, Greg
author_sort Oakley, Julia
collection PubMed
description BACKGROUND: Causative genetic variants cannot yet be found for many disorders with a clear heritable component, including chronic fatigue disorders like myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS). These conditions may involve genes in difficult-to-align genomic regions that are refractory to short read approaches. Structural variants in these regions can be particularly hard to detect or define with short reads, yet may account for a significant number of cases. Long read sequencing can overcome these difficulties but so far little data is available regarding the specific analytical challenges inherent in such regions, which need to be taken into account to ensure that variants are correctly identified. Research into chronic fatigue disorders faces the additional challenge that the heterogeneous patient populations likely encompass multiple aetiologies with overlapping symptoms, rather than a single disease entity, such that each individual abnormality may lack statistical significance within a larger sample. Better delineation of patient subgroups is needed to target research and treatment. METHODS: We use nanopore sequencing in a case of unexplained severe fatigue to identify and fully characterise a large inversion in a highly homologous region spanning the AKR1C gene locus, which was indicated but could not be resolved by short-read sequencing. We then use GC–MS/MS serum steroid analysis to investigate the functional consequences. RESULTS: Several commonly used bioinformatics tools are confounded by the homology but a combined approach including visual inspection allows the variant to be accurately resolved. The DNA inversion appears to increase the expression of AKR1C2 while limiting AKR1C1 activity, resulting in a relative increase of inhibitory GABAergic neurosteroids and impaired progesterone metabolism which could suppress neuronal activity and interfere with cellular function in a wide range of tissues. CONCLUSIONS: This study provides an example of how long read sequencing can improve diagnostic yield in research and clinical care, and highlights some of the analytical challenges presented by regions containing tandem arrays of genes. It also proposes a novel gene associated with a novel disease aetiology that may be an underlying cause of complex chronic fatigue. It reveals biomarkers that could now be assessed in a larger cohort, potentially identifying a subset of patients who might respond to treatments suggested by the aetiology. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12967-023-04711-5.
format Online
Article
Text
id pubmed-10655400
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-106554002023-11-17 Long read sequencing characterises a novel structural variant, revealing underactive AKR1C1 with overactive AKR1C2 as a possible cause of severe chronic fatigue Oakley, Julia Hill, Martin Giess, Adam Tanguy, Mélanie Elgar, Greg J Transl Med Research BACKGROUND: Causative genetic variants cannot yet be found for many disorders with a clear heritable component, including chronic fatigue disorders like myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS). These conditions may involve genes in difficult-to-align genomic regions that are refractory to short read approaches. Structural variants in these regions can be particularly hard to detect or define with short reads, yet may account for a significant number of cases. Long read sequencing can overcome these difficulties but so far little data is available regarding the specific analytical challenges inherent in such regions, which need to be taken into account to ensure that variants are correctly identified. Research into chronic fatigue disorders faces the additional challenge that the heterogeneous patient populations likely encompass multiple aetiologies with overlapping symptoms, rather than a single disease entity, such that each individual abnormality may lack statistical significance within a larger sample. Better delineation of patient subgroups is needed to target research and treatment. METHODS: We use nanopore sequencing in a case of unexplained severe fatigue to identify and fully characterise a large inversion in a highly homologous region spanning the AKR1C gene locus, which was indicated but could not be resolved by short-read sequencing. We then use GC–MS/MS serum steroid analysis to investigate the functional consequences. RESULTS: Several commonly used bioinformatics tools are confounded by the homology but a combined approach including visual inspection allows the variant to be accurately resolved. The DNA inversion appears to increase the expression of AKR1C2 while limiting AKR1C1 activity, resulting in a relative increase of inhibitory GABAergic neurosteroids and impaired progesterone metabolism which could suppress neuronal activity and interfere with cellular function in a wide range of tissues. CONCLUSIONS: This study provides an example of how long read sequencing can improve diagnostic yield in research and clinical care, and highlights some of the analytical challenges presented by regions containing tandem arrays of genes. It also proposes a novel gene associated with a novel disease aetiology that may be an underlying cause of complex chronic fatigue. It reveals biomarkers that could now be assessed in a larger cohort, potentially identifying a subset of patients who might respond to treatments suggested by the aetiology. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12967-023-04711-5. BioMed Central 2023-11-17 /pmc/articles/PMC10655400/ /pubmed/37978513 http://dx.doi.org/10.1186/s12967-023-04711-5 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Oakley, Julia
Hill, Martin
Giess, Adam
Tanguy, Mélanie
Elgar, Greg
Long read sequencing characterises a novel structural variant, revealing underactive AKR1C1 with overactive AKR1C2 as a possible cause of severe chronic fatigue
title Long read sequencing characterises a novel structural variant, revealing underactive AKR1C1 with overactive AKR1C2 as a possible cause of severe chronic fatigue
title_full Long read sequencing characterises a novel structural variant, revealing underactive AKR1C1 with overactive AKR1C2 as a possible cause of severe chronic fatigue
title_fullStr Long read sequencing characterises a novel structural variant, revealing underactive AKR1C1 with overactive AKR1C2 as a possible cause of severe chronic fatigue
title_full_unstemmed Long read sequencing characterises a novel structural variant, revealing underactive AKR1C1 with overactive AKR1C2 as a possible cause of severe chronic fatigue
title_short Long read sequencing characterises a novel structural variant, revealing underactive AKR1C1 with overactive AKR1C2 as a possible cause of severe chronic fatigue
title_sort long read sequencing characterises a novel structural variant, revealing underactive akr1c1 with overactive akr1c2 as a possible cause of severe chronic fatigue
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10655400/
https://www.ncbi.nlm.nih.gov/pubmed/37978513
http://dx.doi.org/10.1186/s12967-023-04711-5
work_keys_str_mv AT oakleyjulia longreadsequencingcharacterisesanovelstructuralvariantrevealingunderactiveakr1c1withoveractiveakr1c2asapossiblecauseofseverechronicfatigue
AT hillmartin longreadsequencingcharacterisesanovelstructuralvariantrevealingunderactiveakr1c1withoveractiveakr1c2asapossiblecauseofseverechronicfatigue
AT giessadam longreadsequencingcharacterisesanovelstructuralvariantrevealingunderactiveakr1c1withoveractiveakr1c2asapossiblecauseofseverechronicfatigue
AT tanguymelanie longreadsequencingcharacterisesanovelstructuralvariantrevealingunderactiveakr1c1withoveractiveakr1c2asapossiblecauseofseverechronicfatigue
AT elgargreg longreadsequencingcharacterisesanovelstructuralvariantrevealingunderactiveakr1c1withoveractiveakr1c2asapossiblecauseofseverechronicfatigue