Cargando…
Genome-wide detection of human variants that disrupt intronic branchpoints
Pre-messenger RNA splicing is initiated with the recognition of a single-nucleotide intronic branchpoint (BP) within a BP motif by spliceosome elements. Forty-eight rare variants in 43 human genes have been reported to alter splicing and cause disease by disrupting BP. However, until now, no computa...
Autores principales: | , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
National Academy of Sciences
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9636908/ https://www.ncbi.nlm.nih.gov/pubmed/36306325 http://dx.doi.org/10.1073/pnas.2211194119 |
_version_ | 1784825058790735872 |
---|---|
author | Zhang, Peng Philippot, Quentin Ren, Weicheng Lei, Wei-Te Li, Juan Stenson, Peter D. Palacín, Pere Soler Colobran, Roger Boisson, Bertrand Zhang, Shen-Ying Puel, Anne Pan-Hammarström, Qiang Zhang, Qian Cooper, David N. Abel, Laurent Casanova, Jean-Laurent |
author_facet | Zhang, Peng Philippot, Quentin Ren, Weicheng Lei, Wei-Te Li, Juan Stenson, Peter D. Palacín, Pere Soler Colobran, Roger Boisson, Bertrand Zhang, Shen-Ying Puel, Anne Pan-Hammarström, Qiang Zhang, Qian Cooper, David N. Abel, Laurent Casanova, Jean-Laurent |
author_sort | Zhang, Peng |
collection | PubMed |
description | Pre-messenger RNA splicing is initiated with the recognition of a single-nucleotide intronic branchpoint (BP) within a BP motif by spliceosome elements. Forty-eight rare variants in 43 human genes have been reported to alter splicing and cause disease by disrupting BP. However, until now, no computational approach was available to efficiently detect such variants in massively parallel sequencing data. We established a comprehensive human genome-wide BP database by integrating existing BP data and generating new BP data from RNA sequencing of lariat debranching enzyme DBR1-mutated patients and from machine-learning predictions. We characterized multiple features of BP in major and minor introns and found that BP and BP-2 (two nucleotides upstream of BP) positions exhibit a lower rate of variation in human populations and higher evolutionary conservation than the intronic background, while being comparable to the exonic background. We developed BPHunter as a genome-wide computational approach to systematically and efficiently detect intronic variants that may disrupt BP recognition. BPHunter retrospectively identified 40 of the 48 known pathogenic BP variants, in which we summarized a strategy for prioritizing BP variant candidates. The remaining eight variants all create AG-dinucleotides between the BP and acceptor site, which is the likely reason for missplicing. We demonstrated the practical utility of BPHunter prospectively by using it to identify a novel germline heterozygous BP variant of STAT2 in a patient with critical COVID-19 pneumonia and a novel somatic intronic 59-nucleotide deletion of ITPKB in a lymphoma patient, both of which were validated experimentally. BPHunter is publicly available from https://hgidsoft.rockefeller.edu/BPHunter and https://github.com/casanova-lab/BPHunter. |
format | Online Article Text |
id | pubmed-9636908 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | National Academy of Sciences |
record_format | MEDLINE/PubMed |
spelling | pubmed-96369082022-11-06 Genome-wide detection of human variants that disrupt intronic branchpoints Zhang, Peng Philippot, Quentin Ren, Weicheng Lei, Wei-Te Li, Juan Stenson, Peter D. Palacín, Pere Soler Colobran, Roger Boisson, Bertrand Zhang, Shen-Ying Puel, Anne Pan-Hammarström, Qiang Zhang, Qian Cooper, David N. Abel, Laurent Casanova, Jean-Laurent Proc Natl Acad Sci U S A Biological Sciences Pre-messenger RNA splicing is initiated with the recognition of a single-nucleotide intronic branchpoint (BP) within a BP motif by spliceosome elements. Forty-eight rare variants in 43 human genes have been reported to alter splicing and cause disease by disrupting BP. However, until now, no computational approach was available to efficiently detect such variants in massively parallel sequencing data. We established a comprehensive human genome-wide BP database by integrating existing BP data and generating new BP data from RNA sequencing of lariat debranching enzyme DBR1-mutated patients and from machine-learning predictions. We characterized multiple features of BP in major and minor introns and found that BP and BP-2 (two nucleotides upstream of BP) positions exhibit a lower rate of variation in human populations and higher evolutionary conservation than the intronic background, while being comparable to the exonic background. We developed BPHunter as a genome-wide computational approach to systematically and efficiently detect intronic variants that may disrupt BP recognition. BPHunter retrospectively identified 40 of the 48 known pathogenic BP variants, in which we summarized a strategy for prioritizing BP variant candidates. The remaining eight variants all create AG-dinucleotides between the BP and acceptor site, which is the likely reason for missplicing. We demonstrated the practical utility of BPHunter prospectively by using it to identify a novel germline heterozygous BP variant of STAT2 in a patient with critical COVID-19 pneumonia and a novel somatic intronic 59-nucleotide deletion of ITPKB in a lymphoma patient, both of which were validated experimentally. BPHunter is publicly available from https://hgidsoft.rockefeller.edu/BPHunter and https://github.com/casanova-lab/BPHunter. National Academy of Sciences 2022-10-28 2022-11-01 /pmc/articles/PMC9636908/ /pubmed/36306325 http://dx.doi.org/10.1073/pnas.2211194119 Text en Copyright © 2022 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by/4.0/This open access article is distributed under Creative Commons Attribution License 4.0 (CC BY) (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Biological Sciences Zhang, Peng Philippot, Quentin Ren, Weicheng Lei, Wei-Te Li, Juan Stenson, Peter D. Palacín, Pere Soler Colobran, Roger Boisson, Bertrand Zhang, Shen-Ying Puel, Anne Pan-Hammarström, Qiang Zhang, Qian Cooper, David N. Abel, Laurent Casanova, Jean-Laurent Genome-wide detection of human variants that disrupt intronic branchpoints |
title | Genome-wide detection of human variants that disrupt intronic branchpoints |
title_full | Genome-wide detection of human variants that disrupt intronic branchpoints |
title_fullStr | Genome-wide detection of human variants that disrupt intronic branchpoints |
title_full_unstemmed | Genome-wide detection of human variants that disrupt intronic branchpoints |
title_short | Genome-wide detection of human variants that disrupt intronic branchpoints |
title_sort | genome-wide detection of human variants that disrupt intronic branchpoints |
topic | Biological Sciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9636908/ https://www.ncbi.nlm.nih.gov/pubmed/36306325 http://dx.doi.org/10.1073/pnas.2211194119 |
work_keys_str_mv | AT zhangpeng genomewidedetectionofhumanvariantsthatdisruptintronicbranchpoints AT philippotquentin genomewidedetectionofhumanvariantsthatdisruptintronicbranchpoints AT renweicheng genomewidedetectionofhumanvariantsthatdisruptintronicbranchpoints AT leiweite genomewidedetectionofhumanvariantsthatdisruptintronicbranchpoints AT lijuan genomewidedetectionofhumanvariantsthatdisruptintronicbranchpoints AT stensonpeterd genomewidedetectionofhumanvariantsthatdisruptintronicbranchpoints AT palacinperesoler genomewidedetectionofhumanvariantsthatdisruptintronicbranchpoints AT colobranroger genomewidedetectionofhumanvariantsthatdisruptintronicbranchpoints AT boissonbertrand genomewidedetectionofhumanvariantsthatdisruptintronicbranchpoints AT zhangshenying genomewidedetectionofhumanvariantsthatdisruptintronicbranchpoints AT puelanne genomewidedetectionofhumanvariantsthatdisruptintronicbranchpoints AT panhammarstromqiang genomewidedetectionofhumanvariantsthatdisruptintronicbranchpoints AT zhangqian genomewidedetectionofhumanvariantsthatdisruptintronicbranchpoints AT cooperdavidn genomewidedetectionofhumanvariantsthatdisruptintronicbranchpoints AT abellaurent genomewidedetectionofhumanvariantsthatdisruptintronicbranchpoints AT casanovajeanlaurent genomewidedetectionofhumanvariantsthatdisruptintronicbranchpoints |