Cargando…
Large-scale discovery of novel neurodevelopmental disorder-related genes through a unified analysis of single-nucleotide and copy number variants
BACKGROUND: Previous large-scale studies of de novo variants identified a number of genes associated with neurodevelopmental disorders (NDDs); however, it was also predicted that many NDD-associated genes await discovery. Such genes can be discovered by integrating copy number variants (CNVs), which...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9040275/ https://www.ncbi.nlm.nih.gov/pubmed/35468861 http://dx.doi.org/10.1186/s13073-022-01042-w |
_version_ | 1784694302217076736 |
---|---|
author | Hamanaka, Kohei Miyake, Noriko Mizuguchi, Takeshi Miyatake, Satoko Uchiyama, Yuri Tsuchida, Naomi Sekiguchi, Futoshi Mitsuhashi, Satomi Tsurusaki, Yoshinori Nakashima, Mitsuko Saitsu, Hirotomo Yamada, Kohei Sakamoto, Masamune Fukuda, Hiromi Ohori, Sachiko Saida, Ken Itai, Toshiyuki Azuma, Yoshiteru Koshimizu, Eriko Fujita, Atsushi Erturk, Biray Hiraki, Yoko Ch’ng, Gaik-Siew Kato, Mitsuhiro Okamoto, Nobuhiko Takata, Atsushi Matsumoto, Naomichi |
author_facet | Hamanaka, Kohei Miyake, Noriko Mizuguchi, Takeshi Miyatake, Satoko Uchiyama, Yuri Tsuchida, Naomi Sekiguchi, Futoshi Mitsuhashi, Satomi Tsurusaki, Yoshinori Nakashima, Mitsuko Saitsu, Hirotomo Yamada, Kohei Sakamoto, Masamune Fukuda, Hiromi Ohori, Sachiko Saida, Ken Itai, Toshiyuki Azuma, Yoshiteru Koshimizu, Eriko Fujita, Atsushi Erturk, Biray Hiraki, Yoko Ch’ng, Gaik-Siew Kato, Mitsuhiro Okamoto, Nobuhiko Takata, Atsushi Matsumoto, Naomichi |
author_sort | Hamanaka, Kohei |
collection | PubMed |
description | BACKGROUND: Previous large-scale studies of de novo variants identified a number of genes associated with neurodevelopmental disorders (NDDs); however, it was also predicted that many NDD-associated genes await discovery. Such genes can be discovered by integrating copy number variants (CNVs), which have not been fully considered in previous studies, and increasing the sample size. METHODS: We first constructed a model estimating the rates of de novo CNVs per gene from several factors such as gene length and number of exons. Second, we compiled a comprehensive list of de novo single-nucleotide variants (SNVs) in 41,165 individuals and de novo CNVs in 3675 individuals with NDDs by aggregating our own and publicly available datasets, including denovo-db and the Deciphering Developmental Disorders study data. Third, summing up the de novo CNV rates that we estimated and SNV rates previously established, gene-based enrichment of de novo deleterious SNVs and CNVs were assessed in the 41,165 cases. Significantly enriched genes were further prioritized according to their similarity to known NDD genes using a deep learning model that considers functional characteristics (e.g., gene ontology and expression patterns). RESULTS: We identified a total of 380 genes achieving statistical significance (5% false discovery rate), including 31 genes affected by de novo CNVs. Of the 380 genes, 52 have not previously been reported as NDD genes, and the data of de novo CNVs contributed to the significance of three genes (GLTSCR1, MARK2, and UBR3). Among the 52 genes, we reasonably excluded 18 genes [a number almost identical to the theoretically expected false positives (i.e., 380 × 0.05 = 19)] given their constraints against deleterious variants and extracted 34 “plausible” candidate genes. Their validity as NDD genes was consistently supported by their similarity in function and gene expression patterns to known NDD genes. Quantifying the overall similarity using deep learning, we identified 11 high-confidence (> 90% true-positive probabilities) candidate genes: HDAC2, SUPT16H, HECTD4, CHD5, XPO1, GSK3B, NLGN2, ADGRB1, CTR9, BRD3, and MARK2. CONCLUSIONS: We identified dozens of new candidates for NDD genes. Both the methods and the resources developed here will contribute to the further identification of novel NDD-associated genes. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13073-022-01042-w. |
format | Online Article Text |
id | pubmed-9040275 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-90402752022-04-27 Large-scale discovery of novel neurodevelopmental disorder-related genes through a unified analysis of single-nucleotide and copy number variants Hamanaka, Kohei Miyake, Noriko Mizuguchi, Takeshi Miyatake, Satoko Uchiyama, Yuri Tsuchida, Naomi Sekiguchi, Futoshi Mitsuhashi, Satomi Tsurusaki, Yoshinori Nakashima, Mitsuko Saitsu, Hirotomo Yamada, Kohei Sakamoto, Masamune Fukuda, Hiromi Ohori, Sachiko Saida, Ken Itai, Toshiyuki Azuma, Yoshiteru Koshimizu, Eriko Fujita, Atsushi Erturk, Biray Hiraki, Yoko Ch’ng, Gaik-Siew Kato, Mitsuhiro Okamoto, Nobuhiko Takata, Atsushi Matsumoto, Naomichi Genome Med Research BACKGROUND: Previous large-scale studies of de novo variants identified a number of genes associated with neurodevelopmental disorders (NDDs); however, it was also predicted that many NDD-associated genes await discovery. Such genes can be discovered by integrating copy number variants (CNVs), which have not been fully considered in previous studies, and increasing the sample size. METHODS: We first constructed a model estimating the rates of de novo CNVs per gene from several factors such as gene length and number of exons. Second, we compiled a comprehensive list of de novo single-nucleotide variants (SNVs) in 41,165 individuals and de novo CNVs in 3675 individuals with NDDs by aggregating our own and publicly available datasets, including denovo-db and the Deciphering Developmental Disorders study data. Third, summing up the de novo CNV rates that we estimated and SNV rates previously established, gene-based enrichment of de novo deleterious SNVs and CNVs were assessed in the 41,165 cases. Significantly enriched genes were further prioritized according to their similarity to known NDD genes using a deep learning model that considers functional characteristics (e.g., gene ontology and expression patterns). RESULTS: We identified a total of 380 genes achieving statistical significance (5% false discovery rate), including 31 genes affected by de novo CNVs. Of the 380 genes, 52 have not previously been reported as NDD genes, and the data of de novo CNVs contributed to the significance of three genes (GLTSCR1, MARK2, and UBR3). Among the 52 genes, we reasonably excluded 18 genes [a number almost identical to the theoretically expected false positives (i.e., 380 × 0.05 = 19)] given their constraints against deleterious variants and extracted 34 “plausible” candidate genes. Their validity as NDD genes was consistently supported by their similarity in function and gene expression patterns to known NDD genes. Quantifying the overall similarity using deep learning, we identified 11 high-confidence (> 90% true-positive probabilities) candidate genes: HDAC2, SUPT16H, HECTD4, CHD5, XPO1, GSK3B, NLGN2, ADGRB1, CTR9, BRD3, and MARK2. CONCLUSIONS: We identified dozens of new candidates for NDD genes. Both the methods and the resources developed here will contribute to the further identification of novel NDD-associated genes. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13073-022-01042-w. BioMed Central 2022-04-26 /pmc/articles/PMC9040275/ /pubmed/35468861 http://dx.doi.org/10.1186/s13073-022-01042-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Hamanaka, Kohei Miyake, Noriko Mizuguchi, Takeshi Miyatake, Satoko Uchiyama, Yuri Tsuchida, Naomi Sekiguchi, Futoshi Mitsuhashi, Satomi Tsurusaki, Yoshinori Nakashima, Mitsuko Saitsu, Hirotomo Yamada, Kohei Sakamoto, Masamune Fukuda, Hiromi Ohori, Sachiko Saida, Ken Itai, Toshiyuki Azuma, Yoshiteru Koshimizu, Eriko Fujita, Atsushi Erturk, Biray Hiraki, Yoko Ch’ng, Gaik-Siew Kato, Mitsuhiro Okamoto, Nobuhiko Takata, Atsushi Matsumoto, Naomichi Large-scale discovery of novel neurodevelopmental disorder-related genes through a unified analysis of single-nucleotide and copy number variants |
title | Large-scale discovery of novel neurodevelopmental disorder-related genes through a unified analysis of single-nucleotide and copy number variants |
title_full | Large-scale discovery of novel neurodevelopmental disorder-related genes through a unified analysis of single-nucleotide and copy number variants |
title_fullStr | Large-scale discovery of novel neurodevelopmental disorder-related genes through a unified analysis of single-nucleotide and copy number variants |
title_full_unstemmed | Large-scale discovery of novel neurodevelopmental disorder-related genes through a unified analysis of single-nucleotide and copy number variants |
title_short | Large-scale discovery of novel neurodevelopmental disorder-related genes through a unified analysis of single-nucleotide and copy number variants |
title_sort | large-scale discovery of novel neurodevelopmental disorder-related genes through a unified analysis of single-nucleotide and copy number variants |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9040275/ https://www.ncbi.nlm.nih.gov/pubmed/35468861 http://dx.doi.org/10.1186/s13073-022-01042-w |
work_keys_str_mv | AT hamanakakohei largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT miyakenoriko largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT mizuguchitakeshi largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT miyatakesatoko largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT uchiyamayuri largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT tsuchidanaomi largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT sekiguchifutoshi largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT mitsuhashisatomi largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT tsurusakiyoshinori largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT nakashimamitsuko largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT saitsuhirotomo largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT yamadakohei largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT sakamotomasamune largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT fukudahiromi largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT ohorisachiko largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT saidaken largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT itaitoshiyuki largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT azumayoshiteru largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT koshimizueriko largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT fujitaatsushi largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT erturkbiray largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT hirakiyoko largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT chnggaiksiew largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT katomitsuhiro largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT okamotonobuhiko largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT takataatsushi largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants AT matsumotonaomichi largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants |