Cargando…

Large-scale discovery of novel neurodevelopmental disorder-related genes through a unified analysis of single-nucleotide and copy number variants

BACKGROUND: Previous large-scale studies of de novo variants identified a number of genes associated with neurodevelopmental disorders (NDDs); however, it was also predicted that many NDD-associated genes await discovery. Such genes can be discovered by integrating copy number variants (CNVs), which...

Descripción completa

Detalles Bibliográficos
Autores principales: Hamanaka, Kohei, Miyake, Noriko, Mizuguchi, Takeshi, Miyatake, Satoko, Uchiyama, Yuri, Tsuchida, Naomi, Sekiguchi, Futoshi, Mitsuhashi, Satomi, Tsurusaki, Yoshinori, Nakashima, Mitsuko, Saitsu, Hirotomo, Yamada, Kohei, Sakamoto, Masamune, Fukuda, Hiromi, Ohori, Sachiko, Saida, Ken, Itai, Toshiyuki, Azuma, Yoshiteru, Koshimizu, Eriko, Fujita, Atsushi, Erturk, Biray, Hiraki, Yoko, Ch’ng, Gaik-Siew, Kato, Mitsuhiro, Okamoto, Nobuhiko, Takata, Atsushi, Matsumoto, Naomichi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9040275/
https://www.ncbi.nlm.nih.gov/pubmed/35468861
http://dx.doi.org/10.1186/s13073-022-01042-w
_version_ 1784694302217076736
author Hamanaka, Kohei
Miyake, Noriko
Mizuguchi, Takeshi
Miyatake, Satoko
Uchiyama, Yuri
Tsuchida, Naomi
Sekiguchi, Futoshi
Mitsuhashi, Satomi
Tsurusaki, Yoshinori
Nakashima, Mitsuko
Saitsu, Hirotomo
Yamada, Kohei
Sakamoto, Masamune
Fukuda, Hiromi
Ohori, Sachiko
Saida, Ken
Itai, Toshiyuki
Azuma, Yoshiteru
Koshimizu, Eriko
Fujita, Atsushi
Erturk, Biray
Hiraki, Yoko
Ch’ng, Gaik-Siew
Kato, Mitsuhiro
Okamoto, Nobuhiko
Takata, Atsushi
Matsumoto, Naomichi
author_facet Hamanaka, Kohei
Miyake, Noriko
Mizuguchi, Takeshi
Miyatake, Satoko
Uchiyama, Yuri
Tsuchida, Naomi
Sekiguchi, Futoshi
Mitsuhashi, Satomi
Tsurusaki, Yoshinori
Nakashima, Mitsuko
Saitsu, Hirotomo
Yamada, Kohei
Sakamoto, Masamune
Fukuda, Hiromi
Ohori, Sachiko
Saida, Ken
Itai, Toshiyuki
Azuma, Yoshiteru
Koshimizu, Eriko
Fujita, Atsushi
Erturk, Biray
Hiraki, Yoko
Ch’ng, Gaik-Siew
Kato, Mitsuhiro
Okamoto, Nobuhiko
Takata, Atsushi
Matsumoto, Naomichi
author_sort Hamanaka, Kohei
collection PubMed
description BACKGROUND: Previous large-scale studies of de novo variants identified a number of genes associated with neurodevelopmental disorders (NDDs); however, it was also predicted that many NDD-associated genes await discovery. Such genes can be discovered by integrating copy number variants (CNVs), which have not been fully considered in previous studies, and increasing the sample size. METHODS: We first constructed a model estimating the rates of de novo CNVs per gene from several factors such as gene length and number of exons. Second, we compiled a comprehensive list of de novo single-nucleotide variants (SNVs) in 41,165 individuals and de novo CNVs in 3675 individuals with NDDs by aggregating our own and publicly available datasets, including denovo-db and the Deciphering Developmental Disorders study data. Third, summing up the de novo CNV rates that we estimated and SNV rates previously established, gene-based enrichment of de novo deleterious SNVs and CNVs were assessed in the 41,165 cases. Significantly enriched genes were further prioritized according to their similarity to known NDD genes using a deep learning model that considers functional characteristics (e.g., gene ontology and expression patterns). RESULTS: We identified a total of 380 genes achieving statistical significance (5% false discovery rate), including 31 genes affected by de novo CNVs. Of the 380 genes, 52 have not previously been reported as NDD genes, and the data of de novo CNVs contributed to the significance of three genes (GLTSCR1, MARK2, and UBR3). Among the 52 genes, we reasonably excluded 18 genes [a number almost identical to the theoretically expected false positives (i.e., 380 × 0.05 = 19)] given their constraints against deleterious variants and extracted 34 “plausible” candidate genes. Their validity as NDD genes was consistently supported by their similarity in function and gene expression patterns to known NDD genes. Quantifying the overall similarity using deep learning, we identified 11 high-confidence (> 90% true-positive probabilities) candidate genes: HDAC2, SUPT16H, HECTD4, CHD5, XPO1, GSK3B, NLGN2, ADGRB1, CTR9, BRD3, and MARK2. CONCLUSIONS: We identified dozens of new candidates for NDD genes. Both the methods and the resources developed here will contribute to the further identification of novel NDD-associated genes. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13073-022-01042-w.
format Online
Article
Text
id pubmed-9040275
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-90402752022-04-27 Large-scale discovery of novel neurodevelopmental disorder-related genes through a unified analysis of single-nucleotide and copy number variants Hamanaka, Kohei Miyake, Noriko Mizuguchi, Takeshi Miyatake, Satoko Uchiyama, Yuri Tsuchida, Naomi Sekiguchi, Futoshi Mitsuhashi, Satomi Tsurusaki, Yoshinori Nakashima, Mitsuko Saitsu, Hirotomo Yamada, Kohei Sakamoto, Masamune Fukuda, Hiromi Ohori, Sachiko Saida, Ken Itai, Toshiyuki Azuma, Yoshiteru Koshimizu, Eriko Fujita, Atsushi Erturk, Biray Hiraki, Yoko Ch’ng, Gaik-Siew Kato, Mitsuhiro Okamoto, Nobuhiko Takata, Atsushi Matsumoto, Naomichi Genome Med Research BACKGROUND: Previous large-scale studies of de novo variants identified a number of genes associated with neurodevelopmental disorders (NDDs); however, it was also predicted that many NDD-associated genes await discovery. Such genes can be discovered by integrating copy number variants (CNVs), which have not been fully considered in previous studies, and increasing the sample size. METHODS: We first constructed a model estimating the rates of de novo CNVs per gene from several factors such as gene length and number of exons. Second, we compiled a comprehensive list of de novo single-nucleotide variants (SNVs) in 41,165 individuals and de novo CNVs in 3675 individuals with NDDs by aggregating our own and publicly available datasets, including denovo-db and the Deciphering Developmental Disorders study data. Third, summing up the de novo CNV rates that we estimated and SNV rates previously established, gene-based enrichment of de novo deleterious SNVs and CNVs were assessed in the 41,165 cases. Significantly enriched genes were further prioritized according to their similarity to known NDD genes using a deep learning model that considers functional characteristics (e.g., gene ontology and expression patterns). RESULTS: We identified a total of 380 genes achieving statistical significance (5% false discovery rate), including 31 genes affected by de novo CNVs. Of the 380 genes, 52 have not previously been reported as NDD genes, and the data of de novo CNVs contributed to the significance of three genes (GLTSCR1, MARK2, and UBR3). Among the 52 genes, we reasonably excluded 18 genes [a number almost identical to the theoretically expected false positives (i.e., 380 × 0.05 = 19)] given their constraints against deleterious variants and extracted 34 “plausible” candidate genes. Their validity as NDD genes was consistently supported by their similarity in function and gene expression patterns to known NDD genes. Quantifying the overall similarity using deep learning, we identified 11 high-confidence (> 90% true-positive probabilities) candidate genes: HDAC2, SUPT16H, HECTD4, CHD5, XPO1, GSK3B, NLGN2, ADGRB1, CTR9, BRD3, and MARK2. CONCLUSIONS: We identified dozens of new candidates for NDD genes. Both the methods and the resources developed here will contribute to the further identification of novel NDD-associated genes. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13073-022-01042-w. BioMed Central 2022-04-26 /pmc/articles/PMC9040275/ /pubmed/35468861 http://dx.doi.org/10.1186/s13073-022-01042-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Hamanaka, Kohei
Miyake, Noriko
Mizuguchi, Takeshi
Miyatake, Satoko
Uchiyama, Yuri
Tsuchida, Naomi
Sekiguchi, Futoshi
Mitsuhashi, Satomi
Tsurusaki, Yoshinori
Nakashima, Mitsuko
Saitsu, Hirotomo
Yamada, Kohei
Sakamoto, Masamune
Fukuda, Hiromi
Ohori, Sachiko
Saida, Ken
Itai, Toshiyuki
Azuma, Yoshiteru
Koshimizu, Eriko
Fujita, Atsushi
Erturk, Biray
Hiraki, Yoko
Ch’ng, Gaik-Siew
Kato, Mitsuhiro
Okamoto, Nobuhiko
Takata, Atsushi
Matsumoto, Naomichi
Large-scale discovery of novel neurodevelopmental disorder-related genes through a unified analysis of single-nucleotide and copy number variants
title Large-scale discovery of novel neurodevelopmental disorder-related genes through a unified analysis of single-nucleotide and copy number variants
title_full Large-scale discovery of novel neurodevelopmental disorder-related genes through a unified analysis of single-nucleotide and copy number variants
title_fullStr Large-scale discovery of novel neurodevelopmental disorder-related genes through a unified analysis of single-nucleotide and copy number variants
title_full_unstemmed Large-scale discovery of novel neurodevelopmental disorder-related genes through a unified analysis of single-nucleotide and copy number variants
title_short Large-scale discovery of novel neurodevelopmental disorder-related genes through a unified analysis of single-nucleotide and copy number variants
title_sort large-scale discovery of novel neurodevelopmental disorder-related genes through a unified analysis of single-nucleotide and copy number variants
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9040275/
https://www.ncbi.nlm.nih.gov/pubmed/35468861
http://dx.doi.org/10.1186/s13073-022-01042-w
work_keys_str_mv AT hamanakakohei largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT miyakenoriko largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT mizuguchitakeshi largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT miyatakesatoko largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT uchiyamayuri largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT tsuchidanaomi largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT sekiguchifutoshi largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT mitsuhashisatomi largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT tsurusakiyoshinori largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT nakashimamitsuko largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT saitsuhirotomo largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT yamadakohei largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT sakamotomasamune largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT fukudahiromi largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT ohorisachiko largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT saidaken largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT itaitoshiyuki largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT azumayoshiteru largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT koshimizueriko largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT fujitaatsushi largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT erturkbiray largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT hirakiyoko largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT chnggaiksiew largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT katomitsuhiro largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT okamotonobuhiko largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT takataatsushi largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants
AT matsumotonaomichi largescalediscoveryofnovelneurodevelopmentaldisorderrelatedgenesthroughaunifiedanalysisofsinglenucleotideandcopynumbervariants