Cargando…

A robust pipeline for ranking carrier frequencies of autosomal recessive and X-linked Mendelian disorders

Single gene disorders are individually rare but collectively common leading causes of neonatal and pediatric morbidity and mortality. Both parents or the mothers of affected individuals with autosomal recessive or X-linked recessive diseases, respectively, are carrier(s). Carrier frequencies of rece...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Wenjuan, Wang, Chen, Mullapudi, Nandita, Cao, Yanan, Li, Lin, Lo, Ivan Fai Man, Tsui, Stephen Kwok-Wing, Chen, Xiao, Lei, Yong, Gu, Shen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9763236/
https://www.ncbi.nlm.nih.gov/pubmed/36535936
http://dx.doi.org/10.1038/s41525-022-00344-7
_version_ 1784853010805948416
author Zhu, Wenjuan
Wang, Chen
Mullapudi, Nandita
Cao, Yanan
Li, Lin
Lo, Ivan Fai Man
Tsui, Stephen Kwok-Wing
Chen, Xiao
Lei, Yong
Gu, Shen
author_facet Zhu, Wenjuan
Wang, Chen
Mullapudi, Nandita
Cao, Yanan
Li, Lin
Lo, Ivan Fai Man
Tsui, Stephen Kwok-Wing
Chen, Xiao
Lei, Yong
Gu, Shen
author_sort Zhu, Wenjuan
collection PubMed
description Single gene disorders are individually rare but collectively common leading causes of neonatal and pediatric morbidity and mortality. Both parents or the mothers of affected individuals with autosomal recessive or X-linked recessive diseases, respectively, are carrier(s). Carrier frequencies of recessive diseases can vary drastically among different ethnicities. This study established a robust pipeline for estimating and ranking carrier frequencies of all known 2699 recessive genes based on genome-wide sequencing data in healthy individuals. The discovery gnomAD cohort contained sequencing data on 76,156 genomes and 125,748 exomes from individuals with seven ethnicity backgrounds. The three validation cohorts composed of the SG10K Project with 4810 genomes on East Asian and South Asian, the ChinaMAP project with 10,588 Chinese genomes, and the WBBC pilot project with 4480 Chinese genomes. Within each cohort, comprehensive selection criteria for various kinds of deleterious variants were instituted, including known pathogenic variants (Type 1), presumably loss-of-function changes (Type 2), predicted deleterious missense variants (Type 3), and potentially harmful in-frame INDELs (Type 4). Subsequently, carrier frequencies of the 2699 genes were calculated and ranked based on ethnicity-specific carrier rates of Type 1 to Type 4 variants. Comparison of results from different cohorts with similar ethnicity background exhibited high degree of correlation, particularly between the ChinaMAP and the WBBC cohorts (Pearson correlation coefficient R = 0.92), confirming the validity of our variant selection criteria and the overall analysis pipeline.
format Online
Article
Text
id pubmed-9763236
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-97632362022-12-21 A robust pipeline for ranking carrier frequencies of autosomal recessive and X-linked Mendelian disorders Zhu, Wenjuan Wang, Chen Mullapudi, Nandita Cao, Yanan Li, Lin Lo, Ivan Fai Man Tsui, Stephen Kwok-Wing Chen, Xiao Lei, Yong Gu, Shen NPJ Genom Med Article Single gene disorders are individually rare but collectively common leading causes of neonatal and pediatric morbidity and mortality. Both parents or the mothers of affected individuals with autosomal recessive or X-linked recessive diseases, respectively, are carrier(s). Carrier frequencies of recessive diseases can vary drastically among different ethnicities. This study established a robust pipeline for estimating and ranking carrier frequencies of all known 2699 recessive genes based on genome-wide sequencing data in healthy individuals. The discovery gnomAD cohort contained sequencing data on 76,156 genomes and 125,748 exomes from individuals with seven ethnicity backgrounds. The three validation cohorts composed of the SG10K Project with 4810 genomes on East Asian and South Asian, the ChinaMAP project with 10,588 Chinese genomes, and the WBBC pilot project with 4480 Chinese genomes. Within each cohort, comprehensive selection criteria for various kinds of deleterious variants were instituted, including known pathogenic variants (Type 1), presumably loss-of-function changes (Type 2), predicted deleterious missense variants (Type 3), and potentially harmful in-frame INDELs (Type 4). Subsequently, carrier frequencies of the 2699 genes were calculated and ranked based on ethnicity-specific carrier rates of Type 1 to Type 4 variants. Comparison of results from different cohorts with similar ethnicity background exhibited high degree of correlation, particularly between the ChinaMAP and the WBBC cohorts (Pearson correlation coefficient R = 0.92), confirming the validity of our variant selection criteria and the overall analysis pipeline. Nature Publishing Group UK 2022-12-19 /pmc/articles/PMC9763236/ /pubmed/36535936 http://dx.doi.org/10.1038/s41525-022-00344-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Zhu, Wenjuan
Wang, Chen
Mullapudi, Nandita
Cao, Yanan
Li, Lin
Lo, Ivan Fai Man
Tsui, Stephen Kwok-Wing
Chen, Xiao
Lei, Yong
Gu, Shen
A robust pipeline for ranking carrier frequencies of autosomal recessive and X-linked Mendelian disorders
title A robust pipeline for ranking carrier frequencies of autosomal recessive and X-linked Mendelian disorders
title_full A robust pipeline for ranking carrier frequencies of autosomal recessive and X-linked Mendelian disorders
title_fullStr A robust pipeline for ranking carrier frequencies of autosomal recessive and X-linked Mendelian disorders
title_full_unstemmed A robust pipeline for ranking carrier frequencies of autosomal recessive and X-linked Mendelian disorders
title_short A robust pipeline for ranking carrier frequencies of autosomal recessive and X-linked Mendelian disorders
title_sort robust pipeline for ranking carrier frequencies of autosomal recessive and x-linked mendelian disorders
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9763236/
https://www.ncbi.nlm.nih.gov/pubmed/36535936
http://dx.doi.org/10.1038/s41525-022-00344-7
work_keys_str_mv AT zhuwenjuan arobustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders
AT wangchen arobustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders
AT mullapudinandita arobustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders
AT caoyanan arobustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders
AT lilin arobustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders
AT loivanfaiman arobustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders
AT tsuistephenkwokwing arobustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders
AT chenxiao arobustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders
AT leiyong arobustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders
AT gushen arobustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders
AT zhuwenjuan robustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders
AT wangchen robustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders
AT mullapudinandita robustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders
AT caoyanan robustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders
AT lilin robustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders
AT loivanfaiman robustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders
AT tsuistephenkwokwing robustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders
AT chenxiao robustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders
AT leiyong robustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders
AT gushen robustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders