Cargando…
A robust pipeline for ranking carrier frequencies of autosomal recessive and X-linked Mendelian disorders
Single gene disorders are individually rare but collectively common leading causes of neonatal and pediatric morbidity and mortality. Both parents or the mothers of affected individuals with autosomal recessive or X-linked recessive diseases, respectively, are carrier(s). Carrier frequencies of rece...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9763236/ https://www.ncbi.nlm.nih.gov/pubmed/36535936 http://dx.doi.org/10.1038/s41525-022-00344-7 |
_version_ | 1784853010805948416 |
---|---|
author | Zhu, Wenjuan Wang, Chen Mullapudi, Nandita Cao, Yanan Li, Lin Lo, Ivan Fai Man Tsui, Stephen Kwok-Wing Chen, Xiao Lei, Yong Gu, Shen |
author_facet | Zhu, Wenjuan Wang, Chen Mullapudi, Nandita Cao, Yanan Li, Lin Lo, Ivan Fai Man Tsui, Stephen Kwok-Wing Chen, Xiao Lei, Yong Gu, Shen |
author_sort | Zhu, Wenjuan |
collection | PubMed |
description | Single gene disorders are individually rare but collectively common leading causes of neonatal and pediatric morbidity and mortality. Both parents or the mothers of affected individuals with autosomal recessive or X-linked recessive diseases, respectively, are carrier(s). Carrier frequencies of recessive diseases can vary drastically among different ethnicities. This study established a robust pipeline for estimating and ranking carrier frequencies of all known 2699 recessive genes based on genome-wide sequencing data in healthy individuals. The discovery gnomAD cohort contained sequencing data on 76,156 genomes and 125,748 exomes from individuals with seven ethnicity backgrounds. The three validation cohorts composed of the SG10K Project with 4810 genomes on East Asian and South Asian, the ChinaMAP project with 10,588 Chinese genomes, and the WBBC pilot project with 4480 Chinese genomes. Within each cohort, comprehensive selection criteria for various kinds of deleterious variants were instituted, including known pathogenic variants (Type 1), presumably loss-of-function changes (Type 2), predicted deleterious missense variants (Type 3), and potentially harmful in-frame INDELs (Type 4). Subsequently, carrier frequencies of the 2699 genes were calculated and ranked based on ethnicity-specific carrier rates of Type 1 to Type 4 variants. Comparison of results from different cohorts with similar ethnicity background exhibited high degree of correlation, particularly between the ChinaMAP and the WBBC cohorts (Pearson correlation coefficient R = 0.92), confirming the validity of our variant selection criteria and the overall analysis pipeline. |
format | Online Article Text |
id | pubmed-9763236 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-97632362022-12-21 A robust pipeline for ranking carrier frequencies of autosomal recessive and X-linked Mendelian disorders Zhu, Wenjuan Wang, Chen Mullapudi, Nandita Cao, Yanan Li, Lin Lo, Ivan Fai Man Tsui, Stephen Kwok-Wing Chen, Xiao Lei, Yong Gu, Shen NPJ Genom Med Article Single gene disorders are individually rare but collectively common leading causes of neonatal and pediatric morbidity and mortality. Both parents or the mothers of affected individuals with autosomal recessive or X-linked recessive diseases, respectively, are carrier(s). Carrier frequencies of recessive diseases can vary drastically among different ethnicities. This study established a robust pipeline for estimating and ranking carrier frequencies of all known 2699 recessive genes based on genome-wide sequencing data in healthy individuals. The discovery gnomAD cohort contained sequencing data on 76,156 genomes and 125,748 exomes from individuals with seven ethnicity backgrounds. The three validation cohorts composed of the SG10K Project with 4810 genomes on East Asian and South Asian, the ChinaMAP project with 10,588 Chinese genomes, and the WBBC pilot project with 4480 Chinese genomes. Within each cohort, comprehensive selection criteria for various kinds of deleterious variants were instituted, including known pathogenic variants (Type 1), presumably loss-of-function changes (Type 2), predicted deleterious missense variants (Type 3), and potentially harmful in-frame INDELs (Type 4). Subsequently, carrier frequencies of the 2699 genes were calculated and ranked based on ethnicity-specific carrier rates of Type 1 to Type 4 variants. Comparison of results from different cohorts with similar ethnicity background exhibited high degree of correlation, particularly between the ChinaMAP and the WBBC cohorts (Pearson correlation coefficient R = 0.92), confirming the validity of our variant selection criteria and the overall analysis pipeline. Nature Publishing Group UK 2022-12-19 /pmc/articles/PMC9763236/ /pubmed/36535936 http://dx.doi.org/10.1038/s41525-022-00344-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Zhu, Wenjuan Wang, Chen Mullapudi, Nandita Cao, Yanan Li, Lin Lo, Ivan Fai Man Tsui, Stephen Kwok-Wing Chen, Xiao Lei, Yong Gu, Shen A robust pipeline for ranking carrier frequencies of autosomal recessive and X-linked Mendelian disorders |
title | A robust pipeline for ranking carrier frequencies of autosomal recessive and X-linked Mendelian disorders |
title_full | A robust pipeline for ranking carrier frequencies of autosomal recessive and X-linked Mendelian disorders |
title_fullStr | A robust pipeline for ranking carrier frequencies of autosomal recessive and X-linked Mendelian disorders |
title_full_unstemmed | A robust pipeline for ranking carrier frequencies of autosomal recessive and X-linked Mendelian disorders |
title_short | A robust pipeline for ranking carrier frequencies of autosomal recessive and X-linked Mendelian disorders |
title_sort | robust pipeline for ranking carrier frequencies of autosomal recessive and x-linked mendelian disorders |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9763236/ https://www.ncbi.nlm.nih.gov/pubmed/36535936 http://dx.doi.org/10.1038/s41525-022-00344-7 |
work_keys_str_mv | AT zhuwenjuan arobustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders AT wangchen arobustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders AT mullapudinandita arobustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders AT caoyanan arobustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders AT lilin arobustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders AT loivanfaiman arobustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders AT tsuistephenkwokwing arobustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders AT chenxiao arobustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders AT leiyong arobustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders AT gushen arobustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders AT zhuwenjuan robustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders AT wangchen robustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders AT mullapudinandita robustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders AT caoyanan robustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders AT lilin robustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders AT loivanfaiman robustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders AT tsuistephenkwokwing robustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders AT chenxiao robustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders AT leiyong robustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders AT gushen robustpipelineforrankingcarrierfrequenciesofautosomalrecessiveandxlinkedmendeliandisorders |