Cargando…
Mid-pass whole genome sequencing enables biomedical genetic studies of diverse populations
BACKGROUND: Historically, geneticists have relied on genotyping arrays and imputation to study human genetic variation. However, an underrepresentation of diverse populations has resulted in arrays that poorly capture global genetic variation, and a lack of reference panels. This has contributed to...
Autores principales: | , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8559369/ https://www.ncbi.nlm.nih.gov/pubmed/34719381 http://dx.doi.org/10.1186/s12864-021-07949-9 |
_version_ | 1784592746573463552 |
---|---|
author | Emde, Anne-Katrin Phipps-Green, Amanda Cadzow, Murray Gallagher, C. Scott Major, Tanya J. Merriman, Marilyn E. Topless, Ruth K. Takei, Riku Dalbeth, Nicola Murphy, Rinki Stamp, Lisa K. de Zoysa, Janak Wilcox, Philip L. Fox, Keolu Wasik, Kaja A. Merriman, Tony R. Castel, Stephane E. |
author_facet | Emde, Anne-Katrin Phipps-Green, Amanda Cadzow, Murray Gallagher, C. Scott Major, Tanya J. Merriman, Marilyn E. Topless, Ruth K. Takei, Riku Dalbeth, Nicola Murphy, Rinki Stamp, Lisa K. de Zoysa, Janak Wilcox, Philip L. Fox, Keolu Wasik, Kaja A. Merriman, Tony R. Castel, Stephane E. |
author_sort | Emde, Anne-Katrin |
collection | PubMed |
description | BACKGROUND: Historically, geneticists have relied on genotyping arrays and imputation to study human genetic variation. However, an underrepresentation of diverse populations has resulted in arrays that poorly capture global genetic variation, and a lack of reference panels. This has contributed to deepening global health disparities. Whole genome sequencing (WGS) better captures genetic variation but remains prohibitively expensive. Thus, we explored WGS at “mid-pass” 1-7x coverage. RESULTS: Here, we developed and benchmarked methods for mid-pass sequencing. When applied to a population without an existing genomic reference panel, 4x mid-pass performed consistently well across ethnicities, with high recall (98%) and precision (97.5%). CONCLUSION: Compared to array data imputed into 1000 Genomes, mid-pass performed better across all metrics and identified novel population-specific variants with potential disease relevance. We hope our work will reduce financial barriers for geneticists from underrepresented populations to characterize their genomes prior to biomedical genetic applications. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-021-07949-9. |
format | Online Article Text |
id | pubmed-8559369 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-85593692021-11-03 Mid-pass whole genome sequencing enables biomedical genetic studies of diverse populations Emde, Anne-Katrin Phipps-Green, Amanda Cadzow, Murray Gallagher, C. Scott Major, Tanya J. Merriman, Marilyn E. Topless, Ruth K. Takei, Riku Dalbeth, Nicola Murphy, Rinki Stamp, Lisa K. de Zoysa, Janak Wilcox, Philip L. Fox, Keolu Wasik, Kaja A. Merriman, Tony R. Castel, Stephane E. BMC Genomics Research Article BACKGROUND: Historically, geneticists have relied on genotyping arrays and imputation to study human genetic variation. However, an underrepresentation of diverse populations has resulted in arrays that poorly capture global genetic variation, and a lack of reference panels. This has contributed to deepening global health disparities. Whole genome sequencing (WGS) better captures genetic variation but remains prohibitively expensive. Thus, we explored WGS at “mid-pass” 1-7x coverage. RESULTS: Here, we developed and benchmarked methods for mid-pass sequencing. When applied to a population without an existing genomic reference panel, 4x mid-pass performed consistently well across ethnicities, with high recall (98%) and precision (97.5%). CONCLUSION: Compared to array data imputed into 1000 Genomes, mid-pass performed better across all metrics and identified novel population-specific variants with potential disease relevance. We hope our work will reduce financial barriers for geneticists from underrepresented populations to characterize their genomes prior to biomedical genetic applications. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-021-07949-9. BioMed Central 2021-11-01 /pmc/articles/PMC8559369/ /pubmed/34719381 http://dx.doi.org/10.1186/s12864-021-07949-9 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Emde, Anne-Katrin Phipps-Green, Amanda Cadzow, Murray Gallagher, C. Scott Major, Tanya J. Merriman, Marilyn E. Topless, Ruth K. Takei, Riku Dalbeth, Nicola Murphy, Rinki Stamp, Lisa K. de Zoysa, Janak Wilcox, Philip L. Fox, Keolu Wasik, Kaja A. Merriman, Tony R. Castel, Stephane E. Mid-pass whole genome sequencing enables biomedical genetic studies of diverse populations |
title | Mid-pass whole genome sequencing enables biomedical genetic studies of diverse populations |
title_full | Mid-pass whole genome sequencing enables biomedical genetic studies of diverse populations |
title_fullStr | Mid-pass whole genome sequencing enables biomedical genetic studies of diverse populations |
title_full_unstemmed | Mid-pass whole genome sequencing enables biomedical genetic studies of diverse populations |
title_short | Mid-pass whole genome sequencing enables biomedical genetic studies of diverse populations |
title_sort | mid-pass whole genome sequencing enables biomedical genetic studies of diverse populations |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8559369/ https://www.ncbi.nlm.nih.gov/pubmed/34719381 http://dx.doi.org/10.1186/s12864-021-07949-9 |
work_keys_str_mv | AT emdeannekatrin midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations AT phippsgreenamanda midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations AT cadzowmurray midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations AT gallaghercscott midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations AT majortanyaj midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations AT merrimanmarilyne midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations AT toplessruthk midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations AT takeiriku midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations AT dalbethnicola midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations AT murphyrinki midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations AT stamplisak midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations AT dezoysajanak midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations AT wilcoxphilipl midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations AT foxkeolu midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations AT wasikkajaa midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations AT merrimantonyr midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations AT castelstephanee midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations |