Cargando…

Mid-pass whole genome sequencing enables biomedical genetic studies of diverse populations

BACKGROUND: Historically, geneticists have relied on genotyping arrays and imputation to study human genetic variation. However, an underrepresentation of diverse populations has resulted in arrays that poorly capture global genetic variation, and a lack of reference panels. This has contributed to...

Descripción completa

Detalles Bibliográficos
Autores principales: Emde, Anne-Katrin, Phipps-Green, Amanda, Cadzow, Murray, Gallagher, C. Scott, Major, Tanya J., Merriman, Marilyn E., Topless, Ruth K., Takei, Riku, Dalbeth, Nicola, Murphy, Rinki, Stamp, Lisa K., de Zoysa, Janak, Wilcox, Philip L., Fox, Keolu, Wasik, Kaja A., Merriman, Tony R., Castel, Stephane E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8559369/
https://www.ncbi.nlm.nih.gov/pubmed/34719381
http://dx.doi.org/10.1186/s12864-021-07949-9
_version_ 1784592746573463552
author Emde, Anne-Katrin
Phipps-Green, Amanda
Cadzow, Murray
Gallagher, C. Scott
Major, Tanya J.
Merriman, Marilyn E.
Topless, Ruth K.
Takei, Riku
Dalbeth, Nicola
Murphy, Rinki
Stamp, Lisa K.
de Zoysa, Janak
Wilcox, Philip L.
Fox, Keolu
Wasik, Kaja A.
Merriman, Tony R.
Castel, Stephane E.
author_facet Emde, Anne-Katrin
Phipps-Green, Amanda
Cadzow, Murray
Gallagher, C. Scott
Major, Tanya J.
Merriman, Marilyn E.
Topless, Ruth K.
Takei, Riku
Dalbeth, Nicola
Murphy, Rinki
Stamp, Lisa K.
de Zoysa, Janak
Wilcox, Philip L.
Fox, Keolu
Wasik, Kaja A.
Merriman, Tony R.
Castel, Stephane E.
author_sort Emde, Anne-Katrin
collection PubMed
description BACKGROUND: Historically, geneticists have relied on genotyping arrays and imputation to study human genetic variation. However, an underrepresentation of diverse populations has resulted in arrays that poorly capture global genetic variation, and a lack of reference panels. This has contributed to deepening global health disparities. Whole genome sequencing (WGS) better captures genetic variation but remains prohibitively expensive. Thus, we explored WGS at “mid-pass” 1-7x coverage. RESULTS: Here, we developed and benchmarked methods for mid-pass sequencing. When applied to a population without an existing genomic reference panel, 4x mid-pass performed consistently well across ethnicities, with high recall (98%) and precision (97.5%). CONCLUSION: Compared to array data imputed into 1000 Genomes, mid-pass performed better across all metrics and identified novel population-specific variants with potential disease relevance. We hope our work will reduce financial barriers for geneticists from underrepresented populations to characterize their genomes prior to biomedical genetic applications. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-021-07949-9.
format Online
Article
Text
id pubmed-8559369
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-85593692021-11-03 Mid-pass whole genome sequencing enables biomedical genetic studies of diverse populations Emde, Anne-Katrin Phipps-Green, Amanda Cadzow, Murray Gallagher, C. Scott Major, Tanya J. Merriman, Marilyn E. Topless, Ruth K. Takei, Riku Dalbeth, Nicola Murphy, Rinki Stamp, Lisa K. de Zoysa, Janak Wilcox, Philip L. Fox, Keolu Wasik, Kaja A. Merriman, Tony R. Castel, Stephane E. BMC Genomics Research Article BACKGROUND: Historically, geneticists have relied on genotyping arrays and imputation to study human genetic variation. However, an underrepresentation of diverse populations has resulted in arrays that poorly capture global genetic variation, and a lack of reference panels. This has contributed to deepening global health disparities. Whole genome sequencing (WGS) better captures genetic variation but remains prohibitively expensive. Thus, we explored WGS at “mid-pass” 1-7x coverage. RESULTS: Here, we developed and benchmarked methods for mid-pass sequencing. When applied to a population without an existing genomic reference panel, 4x mid-pass performed consistently well across ethnicities, with high recall (98%) and precision (97.5%). CONCLUSION: Compared to array data imputed into 1000 Genomes, mid-pass performed better across all metrics and identified novel population-specific variants with potential disease relevance. We hope our work will reduce financial barriers for geneticists from underrepresented populations to characterize their genomes prior to biomedical genetic applications. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-021-07949-9. BioMed Central 2021-11-01 /pmc/articles/PMC8559369/ /pubmed/34719381 http://dx.doi.org/10.1186/s12864-021-07949-9 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Emde, Anne-Katrin
Phipps-Green, Amanda
Cadzow, Murray
Gallagher, C. Scott
Major, Tanya J.
Merriman, Marilyn E.
Topless, Ruth K.
Takei, Riku
Dalbeth, Nicola
Murphy, Rinki
Stamp, Lisa K.
de Zoysa, Janak
Wilcox, Philip L.
Fox, Keolu
Wasik, Kaja A.
Merriman, Tony R.
Castel, Stephane E.
Mid-pass whole genome sequencing enables biomedical genetic studies of diverse populations
title Mid-pass whole genome sequencing enables biomedical genetic studies of diverse populations
title_full Mid-pass whole genome sequencing enables biomedical genetic studies of diverse populations
title_fullStr Mid-pass whole genome sequencing enables biomedical genetic studies of diverse populations
title_full_unstemmed Mid-pass whole genome sequencing enables biomedical genetic studies of diverse populations
title_short Mid-pass whole genome sequencing enables biomedical genetic studies of diverse populations
title_sort mid-pass whole genome sequencing enables biomedical genetic studies of diverse populations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8559369/
https://www.ncbi.nlm.nih.gov/pubmed/34719381
http://dx.doi.org/10.1186/s12864-021-07949-9
work_keys_str_mv AT emdeannekatrin midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations
AT phippsgreenamanda midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations
AT cadzowmurray midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations
AT gallaghercscott midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations
AT majortanyaj midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations
AT merrimanmarilyne midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations
AT toplessruthk midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations
AT takeiriku midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations
AT dalbethnicola midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations
AT murphyrinki midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations
AT stamplisak midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations
AT dezoysajanak midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations
AT wilcoxphilipl midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations
AT foxkeolu midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations
AT wasikkajaa midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations
AT merrimantonyr midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations
AT castelstephanee midpasswholegenomesequencingenablesbiomedicalgeneticstudiesofdiversepopulations