Cargando…

Mid-pass whole genome sequencing enables biomedical genetic studies of diverse populations

BACKGROUND: Historically, geneticists have relied on genotyping arrays and imputation to study human genetic variation. However, an underrepresentation of diverse populations has resulted in arrays that poorly capture global genetic variation, and a lack of reference panels. This has contributed to...

Descripción completa

Detalles Bibliográficos
Autores principales: Emde, Anne-Katrin, Phipps-Green, Amanda, Cadzow, Murray, Gallagher, C. Scott, Major, Tanya J., Merriman, Marilyn E., Topless, Ruth K., Takei, Riku, Dalbeth, Nicola, Murphy, Rinki, Stamp, Lisa K., de Zoysa, Janak, Wilcox, Philip L., Fox, Keolu, Wasik, Kaja A., Merriman, Tony R., Castel, Stephane E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8559369/
https://www.ncbi.nlm.nih.gov/pubmed/34719381
http://dx.doi.org/10.1186/s12864-021-07949-9
Descripción
Sumario:BACKGROUND: Historically, geneticists have relied on genotyping arrays and imputation to study human genetic variation. However, an underrepresentation of diverse populations has resulted in arrays that poorly capture global genetic variation, and a lack of reference panels. This has contributed to deepening global health disparities. Whole genome sequencing (WGS) better captures genetic variation but remains prohibitively expensive. Thus, we explored WGS at “mid-pass” 1-7x coverage. RESULTS: Here, we developed and benchmarked methods for mid-pass sequencing. When applied to a population without an existing genomic reference panel, 4x mid-pass performed consistently well across ethnicities, with high recall (98%) and precision (97.5%). CONCLUSION: Compared to array data imputed into 1000 Genomes, mid-pass performed better across all metrics and identified novel population-specific variants with potential disease relevance. We hope our work will reduce financial barriers for geneticists from underrepresented populations to characterize their genomes prior to biomedical genetic applications. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-021-07949-9.