Cargando…

Detection of trait-associated structural variations using short-read sequencing

Genomic structural variation (SV) affects genetic and phenotypic characteristics in diverse organisms, but the lack of reliable methods to detect SV has hindered genetic analysis. We developed a computational algorithm (MOPline) that includes missing call recovery combined with high-confidence SV ca...

Descripción completa

Detalles Bibliográficos
Autores principales: Kosugi, Shunichi, Kamatani, Yoichiro, Harada, Katsutoshi, Tomizuka, Kohei, Momozawa, Yukihide, Morisaki, Takayuki, Terao, Chikashi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10300613/
https://www.ncbi.nlm.nih.gov/pubmed/37388916
http://dx.doi.org/10.1016/j.xgen.2023.100328
_version_ 1785064625358766080
author Kosugi, Shunichi
Kamatani, Yoichiro
Harada, Katsutoshi
Tomizuka, Kohei
Momozawa, Yukihide
Morisaki, Takayuki
Terao, Chikashi
author_facet Kosugi, Shunichi
Kamatani, Yoichiro
Harada, Katsutoshi
Tomizuka, Kohei
Momozawa, Yukihide
Morisaki, Takayuki
Terao, Chikashi
author_sort Kosugi, Shunichi
collection PubMed
description Genomic structural variation (SV) affects genetic and phenotypic characteristics in diverse organisms, but the lack of reliable methods to detect SV has hindered genetic analysis. We developed a computational algorithm (MOPline) that includes missing call recovery combined with high-confidence SV call selection and genotyping using short-read whole-genome sequencing (WGS) data. Using 3,672 high-coverage WGS datasets, MOPline stably detected ∼16,000 SVs per individual, which is over ∼1.7–3.3-fold higher than previous large-scale projects while exhibiting a comparable level of statistical quality metrics. We imputed SVs from 181,622 Japanese individuals for 42 diseases and 60 quantitative traits. A genome-wide association study with the imputed SVs revealed 41 top-ranked or nearly top-ranked genome-wide significant SVs, including 8 exonic SVs with 5 novel associations and enriched mobile element insertions. This study demonstrates that short-read WGS data can be used to identify rare and common SVs associated with a variety of traits.
format Online
Article
Text
id pubmed-10300613
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-103006132023-06-29 Detection of trait-associated structural variations using short-read sequencing Kosugi, Shunichi Kamatani, Yoichiro Harada, Katsutoshi Tomizuka, Kohei Momozawa, Yukihide Morisaki, Takayuki Terao, Chikashi Cell Genom Technology Genomic structural variation (SV) affects genetic and phenotypic characteristics in diverse organisms, but the lack of reliable methods to detect SV has hindered genetic analysis. We developed a computational algorithm (MOPline) that includes missing call recovery combined with high-confidence SV call selection and genotyping using short-read whole-genome sequencing (WGS) data. Using 3,672 high-coverage WGS datasets, MOPline stably detected ∼16,000 SVs per individual, which is over ∼1.7–3.3-fold higher than previous large-scale projects while exhibiting a comparable level of statistical quality metrics. We imputed SVs from 181,622 Japanese individuals for 42 diseases and 60 quantitative traits. A genome-wide association study with the imputed SVs revealed 41 top-ranked or nearly top-ranked genome-wide significant SVs, including 8 exonic SVs with 5 novel associations and enriched mobile element insertions. This study demonstrates that short-read WGS data can be used to identify rare and common SVs associated with a variety of traits. Elsevier 2023-05-18 /pmc/articles/PMC10300613/ /pubmed/37388916 http://dx.doi.org/10.1016/j.xgen.2023.100328 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Technology
Kosugi, Shunichi
Kamatani, Yoichiro
Harada, Katsutoshi
Tomizuka, Kohei
Momozawa, Yukihide
Morisaki, Takayuki
Terao, Chikashi
Detection of trait-associated structural variations using short-read sequencing
title Detection of trait-associated structural variations using short-read sequencing
title_full Detection of trait-associated structural variations using short-read sequencing
title_fullStr Detection of trait-associated structural variations using short-read sequencing
title_full_unstemmed Detection of trait-associated structural variations using short-read sequencing
title_short Detection of trait-associated structural variations using short-read sequencing
title_sort detection of trait-associated structural variations using short-read sequencing
topic Technology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10300613/
https://www.ncbi.nlm.nih.gov/pubmed/37388916
http://dx.doi.org/10.1016/j.xgen.2023.100328
work_keys_str_mv AT kosugishunichi detectionoftraitassociatedstructuralvariationsusingshortreadsequencing
AT kamataniyoichiro detectionoftraitassociatedstructuralvariationsusingshortreadsequencing
AT haradakatsutoshi detectionoftraitassociatedstructuralvariationsusingshortreadsequencing
AT tomizukakohei detectionoftraitassociatedstructuralvariationsusingshortreadsequencing
AT momozawayukihide detectionoftraitassociatedstructuralvariationsusingshortreadsequencing
AT morisakitakayuki detectionoftraitassociatedstructuralvariationsusingshortreadsequencing
AT detectionoftraitassociatedstructuralvariationsusingshortreadsequencing
AT teraochikashi detectionoftraitassociatedstructuralvariationsusingshortreadsequencing