Cargando…

SMCKAT, a Sequential Multi-Dimensional CNV Kernel-Based Association Test

Copy number variants (CNVs) are the most common form of structural genetic variation, reflecting the gain or loss of DNA segments compared with a reference genome. Studies have identified CNV association with different diseases. However, the association between the sequential order of CNVs and disea...

Descripción completa

Detalles Bibliográficos
Autores principales: Maus Esfahani, Nastaran, Catchpoole, Daniel, Kennedy, Paul J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8709152/
https://www.ncbi.nlm.nih.gov/pubmed/34947833
http://dx.doi.org/10.3390/life11121302
_version_ 1784622864304963584
author Maus Esfahani, Nastaran
Catchpoole, Daniel
Kennedy, Paul J.
author_facet Maus Esfahani, Nastaran
Catchpoole, Daniel
Kennedy, Paul J.
author_sort Maus Esfahani, Nastaran
collection PubMed
description Copy number variants (CNVs) are the most common form of structural genetic variation, reflecting the gain or loss of DNA segments compared with a reference genome. Studies have identified CNV association with different diseases. However, the association between the sequential order of CNVs and disease-related traits has not been studied, to our knowledge, and it is still unclear that CNVs function individually or whether they work in coordination with other CNVs to manifest a disease or trait. Consequently, we propose the first such method to test the association between the sequential order of CNVs and diseases. Our sequential multi-dimensional CNV kernel-based association test (SMCKAT) consists of three parts: (1) a single CNV group kernel measuring the similarity between two groups of CNVs; (2) a whole genome group kernel that aggregates several single group kernels to summarize the similarity between CNV groups in a single chromosome or the whole genome; and (3) an association test between the CNV sequential order and disease-related traits using a random effect model. We evaluate SMCKAT on CNV data sets exhibiting rare or common CNVs, demonstrating that it can detect specific biologically relevant chromosomal regions supported by the biomedical literature. We compare the performance of SMCKAT with MCKAT, a multi-dimensional kernel association test. Based on the results, SMCKAT can detect more specific chromosomal regions compared with MCKAT that not only have CNV characteristics, but the CNV order on them are significantly associated with the disease-related trait.
format Online
Article
Text
id pubmed-8709152
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-87091522021-12-25 SMCKAT, a Sequential Multi-Dimensional CNV Kernel-Based Association Test Maus Esfahani, Nastaran Catchpoole, Daniel Kennedy, Paul J. Life (Basel) Article Copy number variants (CNVs) are the most common form of structural genetic variation, reflecting the gain or loss of DNA segments compared with a reference genome. Studies have identified CNV association with different diseases. However, the association between the sequential order of CNVs and disease-related traits has not been studied, to our knowledge, and it is still unclear that CNVs function individually or whether they work in coordination with other CNVs to manifest a disease or trait. Consequently, we propose the first such method to test the association between the sequential order of CNVs and diseases. Our sequential multi-dimensional CNV kernel-based association test (SMCKAT) consists of three parts: (1) a single CNV group kernel measuring the similarity between two groups of CNVs; (2) a whole genome group kernel that aggregates several single group kernels to summarize the similarity between CNV groups in a single chromosome or the whole genome; and (3) an association test between the CNV sequential order and disease-related traits using a random effect model. We evaluate SMCKAT on CNV data sets exhibiting rare or common CNVs, demonstrating that it can detect specific biologically relevant chromosomal regions supported by the biomedical literature. We compare the performance of SMCKAT with MCKAT, a multi-dimensional kernel association test. Based on the results, SMCKAT can detect more specific chromosomal regions compared with MCKAT that not only have CNV characteristics, but the CNV order on them are significantly associated with the disease-related trait. MDPI 2021-11-26 /pmc/articles/PMC8709152/ /pubmed/34947833 http://dx.doi.org/10.3390/life11121302 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Maus Esfahani, Nastaran
Catchpoole, Daniel
Kennedy, Paul J.
SMCKAT, a Sequential Multi-Dimensional CNV Kernel-Based Association Test
title SMCKAT, a Sequential Multi-Dimensional CNV Kernel-Based Association Test
title_full SMCKAT, a Sequential Multi-Dimensional CNV Kernel-Based Association Test
title_fullStr SMCKAT, a Sequential Multi-Dimensional CNV Kernel-Based Association Test
title_full_unstemmed SMCKAT, a Sequential Multi-Dimensional CNV Kernel-Based Association Test
title_short SMCKAT, a Sequential Multi-Dimensional CNV Kernel-Based Association Test
title_sort smckat, a sequential multi-dimensional cnv kernel-based association test
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8709152/
https://www.ncbi.nlm.nih.gov/pubmed/34947833
http://dx.doi.org/10.3390/life11121302
work_keys_str_mv AT mausesfahaninastaran smckatasequentialmultidimensionalcnvkernelbasedassociationtest
AT catchpooledaniel smckatasequentialmultidimensionalcnvkernelbasedassociationtest
AT kennedypaulj smckatasequentialmultidimensionalcnvkernelbasedassociationtest