Cargando…

MCKAT: a multi-dimensional copy number variant kernel association test

BACKGROUND: Copy number variants (CNVs) are the gain or loss of DNA segments in the genome. Studies have shown that CNVs are linked to various disorders, including autism, intellectual disability, and schizophrenia. Consequently, the interest in studying a possible association of CNVs to specific di...

Descripción completa

Detalles Bibliográficos
Autores principales: Maus Esfahani, Nastaran, Catchpoole, Daniel, Khan, Javed, Kennedy, Paul J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8666084/
https://www.ncbi.nlm.nih.gov/pubmed/34895138
http://dx.doi.org/10.1186/s12859-021-04494-w
_version_ 1784614137504989184
author Maus Esfahani, Nastaran
Catchpoole, Daniel
Khan, Javed
Kennedy, Paul J.
author_facet Maus Esfahani, Nastaran
Catchpoole, Daniel
Khan, Javed
Kennedy, Paul J.
author_sort Maus Esfahani, Nastaran
collection PubMed
description BACKGROUND: Copy number variants (CNVs) are the gain or loss of DNA segments in the genome. Studies have shown that CNVs are linked to various disorders, including autism, intellectual disability, and schizophrenia. Consequently, the interest in studying a possible association of CNVs to specific disease traits is growing. However, due to the specific multi-dimensional characteristics of the CNVs, methods for testing the association between CNVs and the disease-related traits are still underdeveloped. We propose a novel multi-dimensional CNV kernel association test (MCKAT) in this paper. We aim to find significant associations between CNVs and disease-related traits using kernel-based methods. RESULTS: We address the multi-dimensionality in CNV characteristics. We first design a single pair CNV kernel, which contains three sub-kernels to summarize the similarity between two CNVs considering all CNV characteristics. Then, aggregate single pair CNV kernel to the whole chromosome CNV kernel, which summarizes the similarity between CNVs in two or more chromosomes. Finally, the association between the CNVs and disease-related traits is evaluated by comparing the similarity in the trait with kernel-based similarity using a score test in a random effect model. We apply MCKAT on genome-wide CNV datasets to examine the association between CNVs and disease-related traits, which demonstrates the potential usefulness the proposed method has for the CNV association tests. We compare the performance of MCKAT with CKAT, a uni-dimensional kernel method. Based on the results, MCKAT indicates stronger evidence, smaller p-value, in detecting significant associations between CNVs and disease-related traits in both rare and common CNV datasets. CONCLUSION: A multi-dimensional copy number variant kernel association test can detect statistically significant associated CNV regions with any disease-related trait. MCKAT can provide biologists with CNV hot spots at the cytogenetic band level that CNVs on them may have a significant association with disease-related traits. Using MCKAT, biologists can narrow their investigation from the whole genome, including many genes and CNVs, to more specific cytogenetic bands that MCKAT identifies. Furthermore, MCKAT can help biologists detect significantly associated CNVs with disease-related traits across a patient group instead of examining each subject’s CNVs case by case.
format Online
Article
Text
id pubmed-8666084
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-86660842021-12-13 MCKAT: a multi-dimensional copy number variant kernel association test Maus Esfahani, Nastaran Catchpoole, Daniel Khan, Javed Kennedy, Paul J. BMC Bioinformatics Methodology Article BACKGROUND: Copy number variants (CNVs) are the gain or loss of DNA segments in the genome. Studies have shown that CNVs are linked to various disorders, including autism, intellectual disability, and schizophrenia. Consequently, the interest in studying a possible association of CNVs to specific disease traits is growing. However, due to the specific multi-dimensional characteristics of the CNVs, methods for testing the association between CNVs and the disease-related traits are still underdeveloped. We propose a novel multi-dimensional CNV kernel association test (MCKAT) in this paper. We aim to find significant associations between CNVs and disease-related traits using kernel-based methods. RESULTS: We address the multi-dimensionality in CNV characteristics. We first design a single pair CNV kernel, which contains three sub-kernels to summarize the similarity between two CNVs considering all CNV characteristics. Then, aggregate single pair CNV kernel to the whole chromosome CNV kernel, which summarizes the similarity between CNVs in two or more chromosomes. Finally, the association between the CNVs and disease-related traits is evaluated by comparing the similarity in the trait with kernel-based similarity using a score test in a random effect model. We apply MCKAT on genome-wide CNV datasets to examine the association between CNVs and disease-related traits, which demonstrates the potential usefulness the proposed method has for the CNV association tests. We compare the performance of MCKAT with CKAT, a uni-dimensional kernel method. Based on the results, MCKAT indicates stronger evidence, smaller p-value, in detecting significant associations between CNVs and disease-related traits in both rare and common CNV datasets. CONCLUSION: A multi-dimensional copy number variant kernel association test can detect statistically significant associated CNV regions with any disease-related trait. MCKAT can provide biologists with CNV hot spots at the cytogenetic band level that CNVs on them may have a significant association with disease-related traits. Using MCKAT, biologists can narrow their investigation from the whole genome, including many genes and CNVs, to more specific cytogenetic bands that MCKAT identifies. Furthermore, MCKAT can help biologists detect significantly associated CNVs with disease-related traits across a patient group instead of examining each subject’s CNVs case by case. BioMed Central 2021-12-11 /pmc/articles/PMC8666084/ /pubmed/34895138 http://dx.doi.org/10.1186/s12859-021-04494-w Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Maus Esfahani, Nastaran
Catchpoole, Daniel
Khan, Javed
Kennedy, Paul J.
MCKAT: a multi-dimensional copy number variant kernel association test
title MCKAT: a multi-dimensional copy number variant kernel association test
title_full MCKAT: a multi-dimensional copy number variant kernel association test
title_fullStr MCKAT: a multi-dimensional copy number variant kernel association test
title_full_unstemmed MCKAT: a multi-dimensional copy number variant kernel association test
title_short MCKAT: a multi-dimensional copy number variant kernel association test
title_sort mckat: a multi-dimensional copy number variant kernel association test
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8666084/
https://www.ncbi.nlm.nih.gov/pubmed/34895138
http://dx.doi.org/10.1186/s12859-021-04494-w
work_keys_str_mv AT mausesfahaninastaran mckatamultidimensionalcopynumbervariantkernelassociationtest
AT catchpooledaniel mckatamultidimensionalcopynumbervariantkernelassociationtest
AT khanjaved mckatamultidimensionalcopynumbervariantkernelassociationtest
AT kennedypaulj mckatamultidimensionalcopynumbervariantkernelassociationtest