Cargando…

An Integrative Co-localization (INCO) Analysis for SNV and CNV Genomic Features With an Application to Taiwan Biobank Data

Genomic studies have been a major approach to elucidating disease etiology and to exploring potential targets for treatments of many complex diseases. Statistical analyses in these studies often face the challenges of multiplicity, weak signals, and the nature of dependence among genetic markers. Th...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Qi-You, Lu, Tzu-Pin, Hsiao, Tzu-Hung, Lin, Ching-Heng, Wu, Chi-Yun, Tzeng, Jung-Ying, Hsiao, Chuhsing Kate
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8456116/
https://www.ncbi.nlm.nih.gov/pubmed/34567069
http://dx.doi.org/10.3389/fgene.2021.709555
_version_ 1784570812716548096
author Yu, Qi-You
Lu, Tzu-Pin
Hsiao, Tzu-Hung
Lin, Ching-Heng
Wu, Chi-Yun
Tzeng, Jung-Ying
Hsiao, Chuhsing Kate
author_facet Yu, Qi-You
Lu, Tzu-Pin
Hsiao, Tzu-Hung
Lin, Ching-Heng
Wu, Chi-Yun
Tzeng, Jung-Ying
Hsiao, Chuhsing Kate
author_sort Yu, Qi-You
collection PubMed
description Genomic studies have been a major approach to elucidating disease etiology and to exploring potential targets for treatments of many complex diseases. Statistical analyses in these studies often face the challenges of multiplicity, weak signals, and the nature of dependence among genetic markers. This situation becomes even more complicated when multi-omics data are available. To integrate the data from different platforms, various integrative analyses have been adopted, ranging from the direct union or intersection operation on sets derived from different single-platform analysis to complex hierarchical multi-level models. The former ignores the biological relationship between molecules while the latter can be hard to interpret. We propose in this study an integrative approach that combines both single nucleotide variants (SNVs) and copy number variations (CNVs) in the same genomic unit to co-localize the concurrent effect and to deal with the sparsity due to rare variants. This approach is illustrated with simulation studies to evaluate its performance and is applied to low-density lipoprotein cholesterol and triglyceride measurements from Taiwan Biobank. The results show that the proposed method can more effectively detect the collective effect from both SNVs and CNVs compared to traditional methods. For the biobank analysis, the identified genetic regions including the gene VNN2 could be novel and deserve further investigation.
format Online
Article
Text
id pubmed-8456116
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-84561162021-09-23 An Integrative Co-localization (INCO) Analysis for SNV and CNV Genomic Features With an Application to Taiwan Biobank Data Yu, Qi-You Lu, Tzu-Pin Hsiao, Tzu-Hung Lin, Ching-Heng Wu, Chi-Yun Tzeng, Jung-Ying Hsiao, Chuhsing Kate Front Genet Genetics Genomic studies have been a major approach to elucidating disease etiology and to exploring potential targets for treatments of many complex diseases. Statistical analyses in these studies often face the challenges of multiplicity, weak signals, and the nature of dependence among genetic markers. This situation becomes even more complicated when multi-omics data are available. To integrate the data from different platforms, various integrative analyses have been adopted, ranging from the direct union or intersection operation on sets derived from different single-platform analysis to complex hierarchical multi-level models. The former ignores the biological relationship between molecules while the latter can be hard to interpret. We propose in this study an integrative approach that combines both single nucleotide variants (SNVs) and copy number variations (CNVs) in the same genomic unit to co-localize the concurrent effect and to deal with the sparsity due to rare variants. This approach is illustrated with simulation studies to evaluate its performance and is applied to low-density lipoprotein cholesterol and triglyceride measurements from Taiwan Biobank. The results show that the proposed method can more effectively detect the collective effect from both SNVs and CNVs compared to traditional methods. For the biobank analysis, the identified genetic regions including the gene VNN2 could be novel and deserve further investigation. Frontiers Media S.A. 2021-09-08 /pmc/articles/PMC8456116/ /pubmed/34567069 http://dx.doi.org/10.3389/fgene.2021.709555 Text en Copyright © 2021 Yu, Lu, Hsiao, Lin, Wu, Tzeng and Hsiao. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Yu, Qi-You
Lu, Tzu-Pin
Hsiao, Tzu-Hung
Lin, Ching-Heng
Wu, Chi-Yun
Tzeng, Jung-Ying
Hsiao, Chuhsing Kate
An Integrative Co-localization (INCO) Analysis for SNV and CNV Genomic Features With an Application to Taiwan Biobank Data
title An Integrative Co-localization (INCO) Analysis for SNV and CNV Genomic Features With an Application to Taiwan Biobank Data
title_full An Integrative Co-localization (INCO) Analysis for SNV and CNV Genomic Features With an Application to Taiwan Biobank Data
title_fullStr An Integrative Co-localization (INCO) Analysis for SNV and CNV Genomic Features With an Application to Taiwan Biobank Data
title_full_unstemmed An Integrative Co-localization (INCO) Analysis for SNV and CNV Genomic Features With an Application to Taiwan Biobank Data
title_short An Integrative Co-localization (INCO) Analysis for SNV and CNV Genomic Features With an Application to Taiwan Biobank Data
title_sort integrative co-localization (inco) analysis for snv and cnv genomic features with an application to taiwan biobank data
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8456116/
https://www.ncbi.nlm.nih.gov/pubmed/34567069
http://dx.doi.org/10.3389/fgene.2021.709555
work_keys_str_mv AT yuqiyou anintegrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata
AT lutzupin anintegrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata
AT hsiaotzuhung anintegrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata
AT linchingheng anintegrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata
AT wuchiyun anintegrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata
AT tzengjungying anintegrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata
AT hsiaochuhsingkate anintegrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata
AT yuqiyou integrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata
AT lutzupin integrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata
AT hsiaotzuhung integrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata
AT linchingheng integrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata
AT wuchiyun integrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata
AT tzengjungying integrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata
AT hsiaochuhsingkate integrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata