Cargando…
An Integrative Co-localization (INCO) Analysis for SNV and CNV Genomic Features With an Application to Taiwan Biobank Data
Genomic studies have been a major approach to elucidating disease etiology and to exploring potential targets for treatments of many complex diseases. Statistical analyses in these studies often face the challenges of multiplicity, weak signals, and the nature of dependence among genetic markers. Th...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8456116/ https://www.ncbi.nlm.nih.gov/pubmed/34567069 http://dx.doi.org/10.3389/fgene.2021.709555 |
_version_ | 1784570812716548096 |
---|---|
author | Yu, Qi-You Lu, Tzu-Pin Hsiao, Tzu-Hung Lin, Ching-Heng Wu, Chi-Yun Tzeng, Jung-Ying Hsiao, Chuhsing Kate |
author_facet | Yu, Qi-You Lu, Tzu-Pin Hsiao, Tzu-Hung Lin, Ching-Heng Wu, Chi-Yun Tzeng, Jung-Ying Hsiao, Chuhsing Kate |
author_sort | Yu, Qi-You |
collection | PubMed |
description | Genomic studies have been a major approach to elucidating disease etiology and to exploring potential targets for treatments of many complex diseases. Statistical analyses in these studies often face the challenges of multiplicity, weak signals, and the nature of dependence among genetic markers. This situation becomes even more complicated when multi-omics data are available. To integrate the data from different platforms, various integrative analyses have been adopted, ranging from the direct union or intersection operation on sets derived from different single-platform analysis to complex hierarchical multi-level models. The former ignores the biological relationship between molecules while the latter can be hard to interpret. We propose in this study an integrative approach that combines both single nucleotide variants (SNVs) and copy number variations (CNVs) in the same genomic unit to co-localize the concurrent effect and to deal with the sparsity due to rare variants. This approach is illustrated with simulation studies to evaluate its performance and is applied to low-density lipoprotein cholesterol and triglyceride measurements from Taiwan Biobank. The results show that the proposed method can more effectively detect the collective effect from both SNVs and CNVs compared to traditional methods. For the biobank analysis, the identified genetic regions including the gene VNN2 could be novel and deserve further investigation. |
format | Online Article Text |
id | pubmed-8456116 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-84561162021-09-23 An Integrative Co-localization (INCO) Analysis for SNV and CNV Genomic Features With an Application to Taiwan Biobank Data Yu, Qi-You Lu, Tzu-Pin Hsiao, Tzu-Hung Lin, Ching-Heng Wu, Chi-Yun Tzeng, Jung-Ying Hsiao, Chuhsing Kate Front Genet Genetics Genomic studies have been a major approach to elucidating disease etiology and to exploring potential targets for treatments of many complex diseases. Statistical analyses in these studies often face the challenges of multiplicity, weak signals, and the nature of dependence among genetic markers. This situation becomes even more complicated when multi-omics data are available. To integrate the data from different platforms, various integrative analyses have been adopted, ranging from the direct union or intersection operation on sets derived from different single-platform analysis to complex hierarchical multi-level models. The former ignores the biological relationship between molecules while the latter can be hard to interpret. We propose in this study an integrative approach that combines both single nucleotide variants (SNVs) and copy number variations (CNVs) in the same genomic unit to co-localize the concurrent effect and to deal with the sparsity due to rare variants. This approach is illustrated with simulation studies to evaluate its performance and is applied to low-density lipoprotein cholesterol and triglyceride measurements from Taiwan Biobank. The results show that the proposed method can more effectively detect the collective effect from both SNVs and CNVs compared to traditional methods. For the biobank analysis, the identified genetic regions including the gene VNN2 could be novel and deserve further investigation. Frontiers Media S.A. 2021-09-08 /pmc/articles/PMC8456116/ /pubmed/34567069 http://dx.doi.org/10.3389/fgene.2021.709555 Text en Copyright © 2021 Yu, Lu, Hsiao, Lin, Wu, Tzeng and Hsiao. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Yu, Qi-You Lu, Tzu-Pin Hsiao, Tzu-Hung Lin, Ching-Heng Wu, Chi-Yun Tzeng, Jung-Ying Hsiao, Chuhsing Kate An Integrative Co-localization (INCO) Analysis for SNV and CNV Genomic Features With an Application to Taiwan Biobank Data |
title | An Integrative Co-localization (INCO) Analysis for SNV and CNV Genomic Features With an Application to Taiwan Biobank Data |
title_full | An Integrative Co-localization (INCO) Analysis for SNV and CNV Genomic Features With an Application to Taiwan Biobank Data |
title_fullStr | An Integrative Co-localization (INCO) Analysis for SNV and CNV Genomic Features With an Application to Taiwan Biobank Data |
title_full_unstemmed | An Integrative Co-localization (INCO) Analysis for SNV and CNV Genomic Features With an Application to Taiwan Biobank Data |
title_short | An Integrative Co-localization (INCO) Analysis for SNV and CNV Genomic Features With an Application to Taiwan Biobank Data |
title_sort | integrative co-localization (inco) analysis for snv and cnv genomic features with an application to taiwan biobank data |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8456116/ https://www.ncbi.nlm.nih.gov/pubmed/34567069 http://dx.doi.org/10.3389/fgene.2021.709555 |
work_keys_str_mv | AT yuqiyou anintegrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata AT lutzupin anintegrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata AT hsiaotzuhung anintegrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata AT linchingheng anintegrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata AT wuchiyun anintegrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata AT tzengjungying anintegrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata AT hsiaochuhsingkate anintegrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata AT yuqiyou integrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata AT lutzupin integrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata AT hsiaotzuhung integrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata AT linchingheng integrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata AT wuchiyun integrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata AT tzengjungying integrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata AT hsiaochuhsingkate integrativecolocalizationincoanalysisforsnvandcnvgenomicfeatureswithanapplicationtotaiwanbiobankdata |