Cargando…
On the association analysis of CNV data: a fast and robust family-based association method
BACKGROUND: Copy number variation (CNV) is known to play an important role in the genetics of complex diseases and several methods have been proposed to detect association of CNV with phenotypes of interest. Statistical methods for CNV association analysis can be categorized into two different strat...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5395793/ https://www.ncbi.nlm.nih.gov/pubmed/28420343 http://dx.doi.org/10.1186/s12859-017-1622-z |
_version_ | 1783229937652596736 |
---|---|
author | Liu, Meiling Moon, Sanghoon Wang, Longfei Kim, Sulgi Kim, Yeon-Jung Hwang, Mi Yeong Kim, Young Jin Elston, Robert C. Kim, Bong-Jo Won, Sungho |
author_facet | Liu, Meiling Moon, Sanghoon Wang, Longfei Kim, Sulgi Kim, Yeon-Jung Hwang, Mi Yeong Kim, Young Jin Elston, Robert C. Kim, Bong-Jo Won, Sungho |
author_sort | Liu, Meiling |
collection | PubMed |
description | BACKGROUND: Copy number variation (CNV) is known to play an important role in the genetics of complex diseases and several methods have been proposed to detect association of CNV with phenotypes of interest. Statistical methods for CNV association analysis can be categorized into two different strategies. First, the copy number is estimated by maximum likelihood and association of the expected copy number with the phenotype is tested. Second, the observed probe intensity measurements can be directly used to detect association of CNV with the phenotypes of interest. RESULTS: For each strategy we provide a statistic that can be applied to extended families. The computational efficiency of the proposed methods enables genome-wide association analysis and we show with simulation studies that the proposed methods outperform other existing approaches. In particular, we found that the first strategy is always more efficient than the second strategy no matter whether copy numbers for each individual are well identified or not. With the proposed methods, we performed genome-wide CNV association analyses of hematological trait, hematocrit, on 521 Korean family samples. CONCLUSIONS: We found that statistical analysis with the expected copy number is more powerful than the statistic with the probe intensity measurements regardless of the accuracy of the estimation of copy numbers. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1622-z) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5395793 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-53957932017-04-20 On the association analysis of CNV data: a fast and robust family-based association method Liu, Meiling Moon, Sanghoon Wang, Longfei Kim, Sulgi Kim, Yeon-Jung Hwang, Mi Yeong Kim, Young Jin Elston, Robert C. Kim, Bong-Jo Won, Sungho BMC Bioinformatics Research Article BACKGROUND: Copy number variation (CNV) is known to play an important role in the genetics of complex diseases and several methods have been proposed to detect association of CNV with phenotypes of interest. Statistical methods for CNV association analysis can be categorized into two different strategies. First, the copy number is estimated by maximum likelihood and association of the expected copy number with the phenotype is tested. Second, the observed probe intensity measurements can be directly used to detect association of CNV with the phenotypes of interest. RESULTS: For each strategy we provide a statistic that can be applied to extended families. The computational efficiency of the proposed methods enables genome-wide association analysis and we show with simulation studies that the proposed methods outperform other existing approaches. In particular, we found that the first strategy is always more efficient than the second strategy no matter whether copy numbers for each individual are well identified or not. With the proposed methods, we performed genome-wide CNV association analyses of hematological trait, hematocrit, on 521 Korean family samples. CONCLUSIONS: We found that statistical analysis with the expected copy number is more powerful than the statistic with the probe intensity measurements regardless of the accuracy of the estimation of copy numbers. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1622-z) contains supplementary material, which is available to authorized users. BioMed Central 2017-04-18 /pmc/articles/PMC5395793/ /pubmed/28420343 http://dx.doi.org/10.1186/s12859-017-1622-z Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Liu, Meiling Moon, Sanghoon Wang, Longfei Kim, Sulgi Kim, Yeon-Jung Hwang, Mi Yeong Kim, Young Jin Elston, Robert C. Kim, Bong-Jo Won, Sungho On the association analysis of CNV data: a fast and robust family-based association method |
title | On the association analysis of CNV data: a fast and robust family-based association method |
title_full | On the association analysis of CNV data: a fast and robust family-based association method |
title_fullStr | On the association analysis of CNV data: a fast and robust family-based association method |
title_full_unstemmed | On the association analysis of CNV data: a fast and robust family-based association method |
title_short | On the association analysis of CNV data: a fast and robust family-based association method |
title_sort | on the association analysis of cnv data: a fast and robust family-based association method |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5395793/ https://www.ncbi.nlm.nih.gov/pubmed/28420343 http://dx.doi.org/10.1186/s12859-017-1622-z |
work_keys_str_mv | AT liumeiling ontheassociationanalysisofcnvdataafastandrobustfamilybasedassociationmethod AT moonsanghoon ontheassociationanalysisofcnvdataafastandrobustfamilybasedassociationmethod AT wanglongfei ontheassociationanalysisofcnvdataafastandrobustfamilybasedassociationmethod AT kimsulgi ontheassociationanalysisofcnvdataafastandrobustfamilybasedassociationmethod AT kimyeonjung ontheassociationanalysisofcnvdataafastandrobustfamilybasedassociationmethod AT hwangmiyeong ontheassociationanalysisofcnvdataafastandrobustfamilybasedassociationmethod AT kimyoungjin ontheassociationanalysisofcnvdataafastandrobustfamilybasedassociationmethod AT elstonrobertc ontheassociationanalysisofcnvdataafastandrobustfamilybasedassociationmethod AT kimbongjo ontheassociationanalysisofcnvdataafastandrobustfamilybasedassociationmethod AT wonsungho ontheassociationanalysisofcnvdataafastandrobustfamilybasedassociationmethod |