Cargando…

On the association analysis of CNV data: a fast and robust family-based association method

BACKGROUND: Copy number variation (CNV) is known to play an important role in the genetics of complex diseases and several methods have been proposed to detect association of CNV with phenotypes of interest. Statistical methods for CNV association analysis can be categorized into two different strat...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Meiling, Moon, Sanghoon, Wang, Longfei, Kim, Sulgi, Kim, Yeon-Jung, Hwang, Mi Yeong, Kim, Young Jin, Elston, Robert C., Kim, Bong-Jo, Won, Sungho
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5395793/
https://www.ncbi.nlm.nih.gov/pubmed/28420343
http://dx.doi.org/10.1186/s12859-017-1622-z
_version_ 1783229937652596736
author Liu, Meiling
Moon, Sanghoon
Wang, Longfei
Kim, Sulgi
Kim, Yeon-Jung
Hwang, Mi Yeong
Kim, Young Jin
Elston, Robert C.
Kim, Bong-Jo
Won, Sungho
author_facet Liu, Meiling
Moon, Sanghoon
Wang, Longfei
Kim, Sulgi
Kim, Yeon-Jung
Hwang, Mi Yeong
Kim, Young Jin
Elston, Robert C.
Kim, Bong-Jo
Won, Sungho
author_sort Liu, Meiling
collection PubMed
description BACKGROUND: Copy number variation (CNV) is known to play an important role in the genetics of complex diseases and several methods have been proposed to detect association of CNV with phenotypes of interest. Statistical methods for CNV association analysis can be categorized into two different strategies. First, the copy number is estimated by maximum likelihood and association of the expected copy number with the phenotype is tested. Second, the observed probe intensity measurements can be directly used to detect association of CNV with the phenotypes of interest. RESULTS: For each strategy we provide a statistic that can be applied to extended families. The computational efficiency of the proposed methods enables genome-wide association analysis and we show with simulation studies that the proposed methods outperform other existing approaches. In particular, we found that the first strategy is always more efficient than the second strategy no matter whether copy numbers for each individual are well identified or not. With the proposed methods, we performed genome-wide CNV association analyses of hematological trait, hematocrit, on 521 Korean family samples. CONCLUSIONS: We found that statistical analysis with the expected copy number is more powerful than the statistic with the probe intensity measurements regardless of the accuracy of the estimation of copy numbers. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1622-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5395793
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-53957932017-04-20 On the association analysis of CNV data: a fast and robust family-based association method Liu, Meiling Moon, Sanghoon Wang, Longfei Kim, Sulgi Kim, Yeon-Jung Hwang, Mi Yeong Kim, Young Jin Elston, Robert C. Kim, Bong-Jo Won, Sungho BMC Bioinformatics Research Article BACKGROUND: Copy number variation (CNV) is known to play an important role in the genetics of complex diseases and several methods have been proposed to detect association of CNV with phenotypes of interest. Statistical methods for CNV association analysis can be categorized into two different strategies. First, the copy number is estimated by maximum likelihood and association of the expected copy number with the phenotype is tested. Second, the observed probe intensity measurements can be directly used to detect association of CNV with the phenotypes of interest. RESULTS: For each strategy we provide a statistic that can be applied to extended families. The computational efficiency of the proposed methods enables genome-wide association analysis and we show with simulation studies that the proposed methods outperform other existing approaches. In particular, we found that the first strategy is always more efficient than the second strategy no matter whether copy numbers for each individual are well identified or not. With the proposed methods, we performed genome-wide CNV association analyses of hematological trait, hematocrit, on 521 Korean family samples. CONCLUSIONS: We found that statistical analysis with the expected copy number is more powerful than the statistic with the probe intensity measurements regardless of the accuracy of the estimation of copy numbers. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1622-z) contains supplementary material, which is available to authorized users. BioMed Central 2017-04-18 /pmc/articles/PMC5395793/ /pubmed/28420343 http://dx.doi.org/10.1186/s12859-017-1622-z Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Liu, Meiling
Moon, Sanghoon
Wang, Longfei
Kim, Sulgi
Kim, Yeon-Jung
Hwang, Mi Yeong
Kim, Young Jin
Elston, Robert C.
Kim, Bong-Jo
Won, Sungho
On the association analysis of CNV data: a fast and robust family-based association method
title On the association analysis of CNV data: a fast and robust family-based association method
title_full On the association analysis of CNV data: a fast and robust family-based association method
title_fullStr On the association analysis of CNV data: a fast and robust family-based association method
title_full_unstemmed On the association analysis of CNV data: a fast and robust family-based association method
title_short On the association analysis of CNV data: a fast and robust family-based association method
title_sort on the association analysis of cnv data: a fast and robust family-based association method
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5395793/
https://www.ncbi.nlm.nih.gov/pubmed/28420343
http://dx.doi.org/10.1186/s12859-017-1622-z
work_keys_str_mv AT liumeiling ontheassociationanalysisofcnvdataafastandrobustfamilybasedassociationmethod
AT moonsanghoon ontheassociationanalysisofcnvdataafastandrobustfamilybasedassociationmethod
AT wanglongfei ontheassociationanalysisofcnvdataafastandrobustfamilybasedassociationmethod
AT kimsulgi ontheassociationanalysisofcnvdataafastandrobustfamilybasedassociationmethod
AT kimyeonjung ontheassociationanalysisofcnvdataafastandrobustfamilybasedassociationmethod
AT hwangmiyeong ontheassociationanalysisofcnvdataafastandrobustfamilybasedassociationmethod
AT kimyoungjin ontheassociationanalysisofcnvdataafastandrobustfamilybasedassociationmethod
AT elstonrobertc ontheassociationanalysisofcnvdataafastandrobustfamilybasedassociationmethod
AT kimbongjo ontheassociationanalysisofcnvdataafastandrobustfamilybasedassociationmethod
AT wonsungho ontheassociationanalysisofcnvdataafastandrobustfamilybasedassociationmethod