Cargando…

Clustering and principal-components approach based on heritability for mapping multiple gene expressions

When the number of phenotypes in a genetic study is on the scale of thousands, such as in studies concerning thousands of gene expression levels, the single-trait analysis is computationally intensive, and heavy adjustment of multiple comparisons is required. Traditional multivariate genetic linkage...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yuanjia, Fang, Yixin, Wang, Shuang
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2367519/
https://www.ncbi.nlm.nih.gov/pubmed/18466463
_version_ 1782154311275380736
author Wang, Yuanjia
Fang, Yixin
Wang, Shuang
author_facet Wang, Yuanjia
Fang, Yixin
Wang, Shuang
author_sort Wang, Yuanjia
collection PubMed
description When the number of phenotypes in a genetic study is on the scale of thousands, such as in studies concerning thousands of gene expression levels, the single-trait analysis is computationally intensive, and heavy adjustment of multiple comparisons is required. Traditional multivariate genetic linkage analysis for quantitative traits focuses on mapping only a few phenotypes and is not feasible for a large number of traits. To cope with high-dimensional phenotype data, clustering analysis and principal-component analysis (PCA) are proposed to reduce the data dimensionality and to map shared genetic contributions for multiple traits. However, standard clustering analysis and PCA are applicable for independent observations. In most genetic studies, where family data are collected, these standard analyses can only be applied to founders and can lead to the loss of information. Here, we proposed a clustering method that can exploit family structure information and applied the method to 29 gene expression levels mapped to a reported hot spot on chromosome 14. We then used a PCA approach based on heritability applicable to small number of traits to combine phenotypes in the clusters. Lastly, we used a penalized PCA approach based on heritability applicable to arbitrary number of traits to combine 150 gene expression levels with the highest heritability. Genome-wide multipoint linkage analysis was carried out on the individual traits and on the combined traits. Two previously reported peaks on chromosomes 14 and 20 were identified. Linkage evidence was stronger for traits derived from methods that incorporate family structure information.
format Text
id pubmed-2367519
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-23675192008-05-06 Clustering and principal-components approach based on heritability for mapping multiple gene expressions Wang, Yuanjia Fang, Yixin Wang, Shuang BMC Proc Proceedings When the number of phenotypes in a genetic study is on the scale of thousands, such as in studies concerning thousands of gene expression levels, the single-trait analysis is computationally intensive, and heavy adjustment of multiple comparisons is required. Traditional multivariate genetic linkage analysis for quantitative traits focuses on mapping only a few phenotypes and is not feasible for a large number of traits. To cope with high-dimensional phenotype data, clustering analysis and principal-component analysis (PCA) are proposed to reduce the data dimensionality and to map shared genetic contributions for multiple traits. However, standard clustering analysis and PCA are applicable for independent observations. In most genetic studies, where family data are collected, these standard analyses can only be applied to founders and can lead to the loss of information. Here, we proposed a clustering method that can exploit family structure information and applied the method to 29 gene expression levels mapped to a reported hot spot on chromosome 14. We then used a PCA approach based on heritability applicable to small number of traits to combine phenotypes in the clusters. Lastly, we used a penalized PCA approach based on heritability applicable to arbitrary number of traits to combine 150 gene expression levels with the highest heritability. Genome-wide multipoint linkage analysis was carried out on the individual traits and on the combined traits. Two previously reported peaks on chromosomes 14 and 20 were identified. Linkage evidence was stronger for traits derived from methods that incorporate family structure information. BioMed Central 2007-12-18 /pmc/articles/PMC2367519/ /pubmed/18466463 Text en Copyright © 2007 Wang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Wang, Yuanjia
Fang, Yixin
Wang, Shuang
Clustering and principal-components approach based on heritability for mapping multiple gene expressions
title Clustering and principal-components approach based on heritability for mapping multiple gene expressions
title_full Clustering and principal-components approach based on heritability for mapping multiple gene expressions
title_fullStr Clustering and principal-components approach based on heritability for mapping multiple gene expressions
title_full_unstemmed Clustering and principal-components approach based on heritability for mapping multiple gene expressions
title_short Clustering and principal-components approach based on heritability for mapping multiple gene expressions
title_sort clustering and principal-components approach based on heritability for mapping multiple gene expressions
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2367519/
https://www.ncbi.nlm.nih.gov/pubmed/18466463
work_keys_str_mv AT wangyuanjia clusteringandprincipalcomponentsapproachbasedonheritabilityformappingmultiplegeneexpressions
AT fangyixin clusteringandprincipalcomponentsapproachbasedonheritabilityformappingmultiplegeneexpressions
AT wangshuang clusteringandprincipalcomponentsapproachbasedonheritabilityformappingmultiplegeneexpressions