Cargando…
Identifying rare-variant associations in parent-child trios using a Gaussian support vector machine
As the availability of cost-effective high-throughput sequencing technology increases, genetic research is beginning to focus on identifying the contributions of rare variants (RVs) to complex traits. Using RVs to detect associated genes requires statistical approaches that mitigate the lack of powe...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4143758/ https://www.ncbi.nlm.nih.gov/pubmed/25519420 http://dx.doi.org/10.1186/1753-6561-8-S1-S98 |
_version_ | 1782331954730893312 |
---|---|
author | Lu, Ake T Cantor, Rita M |
author_facet | Lu, Ake T Cantor, Rita M |
author_sort | Lu, Ake T |
collection | PubMed |
description | As the availability of cost-effective high-throughput sequencing technology increases, genetic research is beginning to focus on identifying the contributions of rare variants (RVs) to complex traits. Using RVs to detect associated genes requires statistical approaches that mitigate the lack of power with the analysis of single RVs. Here we report the development and application of an approach that aggregates and evaluates the transmissions of RVs in parent-child trios. An initial score that incorporates the distortion in transmission of the observed RVs from the parents to their offspring is calculated for each variant. The scores are analyzed using a support vector machine that handles these data by mapping the transmission distortion of the multiple RVs into a one-dimensional score in a nonlinear fashion when parent-child trios with affected and nonaffected children are contrasted. We refer to this approach as Trio-SVM. A total of 275 trios were available in the Genetic Analysis Workshop 18 data for analysis. Because of their nonindependence and the extended linkage disequilibrium (LD) within pedigrees, Trio-SVM was vulnerable to type I errors in detecting association. Using the GAW18 data with simulated trait values, Trio-SVM has an appropriate type I error, but it lacks power with a sample of 267 trios. Larger samples of 500 to 1000 trios, derived from combining the simulated data, provided sufficient power. Two chromosome 3 candidate genes were tested in the real GAW18 data with Trio-SVM, and they showed marginal associations with hypertension. |
format | Online Article Text |
id | pubmed-4143758 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-41437582014-09-02 Identifying rare-variant associations in parent-child trios using a Gaussian support vector machine Lu, Ake T Cantor, Rita M BMC Proc Proceedings As the availability of cost-effective high-throughput sequencing technology increases, genetic research is beginning to focus on identifying the contributions of rare variants (RVs) to complex traits. Using RVs to detect associated genes requires statistical approaches that mitigate the lack of power with the analysis of single RVs. Here we report the development and application of an approach that aggregates and evaluates the transmissions of RVs in parent-child trios. An initial score that incorporates the distortion in transmission of the observed RVs from the parents to their offspring is calculated for each variant. The scores are analyzed using a support vector machine that handles these data by mapping the transmission distortion of the multiple RVs into a one-dimensional score in a nonlinear fashion when parent-child trios with affected and nonaffected children are contrasted. We refer to this approach as Trio-SVM. A total of 275 trios were available in the Genetic Analysis Workshop 18 data for analysis. Because of their nonindependence and the extended linkage disequilibrium (LD) within pedigrees, Trio-SVM was vulnerable to type I errors in detecting association. Using the GAW18 data with simulated trait values, Trio-SVM has an appropriate type I error, but it lacks power with a sample of 267 trios. Larger samples of 500 to 1000 trios, derived from combining the simulated data, provided sufficient power. Two chromosome 3 candidate genes were tested in the real GAW18 data with Trio-SVM, and they showed marginal associations with hypertension. BioMed Central 2014-06-17 /pmc/articles/PMC4143758/ /pubmed/25519420 http://dx.doi.org/10.1186/1753-6561-8-S1-S98 Text en Copyright © 2014 Lu and Cantor; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Proceedings Lu, Ake T Cantor, Rita M Identifying rare-variant associations in parent-child trios using a Gaussian support vector machine |
title | Identifying rare-variant associations in parent-child trios using a Gaussian support vector machine |
title_full | Identifying rare-variant associations in parent-child trios using a Gaussian support vector machine |
title_fullStr | Identifying rare-variant associations in parent-child trios using a Gaussian support vector machine |
title_full_unstemmed | Identifying rare-variant associations in parent-child trios using a Gaussian support vector machine |
title_short | Identifying rare-variant associations in parent-child trios using a Gaussian support vector machine |
title_sort | identifying rare-variant associations in parent-child trios using a gaussian support vector machine |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4143758/ https://www.ncbi.nlm.nih.gov/pubmed/25519420 http://dx.doi.org/10.1186/1753-6561-8-S1-S98 |
work_keys_str_mv | AT luaket identifyingrarevariantassociationsinparentchildtriosusingagaussiansupportvectormachine AT cantorritam identifyingrarevariantassociationsinparentchildtriosusingagaussiansupportvectormachine |