Cargando…
Improving stability and understandability of genotype-phenotype mapping in Saccharomyces using regularized variable selection in L-PLS regression
BACKGROUND: Multivariate approaches have been successfully applied to genome wide association studies. Recently, a Partial Least Squares (PLS) based approach was introduced for mapping yeast genotype-phenotype relations, where background information such as gene function classification, gene dispens...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3598729/ https://www.ncbi.nlm.nih.gov/pubmed/23216988 http://dx.doi.org/10.1186/1471-2105-13-327 |
_version_ | 1782262810102726656 |
---|---|
author | Mehmood, Tahir Warringer, Jonas Snipen, Lars Sæbø, Solve |
author_facet | Mehmood, Tahir Warringer, Jonas Snipen, Lars Sæbø, Solve |
author_sort | Mehmood, Tahir |
collection | PubMed |
description | BACKGROUND: Multivariate approaches have been successfully applied to genome wide association studies. Recently, a Partial Least Squares (PLS) based approach was introduced for mapping yeast genotype-phenotype relations, where background information such as gene function classification, gene dispensability, recent or ancient gene copy number variations and the presence of premature stop codons or frameshift mutations in reading frames, were used post hoc to explain selected genes. One of the latest advancement in PLS named L-Partial Least Squares (L-PLS), where ‘L’ presents the used data structure, enables the use of background information at the modeling level. Here, a modification of L-PLS with variable importance on projection (VIP) was implemented using a stepwise regularized procedure for gene and background information selection. Results were compared to PLS-based procedures, where no background information was used. RESULTS: Applying the proposed methodology to yeast Saccharomyces cerevisiae data, we found the relationship between genotype-phenotype to have improved understandability. Phenotypic variations were explained by the variations of relatively stable genes and stable background variations. The suggested procedure provides an automatic way for genotype-phenotype mapping. The selected phenotype influencing genes were evolving 29% faster than non-influential genes, and the current results are supported by a recently conducted study. Further power analysis on simulated data verified that the proposed methodology selects relevant variables. CONCLUSIONS: A modification of L-PLS with VIP in a stepwise regularized elimination procedure can improve the understandability and stability of selected genes and background information. The approach is recommended for genome wide association studies where background information is available. |
format | Online Article Text |
id | pubmed-3598729 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-35987292013-03-26 Improving stability and understandability of genotype-phenotype mapping in Saccharomyces using regularized variable selection in L-PLS regression Mehmood, Tahir Warringer, Jonas Snipen, Lars Sæbø, Solve BMC Bioinformatics Research Article BACKGROUND: Multivariate approaches have been successfully applied to genome wide association studies. Recently, a Partial Least Squares (PLS) based approach was introduced for mapping yeast genotype-phenotype relations, where background information such as gene function classification, gene dispensability, recent or ancient gene copy number variations and the presence of premature stop codons or frameshift mutations in reading frames, were used post hoc to explain selected genes. One of the latest advancement in PLS named L-Partial Least Squares (L-PLS), where ‘L’ presents the used data structure, enables the use of background information at the modeling level. Here, a modification of L-PLS with variable importance on projection (VIP) was implemented using a stepwise regularized procedure for gene and background information selection. Results were compared to PLS-based procedures, where no background information was used. RESULTS: Applying the proposed methodology to yeast Saccharomyces cerevisiae data, we found the relationship between genotype-phenotype to have improved understandability. Phenotypic variations were explained by the variations of relatively stable genes and stable background variations. The suggested procedure provides an automatic way for genotype-phenotype mapping. The selected phenotype influencing genes were evolving 29% faster than non-influential genes, and the current results are supported by a recently conducted study. Further power analysis on simulated data verified that the proposed methodology selects relevant variables. CONCLUSIONS: A modification of L-PLS with VIP in a stepwise regularized elimination procedure can improve the understandability and stability of selected genes and background information. The approach is recommended for genome wide association studies where background information is available. BioMed Central 2012-12-08 /pmc/articles/PMC3598729/ /pubmed/23216988 http://dx.doi.org/10.1186/1471-2105-13-327 Text en Copyright ©2012 Mehmood et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Mehmood, Tahir Warringer, Jonas Snipen, Lars Sæbø, Solve Improving stability and understandability of genotype-phenotype mapping in Saccharomyces using regularized variable selection in L-PLS regression |
title | Improving stability and understandability of genotype-phenotype mapping in Saccharomyces using regularized variable selection in L-PLS regression |
title_full | Improving stability and understandability of genotype-phenotype mapping in Saccharomyces using regularized variable selection in L-PLS regression |
title_fullStr | Improving stability and understandability of genotype-phenotype mapping in Saccharomyces using regularized variable selection in L-PLS regression |
title_full_unstemmed | Improving stability and understandability of genotype-phenotype mapping in Saccharomyces using regularized variable selection in L-PLS regression |
title_short | Improving stability and understandability of genotype-phenotype mapping in Saccharomyces using regularized variable selection in L-PLS regression |
title_sort | improving stability and understandability of genotype-phenotype mapping in saccharomyces using regularized variable selection in l-pls regression |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3598729/ https://www.ncbi.nlm.nih.gov/pubmed/23216988 http://dx.doi.org/10.1186/1471-2105-13-327 |
work_keys_str_mv | AT mehmoodtahir improvingstabilityandunderstandabilityofgenotypephenotypemappinginsaccharomycesusingregularizedvariableselectioninlplsregression AT warringerjonas improvingstabilityandunderstandabilityofgenotypephenotypemappinginsaccharomycesusingregularizedvariableselectioninlplsregression AT snipenlars improvingstabilityandunderstandabilityofgenotypephenotypemappinginsaccharomycesusingregularizedvariableselectioninlplsregression AT sæbøsolve improvingstabilityandunderstandabilityofgenotypephenotypemappinginsaccharomycesusingregularizedvariableselectioninlplsregression |